Marketo

Marketo is a marketing automation platform used by B2B and B2C companies to manage and deliver personalized multi-channel programs and campaigns to prospects and customers. Marketo enables companies to curate their raw user data and create programs and focussed campaigns for different marketing activities, from lead generation to marketing ROI measurement, across multiple channels.

With the help of Pipelines in Hevo, you can synchronize Marketo with a database or data warehouse Destination to always have access to the latest data, which you can feed into your enterprise BI solution for custom reporting and analysis. Hevo Pipelines use Marketo’s bulk (preferred) and REST APIs to fetch both historical and changed data, which you can replicate to the Destination after performing any necessary transformations on it.

Marketo API Limits

Marketo imposes strict limits on the API calls you can make within a given time frame to retrieve the data. Some of these include:

  • Rate Limit: Maximum of 100 API calls per 20 seconds per instance.
  • Concurrency Limit: Maximum of 10 concurrent API calls.
  • Daily Quota: Up to 50,000 API calls per day with a max export of 500 MB for bulk jobs for paid subscription (quota resets daily at 12:00 midnight CST).

Read more about Marketo API limits.

Hevo Pipelines overcome these limitations by using Bulk APIs to fetch the data.

Data Ingestion using Bulk APIs

The Hevo connector uses Bulk APIs by default for all Marketo objects that allow this, namely, Program members, Activities and Leads. As a result, Hevo can minimize the number of API calls, while maximising the number of Events fetched per API call, and thereby, help you to remain within the imposed limits to the extent possible. This becomes specially useful while retrieving historical data.

Bulk APIs in Marketo uses the same permissions as the REST APIs, therefore, the job or API type that is running is transparent to you except for the API endpoint.

Compared to a REST API, for a bulk extract:

  1. The Hevo connector submits the job for the data you need, with the required metadata, to Marketo.

  2. Marketo queues and runs the job.

  3. Hevo queries for the status intermittently and when the job completes, Hevo makes one single call to fetch the data, extract the input stream, and process the records in order.

See Appendix 1 - Destination Tables for the list of Marketo objects that allow bulk operations.

Data Replication Method and Schedule

The Hevo connector runs the following jobs to handle historical and incremental data extraction from Marketo:

Historical jobs start from 12.00 midnight (12.00 a.m.) UTC of the current day and fetch data for the past six months, a month at a time, in reverse chronology.

Incremental jobs run at the scheduled interval of the Pipeline to fetch new Events.

Refresher jobs refresh the data for the past six months on every run.

Connector-specific jobs:

  • The Lead Soft Deletes job fetches any activity that indicates that a lead has been deleted, and marks that lead as deleted in the Destination.

Refer to the table below for the list of jobs of each type.

Job Type Job Names Schedule Additional Information
Historical Activities Historical Load During the first run of the Pipeline  
Incremental Activities, Activity Types, Campaigns, Leads, Programs During each run of the Pipeline Campaigns job fetches data for the past six months on the first run.
Connector-specific Lead Soft Deletes Starts after the first run of the Leads Refresher job, and subsequently as per the Pipeline schedule  
Refresher Lead Refresher Every 24 hours post-completion Ensures that leads, opportunity, opportunity roles, and salespersons data is up to date.

Note: The time taken for the Historical data load is determined by the amount of data and processing time for the jobs submitted to Marketo.

Configuring Marketo as a Source

To configure Marketo as a Source:

  1. Obtain authenticated access credentials for your Marketo instance.

    Note: While creating a new role, you must either individually select all “Read-Only” privileges or provide all Access-API privileges.

  2. Click on Pipelines in the left navigation bar, and then, CREATE PIPELINE.

  3. In the Select Source Type page, select Marketo as the Source.

  4. In the Configure Your Source page, provide the following information, including the access credentials obtained in Step 1:

    • Pipeline Name: A name for your Pipeline.

    • Client ID - Available at the newly created service.

    • Client Secret - Available at the newly created service.

    • Endpoint - The base URL used to make all the API calls.

      Example: https://469-HOO-201.mktorest.com/rest

    • Identity Endpoint - The endpoint used to retrieve access tokens using the Client ID and Client Secret.

      Example: https://469-HOO-201.mktorest.com/identity

  5. Click Test and Continue. If the credentials provided are correct, the Marketo Source is configured and you can proceed to setting up the Destination.

Note: If IP restriction is enabled in your Marketo account, you must either whitelist Hevo IPs or disable IP restriction to allow Hevo to make API calls.

Data Model

Hevo uses the following schema to upload the records in the Destination:

marketo-er-diagram

Refer to Appendix 1 - Destination Tables for the list of tables created at the destination.

If you have selected AutoMapping, Hevo creates the tables in the Destination automatically. Else, you must manually create and map the tables. See sample image here:

Marketo Tables automapped to Destination Tables

Appendix 1 - Destination Tables

The following is the list of tables (objects) and their primary keys that are created at the Destination when you run the Pipeline.

Linked objects: Wherever there is a dependency between objects, the job is created for only the higher-level object. The same job fetches the data for the linked object also. However, you can see separate tables in Schema Mapper for each such linked object. For example, when you fetch Campaigns data, Smart List data is automatically fetched. So, while you see the Smart List table in Schema Mapper, there is no job created by this name.

Note: The table names are written in small case, except for the Snowflake data warehouse tables which are written in uppercase.

Table Primary Key Parent Object Bulk API
activity_types
(Custom activity types as well as Marketo-provided activity types are loaded into the same table.)
ID NA NA
campaigns ID NA NA
lead_activities marketoGUID NA Bulk activity extract
leads leadId NA Bulk lead extract
opportunity marketoGUID lead_activities NA
ppportunity_role marketoGUID lead_activities NA
program ID NA NA
program_members ID program Bulk program member extract
sales_persons ID lead_activities NA
smart_list ID campaigns NA