Glossary of Terms
On This Page
The interaction of a user on ad-based Sources, such as a click on an advertisement or a video view.
A collection of one or more ads where details regarding a Snapchat campaign such as budget, audience, and schedule are defined.
A change in the Destination data type to accommodate variations in the Source data type to avoid sidelining of Events for data type mismatches.
Application Programming Interface (API)
An interface that allows two applications to communicate and exchange data. This exchange of data happens through API Endpoints, which act as touchpoints for the APIs to send and receive requests for data. For example, the Google Maps API allows apps and websites to access a variety of Google’s map functions by using the various URLs the API provides for different actions such as:
Display a map — launch Google Maps with no markers or directions:
Assertion Consumer Service URL
The URL where the identity provider (IdP or IDP) sends SAML responses. An IdP is a service that stores and manages digital identities. For example, you can use https://auth.hevodata.com/auth/saml/assertion for Hevo to receive SAML responses from your IdP when setting up Hevo as an application in your SAML-enabled IdP.
A feature in Hevo that automatically maps new Event Types and their fields to the existing Event Types and their fields in the Destination. Read Auto Mapping Event Types.
When the attributes associated with an action are traced to an Event in the past. For example, the conversion on New Year’s Eve was back-dated to the Christmas Day sale.
A set of log files in MySQL that record every statement that is executed (adding, deleting, updating records or tables). The log files have a retention period after which they are deleted by the MySQL server. BinLog, short for Binary Log, can be selected as a Pipeline mode for MySQL Sources.
Any key or group of keys that uniquely identifies the rows in a table. The candidate key defines which keys can form the primary key for that table. A table can have multiple candidate keys.
Single key: The teacher’s employee ID is a candidate key for the faculty table in the students’ database.
Group of keys: The teacher’s name and the subject taught by the teacher together form a candidate key for the subjects table in the students’ database.
A collection refers to a logical grouping of data within databases like MongoDB or Amazon DocumentDB. It acts as a container that holds multiple documents, providing an organizational structure for storing and managing data.
An index created on multiple columns in a table. For example, consider the following SQL statement:
CREATE INDEX subject_key on SUBJECTS (teacher_name, subject_name);
Here, subject_key is a combined index created on two columns teacher_name and subject_name from the subjects table in the students database.
A synonym for combined index. See Combined Index.
A type of primary key that comprises two or more columns to uniquely identify a record in a database table. For example, a student’s enrollment ID and name in the school’s database.
A general term for a system that Hevo integrates with another service/software. In Hevo, connectors are Sources, for example, MySQL or Google Ads.
The process of an interaction of a user leading to a subsequent purchase or sign-up.
The form that data exists in. Each field or value has a data type, for example, document, scalar, set, string, or integer.
Fields that can be used to uniquely identify records in Marketo’s standard as well as custom objects. There can be up to three de-dupe fields. In some Activate Targets, a combination of these fields is used as an identifier.
A function used in Hevo search to display suggestions only after a short delay. Instead of showing suggestions each time you type a letter in the search bar, it waits until you have finished or paused typing to show the result, thus improving the user experience.
The act of removing duplicates from the replicated data.
Guarantees the number of times the Events are loaded to the Destination without any data loss. It can be of three types:
At-least-once delivery semantics: This ensures that the Events are loaded to the Destination at least once. In case of any failures, Hevo retries to replicate these Events until they are successful.
Exactly-once delivery semantics: This ensures that the Events are loaded to the Destination exactly once, without any duplicates, even if there are any failures during the process.
At-most-once delivery semantics: This ensures that the Events are loaded to the Destination at most once. In case of any failures, the data is not replicated again, leading to data loss.
A database, file system, or data warehouse that acts as the endpoint of a data Pipeline, and into which, the data from the Source is finally loaded.
A document is a JSON-like data structure consisting of key-value pairs. A key is a string that represents a field name, and a value can be a string, a number, a boolean, an array, or even nested documents.
A type of non-relational database designed to store and query data as JSON-like documents. A document is a record in a Document Database.
When you switch to a subscription plan that has fewer features than your current plan. For example, you downgrade if you switch from a business plan to a starter plan.
The act of deleting all the data from a table (in the Destination system). This is a permanent action. Hevo does not delete the table itself; users have an option to do so if required.
Enum Data Type
In MySQL, enum data type is a string object. An enum (enumerated) column can only contain values from a list of pre-defined values specified during table creation.
For example, consider the following table definition:
CREATE TABLE ribbons ( material VARCHAR(40), colors ENUM('red', 'pink', 'blue', 'green', 'black') );
Here the column,
colors can only contain the values red, pink, blue, green, or black.
A fundamental unit of data that represents the creation, update, or deletion of information in the Source and can be replicated to a Destination system. For example, a new document in a MongoDB collection, or updated contact details in the Contacts table in MySQL.
Groups of Events created based on the entity they are ingested from at the Source. For example, a table Customers in your Shopify data can be considered an Event Type, and all Events ingested from this table fall under this type.
The level of detail in a data set. For example, Daily is the lowest time granularity that you can have on Bing Ads API.
The data present in the Source application or database before the first ingestion post-Pipeline creation. For some Sources, you can specify the duration for which this existing data must be ingested. However, for other Sources, Hevo may ingest the data either entirely or for a predetermined duration, depending on the Source setup.
The data that cannot be modified after creation. For example, the values assigned to a string in Java.
The new or modified data in the Source post-Pipeline creation.
Special lookup tables that the database search engine can use to speed up data retrieval. An index is like a pointer to the data in a table.
The act of retrieving or fetching data from the Source.
The data stored in a database at a particular moment in time.
Java Database Connectivity (JDBC)
An SQL-based API created by Sun Microsystems to enable Java applications to use SQL for database access. A JDBC Source is one that supports JDBC connectivity. The JDBC driver for that data source and the URL format is required to set up the connection.
A single column or a group of columns that can uniquely identify rows (or tuples) in a table.
The time it takes to load the data into the Destination once it is ingested from the Source. Also called data latency or end-to-end latency.
The process of replicating data change in a database as per a primary key. Logical replication can be used to perform actions such as, consolidating multiple databases, sending triggers for incremental and individual data change, or sharing a subset of the database between multiple databases.
A type of field that contains a value retrieved from another object or field. The lookup field can be used to search and associate records between two objects that share a lookup or master-detail relationship. For example, in Salesforce, a user can associate a contact record to an account record using the Account Name lookup field.
A data repository that contains the server-level settings such as the structure and primary configuration settings of all the databases, and the server-level metadata such as the user authorizations.
A column that contains information about the data present in a row. For example,
__hevo_database_name is a metadata column created by Hevo to indicate the name of the database from which a particular row is ingested.
Read Hevo-generated Metadata to know about the metadata columns created by Hevo.
A measure for insights such as clicks, engagement, likes, impressions, followers for your ad campaigns, social media posts or pages.
The data that can be modified after creation. Everything except a String type value is mutable by default in Java. For example, the elements in an array can be assigned new values after initialization.
Table columns that do not accept NULL values. This forces a field to always contain a value, which means that you cannot insert a new record or update an existing record, if this column field does not contain a value.
An open standard for authorization that allows applications or websites limited access to resources hosted by other apps and websites on behalf of the user without sharing the user’s password.
The entities that Hevo ingests from the Source based on the type of Source. For example, collections from MongoDB or reports from Twitter ads.
A value in the latest record fetched. Offset is used to identify the starting point for the next set of results to return.
A collection present in MongoDB that keeps a record of all the operations that modify the data stored in the database.
A process in which when multiple tasks run at the same time. For example, an application can split its tasks up into smaller subtasks that can be processed in parallel, for instance on multiple CPUs at the same time.
The essential information sent or received with the HTTP methods such as GET or POST, in an API.
For example, in the following JSON response:
The payload is the message, ‘Welcome, world!”.
A framework in Pipedrive that allows you to manage your sales, and keep track of them using multiple stages, from lead arrival, to deal closure. The admin of a Pipedrive account can add these stages, and the team members who you want to allow access to these stages according to their requirements.
A pre-defined framework of user-configured processes in Hevo that move data from one system to another, typically with Transformations that make it easier to analyze the data.
The frequency at which a Pipeline ingests data from its Source. Read Scheduling a Pipeline.
The process by which the data is read from the Source periodically, depending on a set schedule.
A primary key is a non-null candidate key selected to uniquely identify every row in that table. A primary key is a candidate key but the reverse may not be true. For example, the roll number of a student in the students’ marks database uniquely identifies a student.
The server that manages the primary copy of the database, and is responsible for receiving read and write operations directly from clients and executing them against the database. It also sends data to its replica servers.
Primitive Data Type
A data type pre-defined by the programming language. For example, int, long, float, and double in Java.
An app built by a company in Intercom for its internal use; not listed on the Intercom App Store. For example, ChargeDesk built a private Messenger app for their customer support team to easily share invoices with their customers.
An intermediary server that allows multiple clients to route traffic to an external network. A proxy server protects the identity of the connecting client. For example, an organization may have a proxy server set up to route and filter employee traffic to the public Internet.
The act of deletion of a BinLog file, an Event, an account, a table, and data in general.
The process by which Hevo acts as a receiver and the Source holds the responsibility to send/post data to Hevo. This applies to webhook-based Sources.
Query Batch Size
The number of rows or records returned by a SELECT query. The SQL keyword LIMIT is used to define the batch size. For example, consider the following SQL statement:
SELECT id from employees LIMIT 50;
Here, the query batch size is 50. So, the SELECT query returns only 50 records from the employees table.
Values used to specify the minimum and maximum limit within which a field’s value may lie.
The limits imposed by an API vendor, such as Intercom, HubSpot, on the number of API requests sent to their public APIs. Rate limits only apply on calls to the REST API, applications using OAuth, and Integrations using API keys.
A user who is only allowed to read or access all data from the Source, and not make changes to it.
A copy of the master database that allows read-only requests. They help in disaster recovery and in reducing the traffic or load on the master database by serving the read requests.
The collective process of ingesting data from a Source application, running Transformations on it, if any is needed, and subsequently loading it to the Destination database or data warehouse.
The special keywords used by SQL relational databases to perform various operations, such as, SELECT and COMMIT. It is not recommended to use these keywords as names for databases, tables, columns, variables, or other objects.
Reverse Proxy Server
A type of proxy server that acts as a gateway between clients, users, and application servers. All incoming client requests are routed via this server to the appropriate server inside the private network. A reverse proxy server protects the identity of the server processing the client request. For example, network administrators can configure the reverse proxy server to whitelist or blacklist specific IP addresses.
RSA key pair
The key pair consisting of a private key and a public key which are used by the RSA algorithm for data encryption. The public key is used for encryption, whereas the private key, which is only available to the user is used for decryption.
SaaS (Software as a Service)
A software distribution model where the application is hosted by a company on its servers and is accessed by clients via the internet by paying a subscription fee. For example, Salesforce.
Security Assertion Markup Language (SAML)
A protocol for authenticating web applications. SAML provides a way to authenticate users to third-party web applications such as Gmail for Business, Office 365, and Salesforce.
A simple, primitive data type with values like a number or text. For example, int, bool and character in Java.
The organization of data within an event type which comprises of details such as its name, fields within the event type, and the data types of these fields. For example, the schema of the employees event type may contain fields such as name and ID with datatypes varchar(25) and int respectively.
A mapping between fields in the Source schema and the fields of the Destination Schema. It determines the columns in the Destination table into which the incoming data of each Source field must be loaded.
Secure Shell (SSH)
A network protocol that is used to create a secure channel over a network between a client and server application to transfer data.
A type of reporting applicable to ad-based Sources in which data is retrieved based on metrics segmented by a particular target type. For example, a report segmented by location.
A fully-managed platform that provides compute, storage, and network resources on which developers can build and run their applications without having to provision or manage the underlying infrastructure.
A Google account associated with a team rather than an individual user. It is user-independent and requires a Key to authenticate the connection. Service accounts are used when workloads are run on virtual machines.
Service Account Key
A public and private RSA key pair which a Google service account uses to securely authenticate the connection.
An interface that implements the mathematical set. A set contains no duplicate elements.
When Events are not loaded in the Destination table due to mismatching of data type with the Source table for the same column.
Signed Data Type
A numeric data type that can hold both negative and positive integers.
A synonym for Destination. See Destinations.
The application or database from which Hevo ingests the data. For example, GitHub, Facebook Pages, and MySQL.
The geographical region in which your account is created and its data stored. For example, the subscription region of your Pendo account can be the EU or US region.
An abbreviated term used to mean synchronization. The terms sync and synchronization are used interchangeably in Hevo’s documentation.
Any CRM application to which data may be loaded using Hevo Activate. For example, HubSpot, Salesforce.
The table in your CRM applications such as Salesforce and HubSpot with which data is synchronized by an Activation.
The process of changing the format and structure of data. Read Transformations.
Removes all rows from a table while preserving the table structure, such as its columns, constraints, and indexes. It frees up the storage space used by the table data, making it a faster operation than DELETE, especially for large tables.
For example, consider the following SQL statement:
TRUNCATE TABLE sales_leads;
Here, the data in the sales_leads table is deleted, but the columns and other characteristics are maintained.
Note: This is a non-recoverable operation, which means once the data is truncated, it cannot be recovered.
A field that has not been mapped from the Source to the Destination. The Events related to these fields are marked as Failed until you map the fields to the appropriate Destination table fields or skip them. Read Auto Mapping Event Types.
Unsigned Data Type
A numeric data type that can hold only positive integers or zero.
Universally Unique Identifier (UUID)
A data type used to uniquely identify records.
The UUID is written as a sequence of lowercase hexadecimal digits of 128 bits. For example, a0eebc99-9c0b-4ef8-bb6d-6bb9bd380a11.
These are virtual tables that does not exist physically in the database but is created by combining data from one or more tables in the Destination. For example, consider the following SQL statement:
CREATE VIEW sales AS SELECT city FROM customers WHERE city = 'New York'
Here, a view called sales is created, which consists of all the customers residing in New York.
Views does not store data in a physical location. It allows you to simplify queries and provide a specific perspective on the data, without actually modifying the underlying data in the tables.
Virtual Private Cloud (VPC)
A virtual private network (VPN) set up by an enterprise that comprises an on-demand, configurable pool of shared resources allocated within a public cloud environment. A VPN is dedicated to the enterprise creating it and is logically isolated from other virtual networks in the public cloud. For example, Amazon VPC is a VPN dedicated to a user’s AWS account that enables the user to launch AWS resources in that network.
Write-Ahead Logging (WAL log)
A logging mechanism used by PostgreSQL that maintains the transactions taking place in the database. So, even the smallest change in data is written to this log before it is applied to the PostgreSQL database.
A place to store the data accumulated from a wide range of heterogeneous Sources, generally used for data analysis and reporting. For example, Amazon Redshift, Google BigQuery or Snowflake.
A methodology that allows a web application to communicate based on an event rather than a request, often with a payload of data.
A space created on Slack consisting of organized conversations such as channels and DMs, where team members can communicate and work together.
A space created in an Intercom account to store users’ data. Users can create different workspaces within an account to organize the data based on their functionality, such as Support, Customer Onboarding, Lead Generation, or Customer Engagement.
A space created in Hevo, that allows team members to create and access Pipelines, Models, and Workflows in that workspace. You can add any number of members to your workspace. An organization can create up to five workspaces in Hevo, using the same domain name.
A system-generated column present in all the PostgreSQL tables. Hevo uses this column to query data in the XMIN query mode.