Azure Synapse Analytics

Last updated on Sep 24, 2024

Azure Synapse is a cloud-based analytics service that combines the technologies required for data integration, data warehousing, and big data analytics. Users can query and perform analytics on data using resources organized in both serverless and dedicated SQL pools.

Hevo integrates with dedicated SQL pools, which are SQL data warehouses in Azure Synapse Analytics. You must provision the dedicated SQL pool from within a Synapse workspace. A Synapse workspace helps to securely collaborate between your Azure resources, which are logically grouped together in a container called resource group.

If you are new to Azure Synapse Analytics and need assistance with setting up an instance, refer to these sections:

The following image illustrates the key steps that you need to complete to configure Azure Synapse Analytics as a Destination in Hevo:

Azure Synapse Analytics Destination Setup Overview


Handling Source Data with Different Data Types

Hevo automatically modifies the data type of an Azure Synapse Analytics Destination table column to accommodate Source data with a different data type. Read Handling Different Data Types in Source Data.

Note: Your Hevo release version is mentioned at the bottom of the Navigation Bar.


Handling JSON Fields

Read Parsing Nested JSON Fields in Events to know how Hevo parses the JSON data and makes it available at the Destination. JSON data ingested from the Source is either collapsed into JSON strings or passed as-is to the Destination. Read JSON Functions (Transact-SQL) for information on querying and extracting data from your Azure Synapse Analytics Destination tables using JSON functions.


Destination Considerations

  • In Azure Synapse Analytics, tables in dedicated SQL pools are created with clustered columnstore index by default, as this reduces data storage costs and improves query performance. However, any variable-length column in these tables cannot contain more than 8000 bytes of data. As a result, any Events containing data that exceeds this size limit are sidelined by Hevo. You can use Transformations to reduce the data size of these Events and replay them.

  • Azure Synapse Analytics does not support the ORDER BY clause in Common Table Expressions (CTEs). When creating Models for this Destination, avoid using the ORDER BY clause within a CTE in your SQL query, as Hevo runs a CREATE TABLE AS SELECT (CTAS) command to generate the Model. The ORDER BY clause may cause the Model generation to fail.


Limitations

  • Hevo does not support data replication to serverless SQL pools in Azure Synapse Analytics.



Revision History

Refer to the following table for the list of key updates made to this page:

Date Release Description of Change
Sep-30-2024 NA Updated sections, Create a Dedicated SQL Pool (Optional), Whitelist Hevo’s IP addresses and Obtain the Azure Synapse Analytics Data Warehouse Connection Settings as per the latest Azure Synapse Analytics UI.
Jul-15-2024 NA Updated section, Destination Considerations to add the limitation for using ORDER BY clause in CTEs.
Apr-25-2023 2.12 Updated section, Configure Azure Synapse Analytics as a Destination to add information that you must specify all fields to create a Pipeline.
Mar-10-2023 2.09 New document.

Tell us what went wrong