Amazon Aurora is a drop-in replacement for MySQL that combines the performance and availability of traditional enterprise databases with the simplicity and cost-effectiveness of open source databases. Amazon Aurora is fully managed by Amazon Relational Database Service (RDS), which automates time-consuming administration tasks like hardware provisioning, database setup, patching, and backups.
You can ingest data from your Amazon Aurora MySQL database using Hevo Pipelines and replicate it to a Destination of your choice.
Prerequisites
Perform the following steps to configure your Amazon Aurora MySQL Source:
Set up MySQL Binary Logs for Replication
A binary log is a collection of log files that records information about data modifications and data object modifications made on a MySQL server instance. Typically binary logs are used for data replication and data recovery.
Hevo supports data ingestion for replication from MySQL servers via binary logs (BinLog). For this, binary logging must be enabled on your MySQL instance.
Perform the following steps to enable binary logging for your Amazon Aurora MySQL server:
-
Log in to the Amazon RDS console.
-
In the left navigation pane, click Databases.
-
In the Databases section on the right, click the DB identifier of your Amazon Aurora database instance.
data:image/s3,"s3://crabby-images/42450/4245037e72987e6d6e300b59b654700295391af0" alt="Select DB identifier"
-
Click the Configuration tab, and then click the link text under DB cluster parameter group.
Note: If you are using the default Aurora DB cluster parameter group, you need to create one with the Type as DB cluster parameter group, as the default one is not editable.
data:image/s3,"s3://crabby-images/ae21f/ae21f55fb6837c8548f818b50585e97fe14c05a4" alt="Parameter group"
-
Click Edit.
data:image/s3,"s3://crabby-images/93f76/93f7670a611d893a5f0f0a0f343319a5d24b2efb" alt="Edit parameters"
-
On the page that appears, search and update the value of the following parameters:
data:image/s3,"s3://crabby-images/93f74/93f74e4b6aa53fefd8dcf85d7c85a6ddc685b45a" alt="Parameter value"
Parameter Name |
Value |
binlog_format |
ROW |
binlog_row_image |
full |
gtid-mode |
ON |
enforce_gtid_consistency |
ON |
Note:
-
Enabling GTIDs (Global Transaction Identifiers) is recommended because it simplifies replication by uniquely identifying transactions, making it easier to track them.
-
Ensure that the value of binlog_row_value_options
parameter is not set to PARTIAL_JSON.
-
Click Save Changes.
-
Reboot the database instance that you are using to connect to Hevo, to apply the above changes.
To do this:
-
In the left navigation pane, under Dashboard, click Databases.
-
In the Databases section on the right, select the DB identifier of your Amazon Aurora MySQL database instance.
-
In the Actions drop-down, click Reboot.
Note: If you reboot a database instance with the Writer instance role in the DB cluster, all the remaining reader instances of that database in the cluster are rebooted as well.
data:image/s3,"s3://crabby-images/cc443/cc443d24c50f9dcf3ebbbcf0137d219efbb48d22" alt="Click Reboot"
-
On the Reboot page, click Confirm to reboot your DB instance.
data:image/s3,"s3://crabby-images/e9ab9/e9ab9bdfacee9b29353683f950df5be142311a25" alt="Confirm Reboot"
The replication reference guide on MySQL’s documentation portal provides a complete reference of the options available for replication and binary logging.
-
Log in to your Amazon Aurora MySQL database instance with ADMIN
privileges.
-
Run the following command to view the current BinLog retention period (in hours):
call mysql.rds_show_configuration;
-
If the BinLog retention period is less than 72 hours, run the following command to set it to at least 72 hours (three days).
call mysql.rds_set_configuration('binlog retention hours', 72);
Allowlist Hevo IP addresses for your region
You need to allowlist the Hevo IP address for your region to enable Hevo to connect to your Amazon Aurora MySQL database. To do this:
-
Log in to the Amazon RDS console.
-
In the left navigation pane, click Databases.
-
In the Databases section on the right, click the DB identifier of your Amazon Aurora database instance.
data:image/s3,"s3://crabby-images/42450/4245037e72987e6d6e300b59b654700295391af0" alt="Click DB identifier"
-
In the Connectivity & security tab, click the link text under Security, VPC security groups.
data:image/s3,"s3://crabby-images/6fb8e/6fb8e3b4fdcc1e4ce5702dc8feaf0c2ef67452cd" alt="Security group"
-
On the Security Groups page, select the check box for your Security group ID, and from the Actions drop-down, click Edit inbound rules.
data:image/s3,"s3://crabby-images/a6368/a6368985929ab60213a7337defa49c2a4d69566a" alt="Edit inbound rules"
-
On the Edit inbound rules page:
data:image/s3,"s3://crabby-images/418a8/418a8f9fe2b4b8c90e9192f000ccfa44434bfb0c" alt="Add rule"
-
Click Add rule.
-
In the Type column, select MySQL/Aurora from the drop-down.
-
In the Source column, select Custom from the drop-down and enter Hevo’s IP addresses for your region. Repeat steps 1-3 to allowlist all the IP addresses.
-
Click Save rules.
Create a Database User and Grant Privileges
1. Create a database user (Optional)
Perform the following steps to create a database user in your Amazon Aurora MySQL database:
-
Connect to your Amazon Aurora MySQL database as a root user with an SQL client tool, such as MySQL workbench.
-
Run the following command to create a user in your database:
CREATE USER <database_username>@'%' IDENTIFIED BY '<password>';
Note: Replace the placeholder values in the command above with your own. For example, <database_username> with hevo.
2. Grant privileges to the user
The following table lists the privileges that the database user for Hevo requires to connect to and ingest data from your MySQL database:
Privilege |
Grants access to |
SELECT |
Retrieve rows from the database tables. |
RELOAD |
Clear or reload internal caches, flush tables, or acquire locks. |
SHOW DATABASES |
View the list of database names in the server. |
REPLICATION CLIENT |
Access the MySQL server’s BinLog for replication. |
REPLICATION SLAVE |
View replication status and log details. |
EXECUTE |
Run procedures to read BinLog settings on the MySQL server. |
Connect to your Amazon Aurora MySQL database as a root user with an SQL client tool, such as MySQL Workbench, and run the following script. These commands grant only the necessary privileges required by Hevo to ingest data from your Amazon Aurora MySQL database.
# Grant Privileges to the Database User
GRANT SELECT, RELOAD, SHOW DATABASES, REPLICATION CLIENT, REPLICATION SLAVE ON *.* TO <database_username>@'%';
GRANT EXECUTE ON PROCEDURE mysql.rds_show_configuration TO '<database_username>'@'<hostname>';
# Grant Hevo access to the Database
GRANT SELECT ON <database_name>.* TO <database_username>;
# Finalize the User’s Permissions
FLUSH PRIVILEGES;
Note:
-
The SELECT
, RELOAD
, and SHOW DATABASES
privileges are required only for the historical load.
-
Replace the placeholder values in the commands above with your own. For example, <database_username> with hevo.
Retrieve the Database Hostname and Port Number (Optional)
Note: The Amazon Aurora MySQL hostnames start with your database name and end with rds.amazonaws.com. For example, database-1-instance-1.xxxxxxxxx.rds.amazonaws.com
-
In the left navigation pane of the Amazon RDS console, click Databases.
-
In the Databases section on the right, click the DB identifier of your Amazon Aurora MySQL database instance.
data:image/s3,"s3://crabby-images/42450/4245037e72987e6d6e300b59b654700295391af0" alt="Click DB identifier"
-
In the Connectivity & security tab, do the following:
data:image/s3,"s3://crabby-images/f3c3f/f3c3f5b6cd9b0ce2eba5cf94e064caabda345e5a" alt="Copy endpoint & port"
-
Click the copy (
) icon to copy the Endpoint and save it.
-
Copy the Port and save it.
Use these values as your Database Host and Database Port, respectively, while configuring your Amazon Aurora MySQL Source in Hevo.
Perform the following steps to configure your Amazon Aurora MySQL Source:
-
Click PIPELINES in the Navigation Bar.
-
Click + CREATE PIPELINE in the Pipelines List View.
-
On the Select Source Type page, under All Sources, click Edge, and then select Amazon Aurora MySQL.
data:image/s3,"s3://crabby-images/032ff/032ff0522f35295f34f34902e084a152792ab24c" alt="Select Edge Sources"
-
In the Amazon Aurora MySQL screen, specify the following:
data:image/s3,"s3://crabby-images/37406/374061fddf0b28542ff04b25be61420c8094552f" alt="Aurora MySQL settings"
-
Source Name: A unique name for your Source, not exceeding 255 characters. For example, Amazon Aurora MySQL Source.
-
In the Connect to your MySQL section:
-
Database Host: The MySQL host’s IP address or DNS. This is the endpoint that you obtained in Step 4 above.
Note: For URL-based hostnames, exclude the http:// or https:// part. For example, if the hostname URL is http://mysql-database.westeros.inc, enter mysql-database.westeros.inc.
-
Database Port: The port on which your Amazon Aurora MySQL server listens for connections. This is the port number that you obtained in Step 4 above. Default value: 3306.
-
Database User: The authenticated user who has the permissions to read tables in your database. This user can be the one you created in Step 3 above or an existing user. For example, hevouser.
-
Database Password: The password of your database user.
-
Database Names: The comma separated list of databases from where you want to replicate data. For example, demo1, demo2.
-
(Optional) In the Additional Settings section:
-
Use SSH: Enable this option to connect to Hevo using an SSH tunnel instead of directly connecting your MySQL database host to Hevo. This provides an additional level of security to your database by not exposing your MySQL setup to the public.
If this option is turned off, you must configure your Source to accept connections from Hevo’s IP addresses.
-
Use SSL: Enable this option to use an SSL-encrypted connection. Specify the following:
-
CA File: The file containing the SSL server certificate authority (CA).
-
Client Certificate: The client’s public key certificate file.
-
Client Key: The client’s private key file.
-
Click TEST & CONTINUE to test the connection to your Amazon Aurora MySQL Source. Once the test is successful, you can proceed to set up your Destination.
Read the detailed Hevo documentation for the following related topics:
Data Type Mapping
Hevo maps the MySQL Source data type internally to a unified data type, referred to as the Hevo Data Type, in the table below. This data type is used to represent the Source data from all supported data types in a lossless manner.
The following table lists the supported MySQL data types and the corresponding Hevo data type to which they are mapped:
MySQL Data Type |
Hevo Data Type |
- BIT(1) - BOOLEAN - TINYINT(1) - TINYINT UNSIGNED(1) |
BOOLEAN |
- TINYINT(>1) - SMALLINT - TINYINT UNSIGNED(>1) |
SHORT |
- INT - MEDIUMINT - SMALLINT UNSIGNED - MEDIUMINT UNSIGNED - YEAR |
INTEGER |
- BIGINT - INT UNSIGNED - BIGINT UNSIGNED |
LONG |
- FLOAT(0-23) |
FLOAT |
- REAL - DOUBLE - FLOAT(24-53) |
DOUBLE |
- NUMERIC - DECIMAL |
DECIMAL |
- CHAR - VARCHAR - TINYTEXT - TEXT - MEDIUMTEXT - LONGTEXT - JSON - ENUM - SET |
VARCHAR |
- TIMESTAMP |
TIME_TZ |
- DATE |
DATE |
- TIME |
TIME |
- DATETIME |
TIMESTAMP |
- BIT(>1) - BINARY - VARBINARY - TINYBLOB - BLOB - MEDIUMBLOB - LONGBLOB |
BYTEARRAY |
At this time, the following MySQL data types are not supported by Hevo:
Note: If any of the Source objects contain data types that are not supported by Hevo, they are marked as unsupported during object configuration in the Pipeline.
Source Considerations
- MySQL does not generate log entries for cascading deletes. So, Hevo cannot capture these deletes for log-based Pipelines.
Limitations
-
Hevo only fetches tables from the MySQL database. It does not fetch other entities such as functions, stored procedures, views, and triggers.
-
Hevo does not set the metadata column __hevo__marked_deleted to True for data deleted from the Source table using the TRUNCATE command. This action could result in a data mismatch between the Source and Destination tables.