Connecting to your Amazon Redshift database? Here is an overview of things you can expect.
Connecting to Amazon Redshift
Before starting, please collect your Amazon Redshift Host, Port, Database name, username and password. Additionally, please whitelist these IP addresses before connecting.
- Click on the Data tab at the top left of the global navigation. Then click on Connections in the Data Library section.
- Click on the blue Connect button and search for Amazon Redshift.
- Click on the Amazon Redshift logo to connect.
- Type in the connection nickname, this can be anything that you would like to name it.
- Type in the host, port, database name, username, and password.
Data Warehousing
Google BigQuery is on the Grow Data Warehouse, which we highly recommend you use. For more information about warehousing your data, visit the Data Warehouse Help Article.
Endpoints
Amazon Redshift does not have set reports through which to pull data. The endpoints are the tables inside your database.
Warehousing Your Data
- Starting with the Connections Overview Page, navigate to your Amazon Redshift connection and click on it to open the Connection Details page.
- Select the Manage Connection button inside the Connection Details section to open the warehouse table selection flow.
- If your data source isn't already connected, refer to the Connecting to a Data Source for more details. If your data source is already connected, you can immediately select which tables of data you want to warehouse. Once you have finished selecting the data that you want stored in the Data Warehouse, click the Sync & Store button at the bottom of the Manage Connection page. This will begin the initial population of your warehouse with the tables you defined. Populating your data warehouse for the first time may take a while, even up to several hours.
- While your data is "In Queue" or "Processing", you may continue working within the app.
Sync Interval
Full Sync
The default sync interval is 12 hours per table for all warehoused tables.
Incremental Sync
Incremental sync is enabled for Amazon Redshift and when incremental sync is available, the default incremental sync interval is 1 hour. You can adjust this interval as you want.
To learn more about sync intervals and how to adjust the default settings, please visit the Sync section of the Data Warehouse help article.
FAQs + Tips and Tricks
Random Disconnects
Sometimes Amazon Redshift disconnects from Grow randomly. We have found that it is an issue with Redshift and not the connector of the Grow. If your metrics become disconnected, reconnect to your database.
How can I make sure my data is secure?
We have several items in place to keep your data safe. Check out our security article to see what steps you can take to keep control of your data.
Connecting via SSH
Redshift itself doesn't use SSH.
In order to to use a SSH tunnel, Grow will need to connect to a server on the same VPC that has access to your Redshift DB. It is usually an EC2 Instance that has the PostgreSQL CLI client installed. This is because Redshift itself doesn't support SSH tunneling, but we can connect through a server that has access to the Redshift DB.
Info Required
Once you have setup an additional EC2 instance inside of the same VPC, please install PostgreSQL CLI client, as well as SSH server. Make sure port #22 is open, and send us the whitelisted IP address from which we can access the server.