Connect to Your Data on Dataiku Online

Supported Connections

Dataiku Online allows you to connect to multiple sources of data as read-only sources or read-and-write storage.

Note

A (read-only) Data Source will be used to inform Dataiku how it can access data stored externally: Dataiku remembers the location of the original source datasets. This is read-only; no data is stored or modified in the original system.You typically use these datasets as the entrypoint (leftmost part) to your Flow.

A (read-and-write) Data Storage will be used not only to allow Dataiku to read the data, but also to create new datasets (write) and, in SQL Data Storage, perform in-database computation, thus improving performance.

From Dataiku Online, you can connect to the following:

Type

Read / Data Sources

Read and Write / Data Storage

Snowflake

X

X

Azure Synapse

X

X

Google BigQuery

X

X

Amazon Redshift

X

X

PostgreSQL

X

X

Oracle

X

X

SQL Server

X

X

MySQL

X

X

Amazon S3

X

X

Azure Blob Storage

X

X

Google Cloud Storage

X

X

MongoDB

X

With Data Connector Plugins, you can also connect to the following: Salesforce, Zendesk, Google Sheets

Note

Depending on your subscription plan, not all connectors may be available.

How to Add a New Data Connection

  • First navigate to the Launchpad to get started.

  • In your space, open the Connections tab and click on the button Add a connection:

../../_images/add-a-connection.png

  • Choose your connection type from the Read Only Data Sources or Read/Write Data Storage sections:

../../_images/add-a-feature.png

  • Fill the connection details, and then click on Test:

../../_images/snowflake-connection-4.jpg

Once the test is OK, you can add the feature. You will get a confirmation message as well as a message letting you know the IP addresses you might need to whitelist to allow connection.

../../_images/snowflake-connection-5.jpg