Reference | Supported data connections#

Dataiku Cloud allows you to connect to multiple sources of data as read-only sources or read-and-write storage.

Important

A read-only data source will be used to inform Dataiku how it can access data stored externally. Dataiku remembers the location of the original source datasets. This is read-only; no data is stored or modified in the original system. You typically use these datasets as the entry point (leftmost part) to your Flow.

A read-and-write data storage will be used not only to allow Dataiku to read the data, but also to create new datasets (write) and, in SQL data storage, perform in-database computation, thus improving performance.

Warning

For convenience, every Dataiku Cloud subscription includes a designated amount of object storage in an S3 bucket. This data storage is not backed up. Versioning is not activated, and no lifecycle policies are defined. We recommend customers connect their own data storage to Dataiku Cloud so they can fully manage and govern it, including backups. We do not recommend using the managed S3 storage included for convenience with Dataiku Cloud for production use cases.

From Dataiku Cloud, you can connect to the following sources:

Type

Read Only Data Sources

Read / Write Data Storage

Snowflake

X

X

Azure Synapse

X

X

Google BigQuery

X

X

Amazon Redshift

X

X

PostgreSQL

X

X

Oracle

X

X

SQL Server

X

X

MySQL

X

X

Amazon S3

X

X

Azure Blob Storage

X

X

Google Cloud Storage

X

X

Databricks

X

X

Athena

X

X

MongoDB

X

DynamoDB

X

X

ElasticSearch

X

X

SCP/SFTP

X

X

FTP

X

X

SAP HANA

X

X

Note

Depending on your subscription plan, not all connectors may be available.