Connecting to and Exploring Data¶
Connect to Your Existing Infrastructure¶
- SQL databases
The list of supported SQL databases, and information on how to connect to them, is available from our documentation on SQL datasets.
- Hadoop HDFS
Accessing Cloud Storage and Databases¶
Cloud File Storage
Fetching Data From Remote Sources¶
It is possible to fetch data using various protocols, and caching the resulting dataset on the filesystem.
Dataiku can read and write in various file formats for files-based connections: filesystem, HDFS, Amazon S3, HTTP, FTP, SSH… See the list of readable file formats.
Accessing Data Through Plugins¶
Many applications such as Google Sheets, SalesForce, Slack… provide capabilities to access their data through APIs. Dataiku DSS plugins allow the addition of custom connections leveraging these APIs to easily define datasets that fetch data from a wide variety of applications.