Use a Dataiku Application

In this section, you will learn how to consume Dataiku Applications.

Note

Dataiku applications are a kind of customization that lets users reuse projects. A typical process for creating and using a Dataiku application is:

  • A user creates a Dataiku project and wants to share it with colleagues. These colleagues need to apply the project’s workflow to new data, or to a subset of the existing data, in order to consume insights, but they don’t need to understand the details of the project;

  • The project owner (or app developer) converts the project into a Dataiku application;

  • Each colleague using the application creates their own instance of the Dataiku application.

Access a Dataiku Application from the Homepage

As an Explorer user, you cannot create Dataiku applications yourself, but you can consume the applications that others have built. Let’s see how this works.

  • Go to the Dataiku homepage by clicking on the bird logo in the top left corner.

  • From the Applications section of the homepage, click the AI Consumer App (Tutorial) application.

  • Click + Start Using the Application to create your own instance of the app.

  • Give your instance a name like My AI Consumer App, and click Create.

Use a Dataiku Application to Add New Data & Filter Dashboard Results

The application page contains three sections: “Upload New Data and Filter by Year”, “View Purchase Patterns”, and “Generate Predictions”.

../../../_images/ai-consumer-app.png

Let’s first upload a dataset containing new transaction data.

  • Download the transactions_2019_2020 dataset here.

  • Under “Upload New Data and Filter by Year”, in the “Upload transactions dataset” tile, upload the transactions_2019_2020 dataset into the dropbox.

  • Delete the original transactions dataset from the dropbox by clicking the “trash” button.

We have replaced the original transactions dataset with a new one containing transactions from 2019 and 2020. We can further customize our view by keeping only the transactions made in 2020.

  • In the “Enter year here:” field, enter 2020.

Next, let’s rebuild the visualizations in the “Purchase Patterns” dashboard on the new data.

  • Under “View Purchase Patterns”, in the ‘Rebuild “Purchase Patterns” dashboard with new data’ tile, click the Run Now button.

This triggers a scenario that rebuilds part of the project, as well as the dashboard results, on the new data. Once the scenario has finished running, a “Scenario finished” message pops up in the bottom right corner.

  • Once the scenario has finished running, click Purchase Patterns in the application to see the results in the dashboard.

You can see that the displayed results differ from the ones you saw in the same dashboard earlier. The Dataiku app uses the same project and dashboard, but it lets you modify certain parameters (in this case, the input data and the purchase year) to produce different results.
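For readers who like to see the idea in code, here is a minimal Python sketch of what "same workflow, different parameters" means. It is not how Dataiku implements the application: the sample rows, the column names (purchase_date, merchant_category, amount), and the function purchases_by_category are all hypothetical and exist only for illustration.

```python
import csv
import io
from collections import Counter

# Hypothetical sample standing in for an uploaded transactions file.
# The column names below are assumptions, not the tutorial dataset's schema.
SAMPLE_CSV = """purchase_date,merchant_category,amount
2019-03-14,grocery,25.10
2020-01-05,travel,310.00
2020-07-22,grocery,42.75
2020-11-30,travel,120.00
"""


def purchases_by_category(csv_text, year):
    """Count transactions per category for a single purchase year."""
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = Counter()
    for row in reader:
        # Keep only rows whose ISO-formatted date falls in the requested year.
        if row["purchase_date"].startswith(f"{year}-"):
            counts[row["merchant_category"]] += 1
    return dict(counts)


# The same function (the "workflow") run with two different year parameters.
summary_2019 = purchases_by_category(SAMPLE_CSV, 2019)
summary_2020 = purchases_by_category(SAMPLE_CSV, 2020)
print(summary_2019)
print(summary_2020)
```

The logic never changes between the two calls; only the year parameter does, which is roughly the role the “Enter year here:” field plays in the application.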

Use a Dataiku Application to Generate ML Predictions on New Data

Now that you have uploaded the dataset containing new transaction data, you can also use the Dataiku application to “score” this data, that is, to generate predictions on it with the machine learning model used in the project, and thus identify potentially fraudulent transactions made in 2020.

With Dataiku apps, you can use machine learning models built by your colleagues to generate predictions on new data with just a few clicks, without having to touch the model yourself! Let’s see how this works.

  • In the top navigation menu, click My AI Consumer App to go back to the application user interface.

  • In the “Generate Predictions” section, in the “Run ML model on new data” tile, click Run Now.

  • Once the scenario has finished running, click on Model Report & Predictions to display the dashboard.

Because the Dataiku app uses the same version of the model, the insights in the “Model Report” slide are the same as the ones you already saw.

  • Click Predictions in the lower left corner to navigate to the Predictions slide.

Notice that the results in this slide have changed: it now displays the 2020 transactions that the ML model has detected as potentially fraudulent.


Finally, you can optionally use the Dataiku app to download a dataset containing all transactions from 2020, along with the model’s predictions about whether each one is fraudulent.

  • In the top navigation menu, click My AI Consumer App to go back to the application user interface.

  • At the bottom, in the “Download predictions” tile, click the Download button.

The transactions_unknown_scored dataset, which contains all the scored transactions from 2020, will be downloaded as a CSV file.

../../../_images/download-predictions.png
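If you want to explore the downloaded file outside Dataiku, a short script can list the transactions the model flagged. The sketch below is only an illustration under stated assumptions: the column names (transaction_id, prediction), the "1" label for a fraud prediction, and the sample rows are hypothetical, so check the header of your own CSV file before adapting it.

```python
import csv
import io

# Hypothetical excerpt standing in for the downloaded CSV.
# Column names and the "1 = predicted fraudulent" encoding are assumptions.
SAMPLE_SCORED_CSV = """transaction_id,purchase_date,amount,prediction
t1,2020-01-05,310.00,0
t2,2020-07-22,42.75,1
t3,2020-11-30,120.00,0
t4,2020-12-02,980.00,1
"""


def flagged_transactions(csv_file):
    """Return the IDs of rows whose prediction column equals "1"."""
    reader = csv.DictReader(csv_file)
    return [row["transaction_id"] for row in reader if row["prediction"] == "1"]


# Run on the in-memory sample so the example works without any download.
flagged = flagged_transactions(io.StringIO(SAMPLE_SCORED_CSV))
print(flagged)
```

To run it on a real download, you could pass an opened file instead of the sample, for example `with open(path, newline="") as f: flagged_transactions(f)`, where path points to wherever your browser saved the file.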