Join the datasets#

We now have three datasets at the same level of granularity: the Asset, i.e., an individual car. Joining them together will give us the most possible information for a model. With the same Asset ID in each dataset, we can easily join the datasets with a visual recipe.

  • From the failure dataset, initiate a Join recipe.

  • Add usage_by_Asset as the second input dataset.

  • Name the output data_by_Asset, and click Create Recipe.

  • Add a third dataset maintenance_by_Reason to join to failure.

  • Both joins should be Left Joins. Asset should be the joining key in all cases.

  • Run the recipe, and confirm the output has 21 columns.

Dataiku screenshot of the Join step of a Join recipe

Note

To learn more on the Join recipe, see the following articles:

The data_by_Asset dataset now holds information from maintenance and usage, labelled by failures. Congratulations, great work!