Merging the Datasets¶
We now have three datasets at the same level of granularity: the Asset, i.e., an individual rental car. Joining them together will give us the most possible information for a model. With the same Asset ID in each dataset, we can easily join the datasets with a visual recipe.
From the failure dataset, initiate a Join recipe.
Add usage_by_Asset as the second input dataset.
Name the output
data_by_Asset
. Click Create Recipe.Add maintenance_by_Reason as the third input dataset.
Both joins should be Left Joins. Asset should be the joining key in all cases.
Run the recipe and update the schema to 21 columns.
Note
Check the reference documentation for more information on the Join recipe.
data_by_Asset now holds information from maintenance and usage, labelled by failures. Congratulations, great work!