Joining Datasets

In Excel you can use the VLOOKUP function to enrich one dataset with columns from another, by matching values of a key column that is common to both datasets. A separate VLOOKUP statement should be created for each column added from the other dataset.

../../../_images/excel-vlookup.png

In Dataiku DSS, the Join recipe matches values of a key column that is common to both datasets. Records with the same key values are paired up and share information. Depending on the type of join, records without a match may be kept or dropped. By default, all other columns are included in the output dataset, but you can choose a subset.

See also

Join recipe lessons in the Visual Recipes Overview course

Join recipe in the reference documentation