Concept | Dataset conditional formatting#

Let’s learn how conditional formatting can help you organize and visualize your dataset with color.

Introduction#

Dataiku provides a few formatting options that can add colors to cells in a column. A preview of the column colors will always appear.

A Dataiku screenshot of a color scale formatted dataset.

Enable from context menu#

You can enable different color formats in the context menu of a column under Conditional formatting.

A Dataiku screenshot the context menu of a column.

Enable from Display menu#

You can also enable conditional formatting from the Display menu. However, this will apply the conditional formatting to each column, rather than to individual columns.

A Dataiku screenshot the Display menu options of a dataset.

Color by meaning validity#

Color by meaning validity is the default display of a dataset.

Cells are colored depending on if their value reflects or does not reflect the column meaning. This allows you to analyze your data quality at a high level. For instance, seeing the color red across all of your columns may indicate that you need to perform some data cleaning.

A Dataiku screenshot highlighting a column with invalid cells according to meaning.

Note

Learn about column meanings in Concept | Dataset characteristics.

Color by rules#

Color by rules provides a way to color cells by certain logical conditions. You can choose one color per condition, as show here:

A Dataiku screenshot highlighting a column with invalid cells according to meaning.

Color scale#

The Color scale option uses a sequence of colors to represent a column value’s numerical scale. For example, a column of integers may show the largest value in a dark color and the smallest value in a light color.

A Dataiku screenshot with cells colored by scale.