Concept | Prompt Studios and Prompt recipe#

Prompts are the input to a Generative AI model such as a Large Language Model (LLM). You can modify and craft prompts to achieve a desired outcome — a process known as prompt engineering.

Prompts guide the model’s behavior and output, but they don’t modify the underlying model.

Prompt engineering in Dataiku#

In Dataiku, Prompt Studios and the Prompt recipe work together to test and iterate on LLM prompts, then operationalize them in your data pipeline.

Prompt Studios and the Prompt recipe let you securely connect to LLMs through either a Generative AI service or a private model, including open-source models. You can first experiment with different prompt designs in Prompt Studios, then deploy the chosen prompt in your data pipeline with the Prompt recipe.

The prompt engineering workflow in Dataiku.

Prompt Studios#

Prompt Studios is the playground where you can test and iterate on different prompts and models to engineer the best prompt for a specific use case. You can find Prompt Studios in the top navigation bar under the Visual Analyses menu.

Within a Prompt Studio, you can create multiple prompts and iterations, and even compare them. For each new prompt, you can choose from three different modes:

  • Managed mode: You specify the task using plain language, optionally include other details, and Dataiku generates the final prompt for you. You can use templates for common tasks.

  • Advanced mode: You write the prompt and its examples yourself, using placeholders for the inputs, which gives you more control over the final prompt sent to the LLM (see the sketch after this list).

  • Prompt without inputs: You write a one-off prompt that queries the LLM, designed for quick experimentation. These prompts can’t be converted into a Prompt recipe, and their results can’t be compared in Prompt Studios.
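To make the placeholder idea concrete, here is a minimal Python sketch of a template-based prompt. The template text, the {review_text} placeholder, and the input name are illustrative assumptions, not the exact syntax Prompt Studios uses.

# Minimal sketch of a placeholder-based prompt (advanced-mode idea).
# The placeholder syntax and input name are assumptions for illustration only.

TEMPLATE = (
    "Classify the sentiment of the following product review "
    "as positive, negative, or neutral.\n\n"
    "Review: {review_text}"
)

def build_prompt(review_text: str) -> str:
    """Fill the template with one record's input value."""
    return TEMPLATE.format(review_text=review_text)

print(build_prompt("The blender stopped working after two days."))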

The new prompt info window where you choose the mode to write your prompt in.

Designing prompts#

The Prompt design steps will differ depending on the mode you’re using. In general, this process includes the elements described below.

  • Prompt: Includes the main instructions to the LLM.

  • Inputs: Lists the variables inserted into the prompt. You can map the inputs to columns from a dataset and use values from the dataset as test cases to check the prompt’s output. You can also write test cases manually.

  • Examples: Provides sample input-output pairs that guide the model toward the expected results (see the sketch after this list).

  • Test cases: Evaluates prompt performance using either dataset rows or manually written examples.
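To show how these elements relate, the sketch below assembles a prompt, a few examples, and one test case into a chat-style request. The field names and data are assumptions for illustration; Prompt Studios handles this assembly for you.

# Illustrative only: how a prompt, few-shot examples, and a test case could
# combine into one request. Names and data are assumptions, not Dataiku internals.

prompt = "Classify the sentiment of the review as positive, negative, or neutral."

examples = [  # sample input-output pairs that guide the model
    {"review_text": "Great value, works perfectly.", "output": "positive"},
    {"review_text": "Arrived broken and support never replied.", "output": "negative"},
]

test_case = {"review_text": "Does the job, nothing special."}  # e.g. one dataset row

def build_messages(case):
    """Turn the prompt, examples, and one test case into a chat-style message list."""
    messages = [{"role": "system", "content": prompt}]
    for ex in examples:
        messages.append({"role": "user", "content": ex["review_text"]})
        messages.append({"role": "assistant", "content": ex["output"]})
    messages.append({"role": "user", "content": case["review_text"]})
    return messages

print(build_messages(test_case))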

You can quickly iterate on your prompt design, including updating the task or adding new inputs, examples, and test cases, before deploying your prompt.

Each time you run inference on your prompt and test cases, Prompt Studios also displays an estimated cost per 1,000 records, helping you manage the costs associated with LLM usage.
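For intuition about what drives that estimate, here is a back-of-the-envelope calculation. The token counts and prices are made-up assumptions; actual usage and provider pricing will differ.

# Back-of-the-envelope cost projection. All numbers below are illustrative assumptions.

avg_input_tokens = 350        # prompt + examples + one record's inputs
avg_output_tokens = 40        # typical completion length
price_per_1k_input = 0.0005   # USD per 1,000 input tokens (hypothetical)
price_per_1k_output = 0.0015  # USD per 1,000 output tokens (hypothetical)

cost_per_record = (avg_input_tokens / 1000) * price_per_1k_input \
                + (avg_output_tokens / 1000) * price_per_1k_output

print(f"Estimated cost per 1,000 records: ${cost_per_record * 1000:.2f}")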


Using images in prompts#

You can also process images within Prompt Studios for use cases where visual input is useful. To do this, you will need:

  • Dataiku version 13.2 or above.

  • A connection to an LLM that supports images as inputs.

  • Images stored in a managed folder, which you can connect to within Prompt Studios. (See How to | Prepare images for use in a model for instructions.)

Adding an image folder as input in Prompt Studios.

Settings#

You can change several hyperparameters or set validation rules for the prompt.

  • LLM settings: Adjusts hyperparameters, such as temperature or the maximum number of output tokens, that control the behavior of the LLM.

  • Validation & formatting: Sets rules such as validating that the output is a well-formed JSON object or forbidding certain words in the output. You receive a warning if the output fails validation (see the sketch after this list).
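To illustrate what such rules check, the snippet below validates a hypothetical response as JSON and scans it for forbidden words. It is a standalone sketch using only the Python standard library, not the mechanism Dataiku applies internally.

import json

# Illustrative validation of an LLM response: parse as JSON, then check forbidden words.
# The response text and the rules are assumptions for the sake of the example.

response_text = '{"sentiment": "negative", "confidence": 0.87}'
forbidden_words = {"guarantee", "refund"}

def validate(text):
    try:
        parsed = json.loads(text)  # expected output format: a JSON object
    except json.JSONDecodeError as err:
        return False, f"Not valid JSON: {err}"
    hits = [word for word in forbidden_words if word in text.lower()]
    if hits:
        return False, f"Forbidden words found: {hits}"
    return True, parsed

ok, result = validate(response_text)
print(ok, result)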

Prompt settings for validation and formatting.

Comparing prompts#

Prompt Studios tracks your prompts in the left sidebar. You can navigate between prompts, view each prompt’s history, restore previous versions, and duplicate prompts.

When using managed or advanced modes, you can also compare the output of multiple prompts by selecting them in the sidebar and clicking Compare. Prompt Studios will build a table comparing the outputs and costs of the selected prompts.

A comparison of the inputs and outputs of two similar prompts.
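As a rough picture of what that comparison table contains, the sketch below lines up the outputs and estimated costs of two prompt variants over the same test cases. All names and values are invented for illustration; in practice Prompt Studios builds this table for you.

import pandas as pd

# Illustrative comparison of two prompt variants. All values below are made up.
comparison = pd.DataFrame({
    "test_case": ["Great value, works perfectly.", "Arrived broken."],
    "prompt_a_output": ["positive", "negative"],
    "prompt_b_output": ["positive", "negative"],
})
print(comparison)
print("Estimated cost per 1,000 records: prompt A $0.42, prompt B $0.97")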

Tip

Prompt Studios also tracks each run of a prompt as you iterate. To view the version history of a prompt and revert to any version, click the clock icon in the top right of the Prompt design screen.

Prompt recipe#

The Prompt recipe puts your prompt into action on a dataset. It generates an output dataset and appears in the Flow.

You can create a new Prompt recipe directly from Prompt Studios by saving a prompt template that is mapped to a dataset as a recipe. This allows you to experiment with your prompts before operationalizing them with the recipe.

You can also create a new Prompt recipe directly from the Flow or from the Actions panel of a dataset. With this method, you can write a Structured prompt or a Text prompt directly in the recipe, with the same settings as a prompt template in a Prompt Studio.
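As a point of reference for code-first users, the LLM Mesh Python API can run a similar prompt over dataset rows, which is roughly what the Prompt recipe automates for you. The sketch below is a rough approximation under assumptions: the LLM connection ID, dataset name, and column are placeholders to replace with your own, and the exact API surface may vary across Dataiku versions.

import dataiku

# Rough sketch of what a Prompt recipe automates, using the LLM Mesh Python API.
# "reviews", "review_text", and the LLM ID are placeholders (assumptions).
client = dataiku.api_client()
project = client.get_default_project()
llm = project.get_llm("openai:my-connection:gpt-4o-mini")  # your LLM Mesh connection ID

df = dataiku.Dataset("reviews").get_dataframe()

sentiments = []
for text in df["review_text"]:
    completion = llm.new_completion()
    completion.with_message(
        "Classify the sentiment of this review as positive, negative, or neutral:\n" + str(text)
    )
    resp = completion.execute()
    sentiments.append(resp.text if resp.success else None)

df["sentiment"] = sentiments
# In a real recipe, df would then be written to the output dataset.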

The Prompt recipe settings screen.

What’s next?#

Continue learning about prompt engineering with LLMs by working through the Tutorial | Prompt engineering with LLMs article.