Prompt Embeddings

Last updated: June 3, 2026

asset_MW7XULFQcKo94RjGU5BfGNxs_Model family_ FLUX.2 (exemplo de modelo de personagem consistente)_Prompt for banner (horizontal web banner, high-res)__Create a clean, warm-studio desk style web banner in the style .png

Overview of prompt embeddings for Custom Models in the Scenario web app.

The short version

A prompt embedding appends a fixed set of tokens to every prompt for a trained model — without typing them each time.
Use embeddings for style descriptors, brand language, trigger words, or quality cues across all generations.
Configure embeddings on the model Details tab under Prompt Embedding.
Embeddings apply to generations in the web app, API, and MCP — for every team member.

A prompt embedding injects tokens into every prompt used with a model. Bake in style descriptors, brand language, or quality cues so they apply across all generations from that model.

This article covers what embeddings do, supported training families, setup, and how they interact with Custom Models.

What prompt embeddings do

When generating from a model with a prompt embedding configured:

The creator types a normal prompt: a young woman holding a coffee cup
The system appends the embedding tokens — for example, , MYSTYLE, vibrant colors, soft lighting
The model receives the combined prompt and generates accordingly

The extra tokens run automatically. Embeddings keep output consistent across generations, team members, and API calls.

Purpose	Example tokens
Steer generation toward a style or brand look	`MYSTYLE`, `cinematic lighting`, `3D cartoon style`

Family support

Prompt embeddings work across all current Scenario training families — single-image and edit LoRAs alike:

FLUX.2 Dev / FLUX.2 Klein 9B / FLUX.2 Klein 4B
FLUX.2 Edit
Qwen Image
Qwen Edit
Z-Image / Z-Image Turbo
FLUX.1 Kontext

Set up a prompt embedding

Open the trained Custom Model in the Scenario web app.

Open the Details tab.

Find the Prompt Embedding field.

Enter the tokens to append automatically — comma-separated, in the order they should apply.

Save. The embedding is active for all generations from this model.

When to use embeddings

Team or brand consistency: every member's outputs share style cues without coordinating prompts.
Public model distribution: the model works without users learning trigger words.
API and MCP integrations: embeddings reduce per-call prompt construction.

Common pitfalls

Too many tokens. Five to ten well-chosen tokens beats twenty. Extra tokens dilute one another.

Conflicting tokens. cinematic plus flat illustration fight each other. Keep the embedding internally coherent.

Trigger word missing from prompts and embedding. Confirm the trigger appears via the embedding or via user prompts — not neither.

Stale embedding after retraining. After a retrain, verify the embedding still matches the new model's vocabulary.

Prompt embeddings are ready to use on the next generation from that Custom Model. For training basics, see the Custom Model Training overview.