Ideogram V4: The Essentials

Last updated: June 12, 2026

Covers Ideogram V4Ideogram V4 Image to Image, and Ideogram V4 Tiling

asset_SC4ZNvDiW81W71zUwZ7k7MzT_Model_ Ideogram V4 (use model_ideogram-v4; include variants model_ideogram-v4-image-to-image and model_ideogram-v4-tiling in captions)_Prompt (single-image banner, landscape_16_9, (1).png

The Ideogram V4 family on Scenario is built around one signature: typography that actually renders. The base text-to-image model puts spell-perfect words inside posters, menus, packaging, signage, UI, and brand surfaces. Two specialized siblings extend it: Image to Image restyles an existing frame from a text prompt while keeping the composition intact, and Tiling generates seamless textures and patterns that repeat perfectly across any surface. All three share the same speed tiers, the same Magic Prompt expansion control, and the same multilingual typography engine.


Which Model Should I Use?

Model

ID

Capability

Best for

Ideogram V4 Generation

model_ideogram-v4

txt2img

Posters, logos, menus, packaging, signage, UI mockups with spell-perfect in-image typography

Ideogram V4 Image to Image Editing

model_ideogram-v4-image-to-image

img2img

Restyle a source image with a text prompt while preserving the composition. Strength dial from subtle to full reinvention.

Ideogram V4 Tiling Generation

model_ideogram-v4-tiling

txt2img and img2img

Seamless textures and repeating patterns. Three tiling modes (both, horizontal, vertical), prompt-from-scratch or image-to-tile.

Use the base V4 when the deliverable is a one-shot graphic with text on it. Use Image to Image when you already have a source frame and want to swap the style, mood, or medium without losing the subject. Use Tiling when the result must wrap, like wallpaper, fabric prints, game-ready ground textures, brand surfaces, and material libraries.


How to Use the Models

Ideogram V4 (Text to Image)

The base V4 model excels at design work that needs accurate text rendering. The minimum call is just a prompt:

prompt:         "A vintage travel poster for TOKYO, JAPAN, 1950s midcentury modern style"
imageSize:      "square_hd"       // default, 1024x1024
renderingSpeed: "BALANCED"        // default
expansionModel: "Medium"          // default Magic Prompt

The signature feature is in-image typography. Quote the literal words you want rendered. "FRESH BREAD" beats "the words fresh bread": Ideogram's text engine treats quoted strings as the literal type to render. Describe placement and hierarchy alongside the quoted text: which line is the headline, which is a tagline, where on the canvas they sit.

Three speeds trade time for letterform sharpness:

  • TURBO: fastest. Right for thumbnails and rapid iteration when typography is short and bold.

  • BALANCED: default. The right middle ground for everyday work.

  • QUALITY: sharpest letterforms and texture. Use for final delivery, multilingual scripts (Japanese, Arabic, Cyrillic), and intricate layouts.

Turn Magic Prompt (expansionModeloff when literal output matching your wording is essential. Magic Prompt set to Medium or Large can introduce stylistic detail you did not ask for, including stray decorative typography. For final spelling verification on the live model page, leave expansionModel: None.

Six aspect ratios are available: square_hd (1024 by 1024, default), square (512 by 512), portrait_16_9 (576 by 1024), portrait_4_3 (768 by 1024), landscape_16_9 (1024 by 576), landscape_4_3 (1024 by 768). Pick portrait for social, landscape for banners and key art, square for icons and avatars.

Featured Examples

The pinned examples on the Ideogram V4 model page show the text engine across formats. The comic splash below renders "THIS ENDS TONIGHT!" inside a jagged speech bubble plus "SECTOR 9" in a caption box, all baked into the artwork at QUALITY rendering:

Retro burger restaurant ad

Retro print ad, landscape 4:3

Jazz night flyer with guitar player

Jazz night flyer, portrait 9:16

Tarot card with woman and cat

Tarot card, portrait 3:4

Metal wolf badge

Metal wolf badge, square HD

More on the Ideogram V4 model page: 39 pinned examples spanning posters, brand identity, signage, character cards, retro print, and game key art.


Ideogram V4 Image to Image

The Image to Image variant takes a source image and a prompt, then restyles the source while keeping the underlying composition. The minimum call is a prompt plus a source:

image:    asset_xxx
prompt:   "Transform into a pencil sketch on cream paper with cross-hatching"
strength: 0.85

Below, a photoreal perfume bottle becomes a soft graphite study at strength: 0.85. The bottle's silhouette and gold liquid level stay where they were; the medium and surface treatment swap entirely:

Photoreal perfume bottle restyled as a pencil sketch

Click to open asset_Fff9MQcbTRYhLkseEniUwmXz on Scenario.

Tuning the Strength Dial

strength is the single most important control. It runs from 0 to 1 and decides how far the result departs from the source:

  • 0.3 to 0.5: subtle. Lighting, color grading, mood adjustments. The subject is preserved.

  • 0.55 to 0.7: moderate. Style and medium swap clearly, the subject's shape and pose stay recognizable. The "CITY STYLE" magazine cover example below used strength: 0.55.

  • 0.75 to 0.95: dramatic. Pose is preserved but the medium, palette, and fine geometry get reinvented. Use for medium swaps that turn a photograph into a print or painting.

Default is 0.8. Lower it when the source must remain identifiable; raise it when you want the model to commit to the new look.

Re-rendering Text in Images

Ideogram's text rendering carries through Image to Image. Quote text inside the prompt and the model bakes it cleanly into the restyle. Below: a watercolor postcard with "VAN LIFE" hand-lettered in the sky, and a magazine cover restyle with "CITY STYLE" set across the top. Lower the strength (around 0.55 to 0.65) on text-heavy restyles so the model has room to draw the type without warping the underlying scene:

Street fashion group restyled as CITY STYLE magazine cover

"T" magazine cover, strength: 0.55

Featured Examples:

Eight pinned examples from the model page, each linking back to the source asset on Scenario. Same composition, different medium:

Dancer as Y2K rave poster

Dancer as Y2K rave poster

Red dress fashion as editorial pencil

Red dress as editorial pencil

More pinned restyles live on the Ideogram V4 Image to Image model page.


Ideogram V4 Tiling

The Tiling variant generates seamless textures and patterns that repeat perfectly across any surface. The simplest call needs nothing more than a prompt:

prompt:      "Hand-painted gouache polka dots, mustard yellow on cream"
tilingMode:  "both"          // default, repeats in every direction
imageSize:   "square_hd"     // default, 1024x1024

To prove a tile is really seamless, paint it as a CSS background and watch the seams disappear. The block below is one Art Deco tile repeated. Click the tile name underneath to open the original asset on Scenario:

Art deco gold tile (asset_GEqb7PNntRpKVnV76pvVUAeV) repeated 4 by 2 at 200px. No seams visible.

Choosing the Tiling Mode

The tilingMode input decides which axes repeat:

  • both (default): repeats horizontally AND vertically. Wallpaper, fabric prints, ground textures.

  • horizontal: repeats left-to-right only. Borders, banner backgrounds, marble counter strips.

  • vertical: repeats top-to-bottom only. Vertical wood plank, column patterns, stripe materials.

Liberty chintz both-axis

both: Liberty chintz

Featured Examples:

Moroccan zellige tile

Moroccan zellige mosaic

Damask navy and gold

Damask navy and gold

Otomi embroidery

Otomi embroidery

More pinned tiles on the Ideogram V4 Tiling model page.

Optional Image-to-Tile

Pass an image to the image input and Ideogram will convert your reference into a seamless tile rather than starting from scratch. The strength dial controls how far the tile departs from the source: low values (around 0.3) tile the original closely, high values (0.8 to 0.95) treat it as inspiration. Combine with a mask (black-and-white image) to redo only specific regions of an existing tile.


Parameters

Shared Across the Family

prompt

Required, up to 2048 characters. Describe the subject, style, mood, and any text that should appear. Quote literal strings for in-image typography.

renderingSpeed

One of TURBOBALANCED (default), or QUALITY. Trades speed for letterform sharpness and texture detail.

expansionModel

One of NoneMedium (default), or Large. Whether Ideogram's Magic Prompt enriches your input with extra style detail. None obeys your prompt exactly; Large is the most descriptive.

acceleration

One of none (default), lowregularhigh. Extra speed boost on top of renderingSpeed. Keep at none for dense typography and high-detail renders.

numOutputs

Range 1 to 4, default 1. How many variations to generate from the same prompt.

seed

0 to 2147483647. Reusing the same seed and settings reproduces the same image. Leave empty for fresh variation.

Ideogram V4 (txt2img)

imageSize

One of square_hd (default, 1024 by 1024), square (512 by 512), portrait_16_9 (576 by 1024), portrait_4_3 (768 by 1024), landscape_16_9 (1024 by 576), landscape_4_3 (1024 by 768). Match the dimension to the publication format.

Ideogram V4 Image to Image

image

Required. Source image asset on Scenario.

strength

Range 0 to 1, default 0.8. How far the result departs from the source. The most important dial in the model.

imageSize

Default auto matches the source dimensions. Override with any of the seven shaped sizes when the restyle should reshape the canvas at the same time.

Ideogram V4 Tiling

image (optional)

Optional starting image. When supplied, the model converts it into a seamless tile.

strength

Range 0 to 1, default 0.8. How far the tile departs from the input image.

mask

Black-and-white image marking regions to regenerate (white) versus keep (black). Requires an input image.

tilingMode

One of both (default), horizontal, or vertical. Which axes repeat seamlessly.

imageSize

Default square_hd. Six sizes available; portrait and landscape for directional patterns.


Use Cases

  • Marketing and brand graphics (V4 base). Posters, lookbook covers, social key art, packaging, signage, UI mockups. Wherever the deliverable carries readable typography baked into the image.

  • Multilingual typography (V4 base). Japanese calligraphy, Arabic, Cyrillic, Korean, Devanagari, chrome movie titles. The text engine handles non-Latin scripts at QUALITY without falling back to placeholder glyphs.

  • Editorial restyles and campaign variants (Image to Image). Take one product or character shot and ship multiple stylistic versions for a campaign: editorial pencil, sumi-e, stained glass, risograph.

  • Concept art and game character mood passes (Image to Image). Generate alternate medium versions of a finalized character for marketing or pitch decks.

  • Marketing collateral with text overlays (Image to Image). Write the headline INTO the restyle rather than pasting on top. One-shot social posts, postcards, lookbook covers.

  • Surface design and wallpaper (Tiling). Seamless wallpaper, gift wrap, packaging surfaces. Ships at native 2K.

  • Fabric and apparel prints (Tiling). Otomi, damask, polka dots, chintz patterns tile cleanly for textile production.

  • Game-ready ground and material textures (Tiling). Drop into Unity, Unreal, or Blender as a tiling material. The cyberpunk circuit and honeycomb brass examples above repeat without seam artifacts at any UV scale.

  • Interior and architectural visualization (Tiling). Marble, brick, wood plank tiles slot directly into a 3D scene's UV-tiled materials.


Tips for Better Results

  1. Quote literal text for in-image rendering. Use double quotes around the words you want rendered: "VAN LIFE" beats "the words van life". Works in all three variants.

  2. Disable Magic Prompt for spelling-critical text. Set expansionModel: None when you need the model to obey your wording exactly, especially for brand names, dates, addresses, and prices.

  3. Start at Balanced, finalize at Quality. Iterate quickly with renderingSpeed: BALANCED, then re-render the chosen seed at QUALITY for delivery. Quality matters most for intricate typography and complex patterns.

  4. Image to Image: start at strength: 0.7 and adjust. 0.7 is the sweet spot for a clear stylistic shift that still preserves the source. Push higher for medium swaps; pull lower if the result drifts from the subject.

  5. Name the medium concretely, not the "style". "Risograph print, two-color overprint of fluorescent pink and electric blue" outperforms "stylized print look". The more material the prompt names (paper, ink, brush, print process), the more committed the restyle.

  6. Tiling: match tilingMode to the pattern's logic. Marble veins should not loop vertically; wood planks should not loop horizontally. Pick the mode that matches real-world repetition.

  7. Tiling: default to square_hd unless the pattern has a direction. Square is the most flexible. Reach for landscape or portrait when there is an obvious primary axis.

  8. Lock a seed when iterating colors or text. Reuse the seed and change only the color words or the quoted text to keep the composition or pattern structure constant while swapping detail.

  9. Tiling: test the tile by repeating it. Before shipping, paint the tile as a CSS background or load it into a 3D viewer at the intended UV scale. Seams hidden in a single tile become obvious across a four-by-four repeat.

  10. Keep acceleration at none for dense typography. Higher acceleration shaves time but can soften letterforms.


Known Limitations

  • Long text passages degrade. All three variants excel at short labels, headlines, and brand words. Multi-sentence paragraphs still produce errors. Keep on-image type concise.

  • Image to Image preserves composition, not fine geometry. At high strength the model can drift on small details (number of fingers, exact letter shapes in long passages, jewelry detail). Drop strength or use QUALITY when those details matter.

  • Tiling guarantees edges, not perfect symmetry. The edges line up by design, but the pattern within the tile is not always perfectly symmetrical. For strict geometric symmetry, prompt explicitly for it.

  • Plan access restrictions apply. Some workspaces may not have access to the full V4 family.