How do I use Wan v2.6 Text to Image via API?

Wan v2.6 Text to Image is available through the eachlabs unified API. Provide a text prompt and optional parameters such as style or resolution; the model returns a generated image. eachlabs handles authentication and billing on a pay-as-you-go basis.

What is Wan v2.6 Text to Image best suited for?

Wan v2.6 Text to Image is best suited for visual content creation, product concept visualization, and marketing asset generation. Its strong prompt fidelity makes it effective for workflows requiring precise control over generated image content, including e-commerce, advertising, and creative design.

Wan 2.6 API

Image·wan-v2.6·by Alibaba

Wan 2.6 Text-to-Image is a model that generates high-quality images from text prompts with consistent visual results.

Try it now →

API reference

Runtime (p50): 40s
Estimated price: $0.03 / image

Call the API

prediction.sh

curl -X POST \
  -H "X-API-Key: $EACHLABS_API_KEY" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "wan-v2-6-text-to-image",
    "version": "0.0.1",
    "input": {
        "prompt": "A woman standing in a quiet museum, viewed from a side angle, calmly observing a painting as soft daylight filters through the gallery",
        "negative_prompt": "low resolution, error, worst quality, low quality, deformed",
        "image_size": "square_hd",
        "max_images": 1,
        "enable_safety_checker": true
    },
    "webhook_url": ""
}' \
  https://api.eachlabs.ai/v1/prediction/

Documentation8 sections

Overview
wan-v2.6-text-to-image — Text-to-Image AI Model

Developed by Alibaba as part of the wan-v2.6 family, wan-v2.6-text-to-image generates high-quality, consistent images from text prompts, enabling creators and developers to produce professional visuals without complex setups. This Alibaba text-to-image model stands out in the competitive landscape by leveraging the advanced multimodal architecture of Wan 2.6, delivering photorealistic outputs with superior subject fidelity and style adherence that rival video-grade generation capabilities. Ideal for users seeking a reliable text-to-image AI model for rapid prototyping or e-commerce visuals, it supports high-resolution rendering up to 1080p equivalents in image quality.
Capabilities
- Pure Text-to-Image Generation: Produce images directly from text without any image input.
- Reference-Guided Generation: Use one reference image to guide style or composition.
- Multiple Outputs: Generate up to 5 images per request (actual number may vary).
- Flexible Output Sizes: Predefined presets or custom width/height supported.
- Negative Prompting: Control unwanted visual elements via negative prompt input.
Use cases
Use Cases for wan-v2.6-text-to-image

Content creators building mood boards or concept art can input prompts like "a futuristic cityscape at dusk with neon lights reflecting on wet streets, cyberpunk style, highly detailed" to generate consistent, cinematic images that align perfectly with storytelling needs, leveraging the model's narrative engine for atmosphere control.

Marketers for e-commerce use wan-v2.6-text-to-image to create product visuals by describing scenes such as lifestyle placements, producing photorealistic composites with stable lighting and framing that enhance conversion rates without photography sessions.

Developers integrating AI image APIs rely on its high-resolution 1080p outputs and style fidelity for apps requiring custom assets, like generating user-specific avatars or backgrounds with precise prompt adherence for text-to-image AI model integrations.

Designers prototyping visuals benefit from the model's subject consistency, crafting iterative designs across photorealistic to animated styles, ideal for rapid feedback loops in client projects.
Tips & tricks
How to Use wan-v2.6-text-to-image on Eachlabs

Access wan-v2.6-text-to-image through Eachlabs' Playground for instant testing with text prompts, or integrate via API and SDK using parameters like detailed prompts, resolution (up to 1080p), and optional image references. Generate high-fidelity outputs in seconds, downloadable in standard formats for immediate use in workflows.
---
Technical spec
What Sets wan-v2.6-text-to-image Apart

The wan-v2.6-text-to-image model distinguishes itself through Alibaba's Wan 2.6 innovations, including rebuilt narrative engines for precise prompt interpretation and multi-style adaptability. It excels in generating stable, high-fidelity images that maintain lighting, framing, and subject consistency, enabling seamless transitions to video workflows if needed.
- Advanced prompt understanding for cinematic quality: Handles detailed text descriptions with scene logic and atmosphere control, producing images ready for professional use in text-to-image AI model applications. This allows developers integrating wan-v2.6-text-to-image API to create assets with built-in narrative depth.
- High-resolution support up to 1080p: Outputs sharp, detailed images at 720p or 1080p resolutions, optimized for fast inference without quality loss. Users benefit from production-ready visuals for e-commerce or marketing without post-processing.
- Consistent visual fidelity across styles: Preserves photorealistic details, character stability, and style transfers, setting it apart from generic generators. This capability supports versatile Alibaba text-to-image use in creative pipelines requiring reliability.
Technical specs include JPG/PNG input compatibility for references, MP4-equivalent quality outputs, and processing tuned for quick turnaround, making it a top choice for scalable wan-v2.6-text-to-image API deployments.
Things to be aware of
- .Very abstract or vague prompts may lead to unpredictable outputs.
- Higher resolution requests may require longer processing time.
- Content moderation (safety checker) may modify or restrict output based on prompt.
Key considerations
- Prompt Quality Matters: Detailed and clear prompts improve visual fidelity.
- Reference Images Can Guide Style: Providing an image can help maintain consistent styles or aesthetics.
- Resolution Control: Choose presets or explicit width/height for fixed aspect ratio.
- Safety Checker: Enabled by default to ensure content compliance.
- image_size: square, square_hd, portrait, landscape
- max_images: (minimum 1, maximum 5)
Limitations
- Designed for still image generation (no animation).
- Maximum of up to 5 images per request.
- Reference image guidance supports one image per generation session.
- Prompt length capped at 2000 characters, potentially limiting highly detailed descriptions

Related models

4 models

Recraft v4 Pro · Text to Vector AI model preview

Recraft v4 Pro · Text to Vectorrecraft

Krea 2 Large · Text to Image AI model preview

Krea 2 Large · Text to ImageKrea

Recraft v4.1 · Text to Image AI model preview

Recraft v4.1 · Text to Imagerecraft

Recraft v4 · Text to Vector AI model preview

Recraft v4 · Text to Vectorrecraft

* FAQ

About Wan 2.6 API

01 / 03

What is Wan v2.6 Text to Image?

Wan v2.6 Text to Image is an AI image generation model by Alibaba that creates high-quality visuals from natural language prompts. It produces detailed, photorealistic and artistic images with strong prompt adherence, reflecting Alibaba's multimodal AI research capabilities.

Wan 2.6 API

wan-v2.6-text-to-image — Text-to-Image AI Model

Use Cases for wan-v2.6-text-to-image

How to Use wan-v2.6-text-to-image on Eachlabs

What Sets wan-v2.6-text-to-image Apart

Related models

About Wan 2.6 API

What is Wan v2.6 Text to Image?