each::sense is live
Eachlabs | AI Workflows for app builders
wan-2-5-preview-image-to-image

WAN-2.5

Wan 2.5 Preview Image to Image transforms an input photo into a new, high-quality image while preserving the main structure and enhancing details with realistic style.

Avg Run Time: 75.000s

Model Slug: wan-2-5-preview-image-to-image

Playground

Input

Output

Example Result

Preview and download your result.

Preview
Unsupported conditions - pricing not available for this input format

API & SDK

Create a Prediction

Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.

Get Prediction Result

Poll the prediction endpoint with the prediction ID until the result is ready. The API uses long-polling, so you'll need to repeatedly check until you receive a success status.

Readme

Table of Contents
Overview
Technical Specifications
Key Considerations
Tips & Tricks
Capabilities
What Can I Use It For?
Things to Be Aware Of
Limitations

Overview

wan-2-5-preview-image-to-image — Image-to-Image AI Model

Developed by Alibaba as part of the wan-2.5 family, wan-2-5-preview-image-to-image transforms input photos into enhanced, high-quality images while preserving core structure and adding realistic details. This image-to-image AI model excels in single-image editing and multi-image fusion, making it ideal for developers seeking an Alibaba image-to-image solution for precise visual refinements. Unlike generic editors, it maintains subject fidelity and layout integrity, delivering outputs in PNG format at resolutions up to 1080P for professional-grade results.

Technical Specifications

What Sets wan-2-5-preview-image-to-image Apart

The wan-2-5-preview-image-to-image model stands out in the competitive landscape of image-to-image AI models with its support for multi-image fusion, allowing seamless blending of multiple references into a cohesive output. This enables users to combine elements from several photos while preserving overall composition, a feature refined in Alibaba's Wan 2.5 architecture for superior detail enhancement.

It supports high-resolution outputs at 480P, 720P, or 1080P in PNG format, optimized for fast inference in API workflows. Developers benefit from quick processing times, ideal for scalable AI image editor API integrations without compromising quality.

  • Multi-image fusion capability: Merges inputs like product shots with background references, creating composites that retain structural accuracy for e-commerce visuals.
  • Structure-preserving edits: Enhances details realistically while keeping the main subject and layout intact, reducing artifacts common in other models.
  • High-resolution PNG outputs: Delivers up to 1080P images, supporting aspect ratios suitable for web and print without post-processing.

Key Considerations

  • Ensure input images are within recommended resolution and format specifications to avoid processing errors
  • For best results, use detailed and well-structured prompts that clearly describe desired enhancements or style changes
  • Negative prompts can be used to exclude unwanted features or artifacts
  • Quality and speed may vary depending on image complexity and hardware resources; higher resolutions may require more processing time
  • Iterative refinement with prompt adjustments can significantly improve output quality
  • Avoid using images with transparency (alpha channels), as these are not supported

Tips & Tricks

How to Use wan-2-5-preview-image-to-image on Eachlabs

Access wan-2-5-preview-image-to-image through Eachlabs' Playground for instant testing with image uploads and text prompts, or integrate via the wan-2-5-preview-image-to-image API and SDKs for production apps. Provide an input image (JPG, PNG), optional multi-images for fusion, and a descriptive prompt specifying enhancements; receive high-quality PNG outputs up to 1080P in seconds.

---

Capabilities

  • Transforms input photos into high-quality, realistic images while preserving core structure
  • Enhances fine details and textures for photorealistic results
  • Supports nuanced style transfer based on prompt instructions
  • Handles a wide range of input formats and resolutions
  • Optimized for efficient GPU usage and large-scale workflows
  • Capable of batch processing for multiple images
  • Robust against common image generation artifacts when negative prompts are used

What Can I Use It For?

Use Cases for wan-2-5-preview-image-to-image

E-commerce developers can use wan-2-5-preview-image-to-image to automate product photo enhancements, feeding an original item image plus references for "replace background with luxury studio lighting, add realistic shadows." This image to image AI model fuses elements to produce polished listings ready for online stores, saving hours on manual edits.

Graphic designers leverage its multi-image fusion for concept art, combining character sketches with environment photos to generate "integrate this figure into a cyberpunk cityscape at dusk, enhance metallic textures." The result maintains pose and proportions, streamlining iterative design workflows.

Marketers building AI photo editing for e-commerce campaigns transform lifestyle shots by inputting user-generated content and prompts like "enhance clothing details, apply golden hour lighting while preserving fabric folds." This ensures brand-consistent visuals without expensive reshoots.

Content creators experiment with style transfers, uploading portraits and specifying scene changes to create artistic variations that honor original facial structures for social media assets.

Things to Be Aware Of

  • Some experimental features, such as prompt expansion, may yield unexpected results and should be reviewed for consistency
  • Users report that memory usage can be significant for high-resolution or batch workflows; optimized VRAM management is recommended
  • Occasional edge cases include minor artifacts or loss of detail in complex scenes, especially with ambiguous prompts
  • Consistency across outputs is generally high, but random seed variation can affect reproducibility
  • Positive feedback highlights the model’s photorealism, structure preservation, and versatility
  • Common concerns include processing time for very large images and the need for careful prompt engineering to avoid unwanted artifacts

Limitations

  • Requires substantial GPU resources for high-resolution or batch processing
  • May struggle with highly abstract or ambiguous prompts, leading to less predictable results
  • Not optimal for images with transparency or non-standard formats

Pricing

Pricing Type: Dynamic

Charge $0.05 per image generation

Pricing Rules

ParameterRule TypeBase Price
num_images
Per Unit
Example: num_images: 1 × $0.05 = $0.05
$0.05