Flux Kontext Pro · Multi Image image previewinference · 8.6s

Flux Kontext Pro · Multi Image

Image·flux-kontext·by Black Forest Labs

An experimental model with FLUX Kontext Pro that can combine two input images

Runtime (p50)
15s
Estimated price
$0.04
Call the API
prediction.sh
sh
curl -X POST \
  -H "X-API-Key: $EACHLABS_API_KEY" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "multi-image-kontext-pro",
    "version": "0.0.1",
    "input": {
        "prompt": "Put the woman just to the left of the house",
        "aspect_ratio": "1:1",
        "input_image_1": "https://storage.googleapis.com/magicpoint/inputs/flux-kontext-pro-multi-image-input1.webp",
        "input_image_2": "https://storage.googleapis.com/magicpoint/inputs/flux-kontext-pro-multi-image-input2.webp",
        "output_format": "png",
        "safety_tolerance": 2
    },
    "webhook_url": ""
}' \
  https://api.eachlabs.ai/v1/prediction/
Documentation8 sections
  • Overview

    multi-image-kontext-pro — Image-to-Image AI Model

    Developed by Black Forest Labs as part of the FLUX Kontext family, multi-image-kontext-pro is a professional-grade image-to-image AI model designed for precise, context-aware image editing. Unlike standard image editors, this model combines multiple reference images with natural language instructions to produce photorealistic edits that maintain consistency across complex transformations. Whether you're refining product photography, iterating on character designs, or compositing scenes, multi-image-kontext-pro delivers the control and quality that creators and developers expect from an advanced image-to-image AI model.

    The core strength of multi-image-kontext-pro lies in its ability to understand both visual context and textual intent simultaneously. You provide reference images, describe your desired edit in plain language, and the model interprets both inputs to execute precise, localized changes—eliminating the trial-and-error cycles common with generic image editing tools.

  • Capabilities

    Dual image fusion with accurate context extraction

    Generates images that maintain visual consistency across source images

    Supports detailed prompts and flexible aspect ratios

    Enables image style transfer, character preservation, and layout guidance

    Supports reproducible outputs via seed control

  • Use cases

    Use Cases for multi-image-kontext-pro

    E-commerce product photography. Marketing teams can feed product photos plus a text prompt like "place this product on a marble kitchen counter with morning light and soft shadows" and receive photorealistic composites—eliminating expensive studio reshoot cycles. The multi-image reference capability allows you to maintain consistent product appearance while varying backgrounds and lighting for different campaign needs.

    Character design and animation preparation. Character artists and game developers use multi-image-kontext-pro to iterate on character designs across multiple editing rounds. By providing reference character sheets and describing modifications ("change the character's outfit to cyberpunk style while maintaining facial features and pose"), the model preserves identity consistency—critical for maintaining character continuity across assets.

    Scene composition and visual effects. Filmmakers and concept artists leverage the multi-image reference editing capability to composite complex scenes. You can combine foreground elements from one image with backgrounds from another, then refine with text instructions like "add volumetric fog and adjust lighting to match sunset hour"—all within a single AI-powered workflow rather than jumping between multiple tools.

    Professional image editing API integration. Developers building custom image editing applications can integrate multi-image-kontext-pro via the Eachlabs API to add intelligent, context-aware editing capabilities. The model's support for structured prompts and multiple references makes it ideal for building specialized tools for fashion, real estate, or product design workflows.

  • Tips & tricks

    How to Use multi-image-kontext-pro on Eachlabs

    Access multi-image-kontext-pro through Eachlabs via the interactive Playground or programmatically through the API and SDK. Provide your input images, write a natural language description of the edit you want to perform, and optionally specify resolution and aspect ratio preferences. The model returns high-quality edited images in PNG or JPEG format, ready for immediate use or further refinement. Eachlabs handles all infrastructure, so you can focus on creative iteration without managing hardware or deployment complexity.

    ---END---
  • Technical spec

    What Sets multi-image-kontext-pro Apart

    Multi-image reference control with composition precision. multi-image-kontext-pro accepts multiple input images for style and subject transfer, allowing you to blend references while maintaining exact control over composition, camera angles, and object positioning. This enables complex creative workflows—such as combining a character from one image with a background from another while preserving lighting and perspective—without manual masking or post-processing.

    Character and object consistency across iterative edits. The model maintains character features and object identity across multiple editing steps, enabling you to refine details progressively without losing consistency. This is particularly valuable for character-driven creative work, where maintaining a subject's identity through multiple rounds of editing is essential.

    Context-aware editing with natural language precision. multi-image-kontext-pro understands both image content and text instructions simultaneously, enabling nuanced edits that would be difficult to express in traditional image-to-image workflows. You can specify complex requirements like "change the background to a forest scene while keeping the person in exactly the same position and pose, maintaining the original lighting"—and the model executes precisely what you describe.

    Technical specifications: Supports up to 4MP resolution output, multiple aspect ratios, and accepts both PNG and JPEG input formats. Processing is optimized for efficient inference on capable hardware, making it suitable for both interactive workflows and batch processing through the Eachlabs API.

  • Things to be aware of

    Combine a celebrity face with a fashion photo to create styled character designs

    Use landscape photography + painting as inputs to generate fantasy scenes

    Experiment with 3:2 or 4:5 ratios for editorial-style compositions

    Adjust safety tolerance and compare the visual clarity of different outputs

  • Key considerations

    Overly abstract prompts without clear visual guidance from images may lead to unpredictable outputs.

    Avoid using heavily stylized or inconsistent images together, as Flux Kontext Pro Multi Image might fail to establish visual harmony.

    The safety_tolerance slider can restrict certain visual generations; lowering it too much may remove essential features, while raising it too high may lead to risky or distorted results.

    The seed value, if fixed, can help with reproducibility, but will not guarantee identical results in all scenarios due to stochastic processes.

  • Limitations

    Outputs may struggle with complex prompt logic when reference images conflict

    Some extreme aspect ratios can introduce composition issues

    Flux Kontext Pro Multi Image does not support animation or video generation

    Safety filters may limit creative freedom in certain themes

    Cannot guarantee perfect pose transfer or spatial consistency between inputs

    Output Format: JPG,PNG

Related models

4 models
* FAQ

About Flux Kontext Pro · Multi Image

01 / 03

What is Multi-Image Kontext Pro?

Multi-Image Kontext Pro is an in-context image editing model by Black Forest Labs that accepts multiple reference images and applies instruction-guided modifications with strong visual consistency. It balances high output quality with efficient inference, making it suitable for production-scale multi-image editing workflows.