What can I do with Luma Ray 3.2 Image-to-Video?

You can animate product shots, portraits, illustrations, and campaign visuals for marketing, social media, and creative production. Because it starts from an existing image, it is useful when you want to add motion without losing the original visual direction.

How is image-to-video different from text-to-video in Luma Ray 3.2?

Image-to-video starts with an image you provide, while text-to-video creates the entire clip from a written prompt. Luma Ray 3.2 Image-to-Video is the better choice when you already have a visual asset and want to bring it to life with controlled motion.

Example inputhover

prompt: "The father and son sit by the campfire roasting marshmallows over the flames — the fire flickers warmly, small sparks drift up into the night, and the marshmallows slowly toast over the heat. The father turns his head to look lovingly at his son and gently ruffles the boy's hair with his hand; the boy laughs happily and looks up at his father. Keep both faces and features fully preserved and unchanged. Warm flickering firelight on their faces, soft natural movement, intimate and heartwarming mood. Realistic gentle motion, slight handheld camera, no fast or exaggerated movement, no music."
image_url
aspect_ratio: "16:9"
resolution: "720p"
duration: "5s"
reference_image_urls: []

Luma Ray 3.2 API

Video·luma-ray·by Luma

Luma Ray 3.2 Image-to-Video animates still images into cinematic clips using text prompts, while preserving the look and structure of the original frame.

Try it now →

API reference

Runtime (p50): 1m
Estimated price: From $0.5

Call the API

prediction.sh

curl -X POST \
  -H "X-API-Key: $EACHLABS_API_KEY" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "luma-ray-v3-2-image-to-video",
    "version": "0.0.1",
    "input": {
        "prompt": "The father and son sit by the campfire roasting marshmallows over the flames — the fire flickers warmly, small sparks drift up into the night, and the marshmallows slowly toast over the heat. The father turns his head to look lovingly at his son and gently ruffles the boy's hair with his hand; the boy laughs happily and looks up at his father. Keep both faces and features fully preserved and unchanged. Warm flickering firelight on their faces, soft natural movement, intimate and heartwarming mood. Realistic gentle motion, slight handheld camera, no fast or exaggerated movement, no music.",
        "image_url": "https://cdn-us.eachlabs.ai/defaults/1d102faef43a4c059de848086d0bbc41.png",
        "aspect_ratio": "16:9",
        "resolution": "720p",
        "duration": "5s",
        "reference_image_urls": []
    },
    "webhook_url": ""
}' \
  https://api.eachlabs.ai/v1/prediction/

Documentation8 sections

Overview
Luma | Ray 3.2 | Image to Video Overview

Luma | Ray 3.2 | Image to Video is an advanced image-to-video generation model from Luma, built on the Ray 3.2 family of video foundation models. It transforms a single still frame into a smooth, cinematic clip while preserving the original image’s composition, style, and fine details. This preservation-first approach, combined with strong camera-motion and 3D-awareness, is the primary differentiator of Luma image-to-video compared to many generic motion filters. Creators can start from a photo, concept art, or rendered frame, add a guiding text prompt, and produce short videos that feel like native footage rather than simple pan-and-zoom effects. Integrated through each::labs, Luma | Ray 3.2 | Image to Video gives developers and creative teams reliable, production-ready image-to-video capabilities inside their own workflows.
Capabilities
Capabilities
- Transforms a single still image into a short, coherent video while preserving the image’s original composition and style.
- Generates cinematic camera movements such as pans, tilts, push-ins, and orbits that feel physically plausible and 3D-aware.
- Supports prompt-guided motion, allowing users to describe how elements should move or how the virtual camera should behave.
- Maintains fine-grained texture and lighting detail from high-quality source images, reducing the “overstylized” look common in some image-to-video pipelines.
- Handles a wide range of visual styles—from photorealistic photography to digital art and 3D renders—without collapsing the original aesthetic.
- Produces clips suitable for social posts, marketing snippets, title sequences, and in-product animations with minimal manual keyframing.
- Integrates through the Luma | Ray 3.2 | Image to Video API on each::labs, enabling programmatic generation, batching, and automation.
Use cases
Use Cases for Luma | Ray 3.2 | Image to Video

Content creators and filmmakers. Take a concept frame, storyboard panel, or key art and use Luma | Ray 3.2 | Image to Video to explore motion ideas: “Slow cinematic push-in on the hero, dust particles floating, warm sunset glow.” The model’s camera-motion control makes it ideal for previz and animatics.

Marketing and social teams. Turn product photos or campaign visuals into dynamic teasers without reshoots. A prompt like “Subtle orbit around the product, shimmering reflections, clean studio lighting, loopable motion” uses the model’s detail retention for high-end brand visuals.

Designers and illustrators. Animate static illustrations or key art to add life to landing pages and UI. For example: “Gentle parallax of background mountains, clouds drifting slowly, soft zoom out.” The structure-preserving behavior keeps the original layout intact.

Developers and product teams. Integrate the Luma | Ray 3.2 | Image to Video API into creative tools, enabling users to upload a frame and get instant motion previews or auto-generated hero videos from static assets.
Tips & tricks
Tips and Tricks

To get the most from Luma | Ray 3.2 | Image to Video, write prompts that describe motion and mood, not just objects. Reference camera moves explicitly (for example, “slow dolly forward” or “orbiting camera”) so the model knows how to traverse the scene. Keep prompts focused and concise; too many competing ideas can cause unstable motion or style drift away from the original frame. Where the interface allows, choose shorter durations for complex scenes to maintain quality.

Good example prompts for this model include:
- “Slow cinematic push-in through soft fog, gentle breathing motion, ultra-realistic lighting, 24fps film look.”
- “Orbiting camera around the character, subtle hair motion in the wind, dramatic sunset sky, moody color grading.”
- “Handheld camera with slight shake, neon reflections shimmering in puddles, cyberpunk street at night.”
In a workflow, iterate by reusing the same base image and adjusting only the motion-related parts of the prompt to fine-tune the final clip.
Technical spec
Technical Specifications
- Model family: Luma Ray 3.2 image-to-video model, derived from Luma’s video foundation architecture.
- Input: Single still image plus optional text prompt and basic motion/length parameters (where exposed via the Luma | Ray 3.2 | Image to Video API).
- Output: Short video clip (commonly a few seconds), designed for social and editorial use; standard web video formats such as MP4 are typically supported by platforms integrating the model.
- Resolution: High-resolution generation aimed at modern social/video standards, with strong detail retention from the source image; exact pixel limits depend on the hosting platform.
- Aspect ratios: Supports multiple aspect ratios (e.g., landscape, portrait, square) based on API or platform configuration.
- Processing time: Optimized for near real-time or short-wait generation on modern GPU infrastructure, suitable for interactive creative workflows.
- Architecture details: Uses a diffusion-style, 3D-aware video generation backbone tuned for consistent motion and structural faithfulness to the input frame.
Things to be aware of
Things to Be Aware Of

Highly complex scenes with many overlapping moving parts can be harder for Luma | Ray 3.2 | Image to Video to animate cleanly, sometimes leading to subtle warping or temporal flicker. Extremely long prompts or conflicting instructions can reduce motion coherence and may push the output away from the original image style. Very low-resolution or heavily compressed inputs can limit the model’s ability to preserve detail once animated. When using the Luma | Ray 3.2 | Image to Video API through each::labs, keep an eye on resolution and duration settings, as pushing both to the maximum can increase latency and cost without always improving perceived quality.
Key considerations
Key Considerations

Luma | Ray 3.2 | Image to Video is designed for cases where you must keep the original look of an image while adding realistic motion. Users should start from reasonably clean, well-lit source images, because artifacts or heavy compression can be amplified once animated. The model works best for cinematic camera moves, environmental motion, and subtle character actions rather than fast, complex choreography. When exposed through the Luma | Ray 3.2 | Image to Video API on each::labs, developers should plan for short clip lengths and moderate resolution to balance cost and latency. For pure text-driven video with no starting frame, a dedicated text-to-video model may be a better fit.
Limitations
Limitations

Luma | Ray 3.2 | Image to Video cannot invent full, long-form narratives from a single frame; it is optimized for short, visually consistent clips. It may struggle with extreme, fast-paced motion or rapid character animation, where limbs or small objects need frame-perfect consistency. The model expects a single static input image, not full video or multi-image sequences, and it does not guarantee perfectly loopable outputs unless carefully guided. As with most generative systems, fine-grained control over every object’s path or timing is limited, so traditional animation tools may still be required for highly precise motion design.

Related models

4 models

P Video AvatarPruna AI

Alibaba HappyHorse 1.0 · Image to Video AI model preview

Alibaba HappyHorse 1.0 · Image to VideoAlibaba

FFmpeg API · Images to Video AI model preview

FFmpeg API · Images to VideoFfmpeg Api

alibaba-happyhorse-1.1-image-to-video AI model preview

alibaba-happyhorse-1.1-image-to-videoAlibaba

* FAQ

About Luma Ray 3.2 API

01 / 03

What is Luma Ray 3.2 Image-to-Video?

Luma Ray 3.2 Image-to-Video is a model from Luma that turns a static image into a moving video clip. It applies cinematic motion based on your text prompt while keeping the starting image recognizable.

Luma Ray 3.2 API

Luma | Ray 3.2 | Image to Video Overview

Capabilities

Use Cases for Luma | Ray 3.2 | Image to Video

Tips and Tricks

Technical Specifications

Things to Be Aware Of

Key Considerations

Limitations

Related models

About Luma Ray 3.2 API

What is Luma Ray 3.2 Image-to-Video?