Example inputhover

seed: 1
prompt: "Using the reference image as the environment and composition: a cinematic and realistic video of a ferry moving slowly across a calm sea during sunset. The camera gently approaches the ferry from a distance with a smooth forward motion (slow dolly-in), keeping the movement natural and steady. The ferry is not fast, moving at a realistic speed, creating soft waves and a subtle wake behind it. Seagulls fly in and out of the frame, some passing close to the camera. The water surface has small natural ripples reflecting the warm orange and pink sky. The sunlight gradually softens toward early evening. No exaggerated motion, no fast cuts. Ultra realistic, cinematic lighting, smooth motion, 4K, immersive and calm atmosphere."
quality: "720p"
duration: "10"
image_url
resolution: "720p"
thinking_type: "auto"
generate_audio_switch: true

PixVerse V6 Image to Video API

Name: PixVerse V6 Image to Video
Brand: PixVerse
Availability: InStock

Video·PixVerse V6·by Pixverse

PixVerse V6 animates a still image into a cinematic video — up to 1080p, 1 to 15 seconds, with synchronized audio and physically accurate motion. Supports single or multi-clip storytelling and prompt-reasoning enhancement.

Try it now →

API reference

Runtime (p50): 50s
Estimated price: $0.005 / credit

Call the API

prediction.sh

curl -X POST \
  -H "X-API-Key: $EACHLABS_API_KEY" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "pixverse-v6-image-to-video",
    "version": "0.0.1",
    "input": {
        "seed": 1,
        "prompt": "Using the reference image as the environment and composition: a cinematic and realistic video of a ferry moving slowly across a calm sea during sunset. The camera gently approaches the ferry from a distance with a smooth forward motion (slow dolly-in), keeping the movement natural and steady. The ferry is not fast, moving at a realistic speed, creating soft waves and a subtle wake behind it. Seagulls fly in and out of the frame, some passing close to the camera. The water surface has small natural ripples reflecting the warm orange and pink sky. The sunlight gradually softens toward early evening. No exaggerated motion, no fast cuts. Ultra realistic, cinematic lighting, smooth motion, 4K, immersive and calm atmosphere.",
        "quality": "720p",
        "duration": "10",
        "image_url": "https://cdn-us.eachlabs.ai/uploads/d62585f2-be3c-414d-b1ff-262a11f6c68c.png",
        "resolution": "720p",
        "thinking_type": "auto",
        "generate_audio_switch": true
    },
    "webhook_url": ""
}' \
  https://api.eachlabs.ai/v1/prediction/

Documentation8 sections

Overview
PixVerse | V6 | Image to Video Overview

PixVerse | V6 | Image to Video from Pixverse transforms static images into smooth, cinematic videos up to 1080p resolution and 15 seconds long, solving the challenge of adding realistic motion and storytelling to still visuals without complex editing. This model excels in first-frame-to-last-frame transitions, taking one or two images plus a text prompt to generate fluid animations with physically accurate movement. Provided by Pixverse, a leader in AI video generation, PixVerse | V6 | Image to Video stands out with its prompt optimization mode, stylistic versatility like anime or 3D, and native audio synchronization for professional results. Available via each::labs at eachlabs.ai, it empowers creators to produce high-fidelity content for social media, films, and marketing directly from uploads.
Capabilities
Capabilities
- Generates smooth cinematic transitions from one or two input images with text-guided motion
- Supports up to 1080p resolution and 1-15 second durations with customizable pacing
- Applies diverse styles including anime, 3D, clay, comic, and cyberpunk for stylistic versatility
- Includes native audio generation synchronized to video motion and lip movements
- Features prompt reasoning enhancement via optimization mode for better prompt adherence
- Handles advanced effects like bullet time, time-lapse, and realistic physics in animations
- Offers flexible aspect ratios from ultrawide 21:9 to vertical 9:16 for any platform
- Enables multi-clip storytelling through reference-guided consistency
Use cases
Use Cases for PixVerse | V6 | Image to Video

Content Creators: Animate product photos into engaging reels—upload a still of a gadget and prompt "device rotates 360 degrees with glowing effects, cyberpunk style, 8 seconds" for viral social clips leveraging style versatility.

Marketers: Build brand stories with transitions; start with a logo image, end with a team photo, prompt "logo morphs into diverse team walking forward, realistic, 10 seconds with upbeat audio" using first-to-last-frame capability.

Motion Designers: Prototype film effects like bullet time—input action pose, prompt "surround subject with slow-motion debris explosion, 3D clay style, 5 seconds" for pre-vis with physics accuracy.

Developers: Integrate PixVerse | V6 | Image to Video API on each::labs for apps; generate personalized avatars from user selfies with "lip-sync greeting in anime, 4 seconds" via audio sync and prompt optimization.
Tips & tricks
Tips and Tricks

For best results with PixVerse | V6 | Image to Video, use detailed prompts describing motion, camera angles, and style, like specifying "slow pan right with falling leaves." Enable prompt optimization mode to automatically refine simple inputs into complex instructions, improving adherence and realism. Pair starting and ending images for precise transitions; without an end image, focus prompts on outward animation from the first frame. Optimize parameters by selecting 16:9 for cinematic feels or 9:16 for vertical platforms, and test 3-5 second durations first. Workflow tip: Generate at 720p for previews, then upscale to 1080p.

Example prompts:
- "A serene forest in autumn transitions smoothly to winter as snow falls gently, camera zooms out slowly, realistic style."
- "Single portrait of a woman smiles and turns head left, lip-sync to 'Hello world,' anime style, 5 seconds."
- "Cyberpunk cityscape at dusk, neon lights flicker on as cars drive by, bullet time effect, 10 seconds."
Technical spec
Technical Specifications
- Resolution: Up to 1080p, with options from 360p for drafts
- Duration: Customizable from 1 to 15 seconds
- Aspect Ratios: 16:9 (widescreen), 9:16 (vertical), 1:1 (square), 4:3, 21:9 (ultrawide), 2:3, 3:2, 3:4
- Input: One or two static images plus text prompt; supports image-to-video mode
- Output: MP4 video with optional synchronized audio
- Styles: Realistic, anime, 3D animation, clay, comic book, cyberpunk
- Processing Time: Varies by duration and resolution; suitable for quick iterations
- Prompt Features: Optimization mode (manual, auto, or off) for enhanced reasoning
Things to be aware of
Things to Be Aware Of

PixVerse | V6 | Image to Video performs best with clear, single-subject inputs; crowded scenes or low-res images lead to motion artifacts or inconsistencies. Complex prompts without optimization may cause drift from intent, so review auto-enhanced versions first. Longer 15-second clips at 1080p demand more compute, potentially slowing generation on shared queues at each::labs. Common mistakes include vague motion descriptions—always specify direction, speed, and camera moves. Edge cases like rapid multi-object interactions can appear less physically accurate than simpler animations.
Key considerations
Key Considerations

Before using PixVerse | V6 | Image to Video, ensure your input images are high-quality single-person or clear subjects for optimal motion consistency, as multi-subject scenes may reduce accuracy. It shines in scenarios needing smooth transitions or animations from stills, outperforming basic text-to-video for image-driven control. Processing time increases with higher resolutions like 1080p or longer durations up to 15 seconds, so start with shorter clips for testing on each::labs. Balance cost with output quality—lower resolutions speed up workflows for social media, while full HD suits cinematic needs. Access via PixVerse | V6 | Image to Video API on each::labs enables scalable integration for developers.
Limitations
Limitations

PixVerse | V6 | Image to Video struggles with highly complex multi-character scenes or extreme deformations, often prioritizing single subjects for consistency. It cannot extend videos beyond 15 seconds in one generation and may flicker in very fast motions without reference images. Input images must maintain aspect ratios to avoid cropping issues, and audio sync works best for short dialogues. No support for direct video inputs in this image-to-video mode.