VIDU-2.0
Vidu 2.0 Start End to Video generates a natural video transition from a starting image to an ending image.
Avg Run Time: 40s
Model Slug: vidu-2-0-start-end-to-video
Playground
Input
Start Image: enter a URL or choose a file from your computer — png, jpeg, jpg, webp (Max 50MB)
End Image: enter a URL or choose a file from your computer — png, jpeg, jpg, webp (Max 50MB)
Output
Example Result
Preview and download your result.
API & SDK
Create a Prediction
Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.
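The creation step can be sketched in Python. Note that the endpoint URL, header name, and input field names below are assumptions for illustration — check the official Eachlabs API reference for the exact schema.

```python
import json
import urllib.request

API_KEY = "YOUR_API_KEY"  # placeholder: use your real Eachlabs API key
CREATE_URL = "https://api.eachlabs.ai/v1/prediction/"  # assumed endpoint path


def build_request(start_url: str, end_url: str, duration: int = 4,
                  resolution: str = "720p") -> dict:
    """Assemble the JSON body for a start-end-to-video prediction.

    Field names here ("start_image", "end_image", etc.) are illustrative
    assumptions, not the confirmed API schema.
    """
    return {
        "model": "vidu-2-0-start-end-to-video",
        "input": {
            "start_image": start_url,
            "end_image": end_url,
            "duration": duration,
            "resolution": resolution,
        },
    }


def create_prediction(body: dict) -> dict:
    """POST the request body and return the parsed JSON response,
    which should contain the prediction ID used for polling."""
    req = urllib.request.Request(
        CREATE_URL,
        data=json.dumps(body).encode(),
        headers={"X-API-Key": API_KEY, "Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

In practice you would call `create_prediction(build_request(start, end))` and keep the returned prediction ID for the polling step.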
Get Prediction Result
Poll the prediction endpoint with the prediction ID until the result is ready. The API uses long-polling, so you'll need to repeatedly check until you receive a success status.
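A minimal polling loop might look like the following. The status strings and response shape are assumptions; the `fetch` callable stands in for whatever GET request your client makes against the prediction endpoint.

```python
import time


def poll(prediction_id: str, fetch, interval: float = 2.0,
         timeout: float = 300.0) -> dict:
    """Call fetch(prediction_id) repeatedly until a terminal status.

    `fetch` is any callable returning the prediction's current state as a
    dict; the "success"/"error" status values are illustrative assumptions.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        result = fetch(prediction_id)
        if result.get("status") in ("success", "error"):
            return result
        time.sleep(interval)  # wait between checks to avoid hammering the API
    raise TimeoutError("prediction did not finish before the timeout")
```

Passing `fetch` as a parameter keeps the loop testable and independent of any particular HTTP client.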
Readme
Overview
vidu-2-0-start-end-to-video — Image-to-Video AI Model
Developed by Vidu as part of the vidu-2.0 family, vidu-2-0-start-end-to-video generates natural video transitions from a starting image to an ending image, solving the challenge of creating seamless motion between two static visuals. This image-to-video AI model produces fluid animations that bridge key frames, which is ideal for creators who need precise control over a video's beginning and end without manual editing. It stands out in a competitive landscape by leveraging the temporal coherence of the vidu-2.0 architecture, enabling smooth morphing effects at up to 1080p resolution that maintain visual consistency across transitions.
Users searching for "Vidu image-to-video" or "image to video AI model" will find vidu-2-0-start-end-to-video delivers production-ready clips, typically processing in minutes via API, with support for aspect ratios suited to social media and ads.
Technical Specifications
What Sets vidu-2-0-start-end-to-video Apart
vidu-2-0-start-end-to-video differentiates itself through precise start-end frame control, generating videos up to 16 seconds long in native 1080p resolution with exceptional temporal smoothness, unlike many image-to-video models that are limited to shorter, stitched clips. Users can define exact visual endpoints, preserving narrative flow without artifacts and enabling professional transitions for storytelling.
Building on Vidu's physics-aware reasoning, it handles complex motion between start and end images, maintaining stability in multi-subject scenes during the transition. Developers benefit from this for automated workflows, producing coherent outputs faster than competitors requiring post-processing.
Key technical specs include 1080p output, up to 16 seconds duration, support for realistic and anime-style inputs, and API parameters such as movement amplitude and aspect ratio. For "Vidu image-to-video API" users, jobs run asynchronously and typically complete within minutes, with high fidelity in camera-like pans derived from the vidu-2.0 family.
- Start-end image bridging for natural morphing, reducing flicker in dynamic scenes.
- 1080p native resolution with physics stability for complex interactions.
- Extended 16s clips from dual images, perfect for "AI video transition generator" needs.
Key Considerations
- Ensure input images are of similar aspect ratio and resolution for optimal transition quality
- Best results are achieved with clear, well-lit images that share some visual or thematic elements
- Avoid using highly dissimilar images (e.g., drastically different colors or subjects), as this may result in unnatural transitions
- Quality improves with higher resolution inputs, but this may increase processing time and resource requirements
- Experiment with transition duration to balance smoothness and speed; longer durations yield more gradual transitions
- Prompt engineering is less relevant, but careful selection and preprocessing of input images is critical
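The first consideration above — matching aspect ratios between the start and end images — is easy to check before submitting a job. This is a small preflight sketch, not part of the Eachlabs API; the 5% tolerance is an arbitrary assumption.

```python
def aspect_ratios_match(start_size: tuple, end_size: tuple,
                        tolerance: float = 0.05) -> bool:
    """Return True if two (width, height) pairs have aspect ratios
    within `tolerance` relative difference of each other.

    The 5% default tolerance is an illustrative choice, not a documented
    requirement of the model.
    """
    r1 = start_size[0] / start_size[1]
    r2 = end_size[0] / end_size[1]
    return abs(r1 - r2) / max(r1, r2) <= tolerance
```

For example, a 1920x1080 start image and a 1280x720 end image both have a 16:9 ratio and pass the check, while a portrait/landscape mismatch fails.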
Tips & Tricks
How to Use vidu-2-0-start-end-to-video on Eachlabs
Access vidu-2-0-start-end-to-video through Eachlabs Playground for instant testing—upload start and end images, add a descriptive prompt, set duration up to 16 seconds, resolution to 1080p, and aspect ratio. Integrate via Eachlabs API or SDK for batch jobs, polling task IDs for high-quality MP4 outputs with natural transitions. Key inputs include clear images and motion prompts for optimal results.
Capabilities
- Generates smooth, natural video transitions between two images
- Supports multiple output resolutions for flexible content creation
- Maintains high visual fidelity and temporal coherence throughout the video
- Adaptable to a wide range of image subjects, including portraits, landscapes, and abstract art
- Robust workflow integration for creative and professional use cases
- Delivers consistent quality across various input types
What Can I Use It For?
Use Cases for vidu-2-0-start-end-to-video
Content creators building short films can upload a static scene as the start image—like a character standing still—and an end image of them in motion, using vidu-2-0-start-end-to-video to generate a seamless walking transition with realistic physics, saving hours of keyframe animation.
Marketers for e-commerce "image to video AI model" applications feed product photos: start with a plain white background shot and end with the item on a lifestyle table, producing a 10-second reveal clip in 1080p that showcases details fluidly for ads.
Developers integrating "Vidu image-to-video API" into apps provide prompts like "Transition from a serene mountain start image to a sunset peak end image with gentle camera pan and wind audio sync," leveraging the model's multi-shot awareness for interactive demos or training videos.
Designers crafting social media reels use it for before-after visuals, such as start image of a sketch and end of polished render, creating engaging evolutions with smooth motion for portfolios or client previews.
Things to Be Aware Of
- Some users report experimental features, such as variable transition speeds and advanced blending modes, that may not be fully stable
- Known quirks include occasional artifacts when input images are highly dissimilar or poorly aligned
- Performance benchmarks highlight efficient processing for short videos, but longer transitions may require significant memory and compute resources
- Consistency is generally high, but edge cases with complex images can lead to less predictable results
- Positive feedback centers on visual quality, ease of use, and versatility
- Common concerns include resource requirements for high-resolution outputs and occasional lack of control over fine-grained transition details
Limitations
- Limited control over specific transition effects beyond duration and resolution
- May produce unnatural results with highly divergent or poorly matched input images
- Resource-intensive for high-resolution or long-duration video generation
Pricing
Pricing Type: Dynamic
Conditions
| Sequence | Resolution | Duration (s) | Price |
|---|---|---|---|
| 1 | 720p | 4 | $0.20 |
| 2 | 1080p | 4 | $0.50 |
| 3 | 720p | 8 | $0.50 |
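The pricing table can be expressed as a simple lookup. Only the listed combinations have a published price, so this sketch raises for anything else rather than guessing.

```python
# Prices copied from the Conditions table above (USD per generation).
PRICES = {
    ("720p", 4): 0.20,
    ("1080p", 4): 0.50,
    ("720p", 8): 0.50,
}


def price_usd(resolution: str, duration_s: int) -> float:
    """Return the listed price for a resolution/duration combination.

    Raises ValueError for combinations without a published price.
    """
    try:
        return PRICES[(resolution, duration_s)]
    except KeyError:
        raise ValueError(f"no listed price for {resolution}/{duration_s}s")
```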
Related AI Models
You can seamlessly integrate advanced AI capabilities into your applications without the hassle of managing complex infrastructure.
