PixVerse v4.5 · Text to Video

Video·pixverse-v4.5·by Pixverse

Turn your imagined scenes into text with pixverse v4 5 text to video; obtain cinema quality video outputs with enhanced lighting and texture details.

Runtime (p50)
45s
Estimated price
$0.00627 / credit
Call the API
prediction.sh
sh
curl -X POST \
  -H "X-API-Key: $EACHLABS_API_KEY" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "pixverse-v4-5-text-to-video",
    "version": "0.0.1",
    "input": {
        "aspect_ratio": "16:9",
        "duration": 5,
        "motion_mode": "normal",
        "prompt": "The scenery outside the car moves rapidly, and the characters look out curiously",
        "quality": "540p",
        "sound_effect_switch": true,
        "lip_sync_switch": false
    },
    "webhook_url": ""
}' \
  https://api.eachlabs.ai/v1/prediction/
Documentation8 sections
  • Overview

    pixverse-v4-5-text-to-video — Text to Video AI Model

    Developed by Pixverse as part of the pixverse-v4.5 family, pixverse-v4-5-text-to-video transforms detailed text prompts into high-quality cinematic videos with enhanced lighting, texture details, and realistic motion, solving the challenge of creating professional short-form content without expensive production setups. This text-to-video AI model stands out for its fast generation speeds and support for resolutions up to 1080p, delivering cinema-quality outputs ideal for creators seeking Pixverse text-to-video capabilities. Users praise its ability to handle complex scenes with strong prompt adherence, making it a go-to for rapid video prototyping in marketing and social media workflows.

  • Capabilities

    Converts detailed text descriptions into short video clips.

    Supports multiple visual styles to fit varied creative directions.

    Allows customization of video format and resolution.

    Provides motion controls for dynamic video pacing.

    Facilitates reproducibility with seed-based output control.

    Optimizes output for social media, marketing previews, and content generation.

  • Use cases

    Use Cases for pixverse-v4-5-text-to-video

    Content creators producing social media reels can input prompts like "A sleek sports car speeding through neon-lit city streets at night, dynamic camera pan, cyberpunk style" to generate 8-second 1080p clips with fluid motion and glowing textures, streamlining viral video production without filming equipment.

    Marketers building promotional assets use the model's fast modes for quick tests, turning product descriptions into animated demos that highlight lighting and details, saving time on e-commerce video campaigns with Pixverse text-to-video efficiency.

    Developers integrating pixverse-v4-5-text-to-video API into apps create custom short videos for user-generated content platforms, leveraging resolution options like 720p for balanced quality and speed in real-time previews.

    Filmmakers prototyping scenes rely on its motion consistency for storyboarding, inputting detailed scene prompts to visualize complex actions like "waves crashing on rocky cliffs during sunset, slow-motion foam details," enabling rapid refinements before full production.

  • Tips & tricks

    How to Use pixverse-v4-5-text-to-video on Eachlabs

    Access pixverse-v4-5-text-to-video seamlessly through Eachlabs Playground for instant testing, API for scalable integrations, or SDK for custom apps—simply provide a detailed text prompt describing scene, motion, lighting, and style, select resolution (360p-1080p), aspect ratio, and duration up to 8 seconds. Expect MP4 outputs with cinema-quality visuals in 30-120 seconds, optimized for professional short videos without audio.

    ---
  • Technical spec

    What Sets pixverse-v4-5-text-to-video Apart

    The pixverse-v4-5-text-to-video model excels with diffusion-based architecture optimized for temporal stability, producing videos with superior motion consistency compared to earlier versions. This enables seamless animations from static text descriptions, reducing artifacts in dynamic scenes for more professional results.

    It supports versatile resolutions from 360p to 1080p in Normal and Fast modes, with aspect ratios like 16:9 and 9:16, and max durations around 8 seconds—ideal for high-volume pixverse-v4-5-text-to-video API integrations. Higher quality modes like 1080p Normal consume more credits but yield detailed textures and lighting, perfect for polished outputs.

    • Fast processing at 30-120 seconds depending on mode, outperforming many competitors for quick iterations in text-to-video AI model workflows.
    • Strong benchmark performance in visual quality and instruction following, handling detailed prompts for cinematic realism without native audio needs.
    • Multiple quality tiers (Turbo, Normal, Fast) allow balancing speed and fidelity, with 720p Normal as a sweet spot for professional use.
  • Things to be aware of

    Generate the same prompt with different styles to compare visual effects.

    Adjust motion mode to see how speed changes the feel of the video.

    Use negative prompts to filter out unwanted elements such as noise or artifacts.

    Create portrait and landscape versions of the same video for different platforms.

    Fix the seed to create matching videos for multiple uses or users.

    Experiment with lower quality settings for faster results when high fidelity is not critical.

  • Key considerations

    Video length is limited to short clips (5 or 8 seconds), which may not suit long-form content needs.

    Style selection significantly impacts output; some styles like clay or cyberpunk may add unique color palettes and textures.

    Aspect ratio choice affects framing; portrait formats suit mobile screens, while widescreen is better for desktop or presentations.

    Negative prompts should be specific and clear to reduce unwanted visual noise.

    The motion mode impacts perceived speed and smoothness; fast motion may cause less detail clarity.

    Seed control is optional but essential if exact video reproducibility is required.


    Legal Information for PixVerse v4.5 Text to Video

    By using this PixVerse v4.5 Text to Video, you agree to:

    Pixverse Terms Of Service

    Pixverse Privacy Policy

  • Limitations

    Limited to short video durations (5 or 8 seconds).

    Not suitable for detailed or complex long-form videos.

    Certain styles may introduce visual noise or reduce clarity.

    High-quality videos require longer processing times.

    Motion modes trade-off between smoothness and speed, affecting detail.

    Seed control does not guarantee identical output if underlying model updates.


    Output Format: MP4

Related models

4 models
* FAQ

About PixVerse v4.5 · Text to Video

01 / 03

What is PixVerse v4.5 text-to-video and how does it generate video from prompts?

PixVerse v4.5 text-to-video is PixVerse's fourth-generation plus text-to-video model that generates short video clips from natural language descriptions. It produces coherent motion video with solid scene accuracy and temporal consistency, serving as the established baseline version prior to the v5 and v5.5 releases in the PixVerse model lineage.