Who is Seedance 2.0 Mini Text-to-Video for?

It suits high-volume, budget-sensitive creators such as social teams, marketers, and storyboard artists who need quick drafts and many variations. With Seedance 2.0 Mini Text-to-Video, you can turn a single prompt into 720p to 1080p clips in vertical, square, or landscape formats, ready for fast iteration.

What kind of output does Seedance 2.0 Mini Text-to-Video produce?

The model returns a 5 to 12 second video with native audio, including ambient sound and dialogue, at resolutions up to 1080p. Compared with image-to-video, text-to-video from ByteDance gives you full control over the scene through your prompt, since there is no reference image anchoring the look.

Example inputhover

prompt: "Ultra realistic modern cinematic reinterpretation of a 1920s gothic silent horror scene, tall thin vampire-like silhouette slowly climbing staircase, exaggerated shadow stretching along wall, high contrast black and white lighting, dramatic expressionist set design with distorted angles, slow cinematic push in, subtle film grain texture, eerie atmosphere,"
duration: "10"
resolution: "720p"
aspect_ratio: "16:9"
generate_audio: true

ByteDance Seedance 2.0 Mini · Text to Video

Video·seedance-2.0·by Bytedance

Seedance 2.0 Mini Text-to-Video generates short clips from a written prompt, with motion, camera moves, and synced audio. Low-cost AI video on each::labs.

Try it now →

API reference

Runtime (p50): 4m
Estimated price: From $0.06

Call the API

prediction.sh

curl -X POST \
  -H "X-API-Key: $EACHLABS_API_KEY" \
  -H "Content-Type: application/json" \
  --data '{
    "model": "bytedance-seedance-2-0-mini-text-to-video",
    "version": "0.0.1",
    "input": {
        "prompt": "Ultra realistic modern cinematic reinterpretation of a 1920s gothic silent horror scene, tall thin vampire-like silhouette slowly climbing staircase, exaggerated shadow stretching along wall, high contrast black and white lighting, dramatic expressionist set design with distorted angles, slow cinematic push in, subtle film grain texture, eerie atmosphere,",
        "duration": "10",
        "resolution": "720p",
        "aspect_ratio": "16:9",
        "generate_audio": true
    },
    "webhook_url": ""
}' \
  https://api.eachlabs.ai/v1/prediction/

Documentation8 sections

Overview
ByteDance | Seedance 2.0 | Mini | Text to Video Overview

ByteDance | Seedance 2.0 | Mini | Text to Video is a compact text-to-video generation model from ByteDance’s Seedance 2.0 family, designed to turn short text prompts into coherent, cinematic video clips. It focuses on fast, lightweight generation while preserving recognizable motion, scene structure, and style, making it suitable for interactive and high-volume workflows. As part of the Seedance 2.0 line of Bytedance text-to-video systems, the Mini variant is optimized for shorter clips and more efficient inference rather than long-form storytelling. On each::labs, the model exposes controls for prompt, optional reference image or video, aspect ratio, resolution presets, duration, and native audio handling, so teams can quickly prototype rich video content directly from text.
Capabilities
Capabilities
- Generates short cinematic video clips directly from natural-language prompts.
- Supports optional conditioning on reference images or short videos to better control style, subject identity, or scene layout.
- Offers adjustable aspect ratios and resolution presets for horizontal, vertical, or square formats.
- Allows users to set clip duration, enabling quick experimentation with very short or slightly longer sequences within the Mini profile.
- Provides native audio handling, including the ability to include or omit an automatically generated audio track.
- Optimized for relatively fast inference compared with larger Seedance configurations, enabling interactive iteration and higher throughput.
- Produces coherent motion and scene transitions suitable for concept previews, social posts, and marketing snippets.
- Integrates cleanly through the ByteDance | Seedance 2.0 | Mini | Text to Video API on each::labs for programmatic generation at scale.
Use cases
Use Cases for ByteDance | Seedance 2.0 | Mini | Text to Video

Content creators can quickly prototype social-ready clips by pairing detailed prompts with vertical aspect ratios, taking advantage of efficient short-clip generation. For example: “ POV skateboard ride through a seaside boardwalk at sunset, vertical, energetic camera shake.”

Marketers can generate A/B test variations of product teasers using the Bytedance text-to-video capabilities and consistent reference imagery, such as “rotating shot of a minimalist wireless earbud on a reflective black surface, dramatic lighting.”

Designers and art directors can use reference boards and precise style prompts to explore motion directions for campaigns, like “studio-style macro video of ink swirling in water, soft gradients, slow motion.”

Developers can integrate the ByteDance | Seedance 2.0 | Mini | Text to Video API into creative tools or pipelines on each::labs, dynamically generating explainer snippets or UI previews from user-entered descriptions.
Tips & tricks
Tips and Tricks

To get the most from ByteDance | Seedance 2.0 | Mini | Text to Video, treat the prompt like a concise shot description. Specify camera motion, lighting, and style, such as “cinematic,” “handheld,” or “slow dolly.” When using reference media, keep the prompt focused on how the scene should evolve from that reference rather than restating everything in the image or clip. Start with moderate duration and resolution to iterate quickly, then upscale settings once you are confident in the concept. Use aspect ratio presets aligned with your target platform (for example, vertical for mobile-first social content). Example prompts: “cinematic close-up of a cyberpunk city street at night, neon reflections in the rain, slow camera pan”; “hand-drawn animation of a rocket launching above the clouds, soft pastel colors, smooth tracking shot”; “time-lapse of a flower blooming in a sunlit studio, macro lens feel, shallow depth of field.”
Technical spec
Technical Specifications
- Provider / Family: ByteDance Seedance 2.0, Mini variant focused on efficient text-to-video generation.
- Task: Text-to-video generation with optional reference media conditioning.
- Typical duration: Short clips, generally a few seconds in length, optimized for quick previews and social-style outputs.
- Aspect ratios: Common widescreen, vertical, and square aspect ratios selectable via parameters.
- Resolution: Supports multiple resolution presets; higher resolutions may increase latency and compute usage.
- Inputs: Text prompt, optional reference image or video, duration, aspect ratio, resolution, and audio on/off or mode.
- Outputs: Encoded video file with optional native audio track.
- Runtime behavior: Inference speed depends on clip length and resolution, with the Mini profile tuned for relatively fast turnaround suitable for iterative workflows.
Things to be aware of
Things to Be Aware Of

ByteDance | Seedance 2.0 | Mini | Text to Video is tuned for short clips, so attempting to represent complex multi-scene narratives in a single request may yield compressed or confusing results. Very ambiguous or overly long prompts can cause the model to average concepts rather than focus on one clear idea. Motion may appear less stable at extreme aspect ratios or very high resolutions, especially when combined with long durations. Using reference media that is low quality, highly cluttered, or stylistically inconsistent with your prompt can reduce visual coherence. As with most generative systems, results are stochastic, so you may need multiple generations per prompt to find the best output.
Key considerations
Key Considerations

ByteDance | Seedance 2.0 | Mini | Text to Video is best used when you need short, visually engaging clips generated quickly rather than long, polished sequences. For the best results, prompts should clearly describe scene, subject, style, and motion, especially when no reference media is provided. The model’s Mini profile is well suited to prototyping, A/B testing creative concepts, and bulk content generation where speed and cost efficiency matter. Higher resolutions and longer durations will increase compute requirements and latency, so users on each::labs should tune settings to their performance and budget targets.
Limitations
Limitations

The Mini configuration of Seedance 2.0 is not designed for long-form, multi-minute storytelling or frame-perfect control. Fine-grained editing of individual frames, precise timing to external audio, or strict adherence to detailed scripts is limited. Extremely high-detail scenes, dense text overlays, or complex character interactions may appear simplified. Video length and resolution are constrained to keep latency and resource usage practical for interactive work. For advanced control or longer clips, users may need to combine multiple outputs or pair this model with downstream editing tools.

Related models

4 models

Bytedance Seedance 2.0 Text to Video · Fast AI model preview

Bytedance Seedance 2.0 Text to Video · FastBytedance

Veo 3.1 Lite · Text to VideoGoogle

XAI Grok Imagine · Text to Video AI model preview

XAI Grok Imagine · Text to VideoxAI

PixVerse C1 Text to Video AI model preview

PixVerse C1 Text to VideoPixverse

* FAQ

About ByteDance Seedance 2.0 Mini · Text to Video

01 / 03

What is Seedance 2.0 Mini Text-to-Video?

Seedance 2.0 Mini Text-to-Video is a model from ByteDance that creates a short video from a text description alone, with no source image needed. You describe the scene, characters, and action, and the model builds the shot from scratch with motion, camera movement, and synchronized audio.

ByteDance Seedance 2.0 Mini · Text to Video