Eachlabs | AI Workflows for app builders

Ovi | Text to Video

Ovi introduces a unified paradigm for audio-video generation seamlessly combining image, text, and sound to produce coherent, cinematic video outputs where motion, visuals, and audio are generated together with natural synchronization and depth.

Avg Run Time: 45.000s

Model Slug: ovi-text-to-video

Category: Text to Video

Input

Advanced Controls

Output

Example Result

Preview and download your result.

Each execution costs $0.2000. With $1 you can run this model about 5 times.

Create a Prediction

Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.

Get Prediction Result

Poll the prediction endpoint with the prediction ID until the result is ready. The API uses long-polling, so you'll need to repeatedly check until you receive a success status.