Ovi | Text to Video
Ovi introduces a unified paradigm for audio-video generation. It combines image, text, and sound to produce coherent, cinematic video in which motion, visuals, and audio are generated together with natural synchronization and depth.
Avg Run Time: 45 s
Model Slug: ovi-text-to-video
Category: Text to Video
Create a Prediction
Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.
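The step above can be sketched as follows. This is a minimal illustration, not the documented API: the endpoint URL, the `X-API-Key` header name, the `model` and `input` body fields, and the `predictionID` response field are all assumptions, since the page names none of them. Only the model slug `ovi-text-to-video` comes from this page.

```python
import json
import urllib.request

# Hypothetical placeholder: the page does not state the real endpoint URL.
API_URL = "https://api.example.com/v1/prediction/"
# Model slug as listed on this page.
MODEL_SLUG = "ovi-text-to-video"


def build_create_request(api_key, inputs, url=API_URL):
    """Build (but do not send) the POST request that creates a prediction.

    The header name and the body field names are assumptions.
    """
    body = json.dumps({"model": MODEL_SLUG, "input": inputs}).encode("utf-8")
    headers = {"X-API-Key": api_key, "Content-Type": "application/json"}
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def create_prediction(request, opener=urllib.request.urlopen):
    """Send the request and return the prediction ID from the JSON response.

    'predictionID' is an assumed field name; adjust it to the real response.
    """
    with opener(request) as response:
        payload = json.load(response)
    return payload["predictionID"]
```

A call might look like `create_prediction(build_create_request(my_key, {"prompt": "a lighthouse at dusk"}))`, where the `prompt` input name is also hypothetical. Keeping request construction separate from sending makes the payload easy to inspect before any network call is made.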
Get Prediction Result
Poll the prediction endpoint with the prediction ID until the result is ready. Repeat the request at a short interval until the response reports a success status.
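A polling loop for this step might look like the sketch below. It assumes the response is a JSON object with a `status` field that takes the value `success` when the result is ready; the failure values `error` and `failed`, the interval, and the timeout are illustrative guesses. The `fetch` argument is a hypothetical caller-supplied function that performs the GET request for a prediction ID and returns the decoded response.

```python
import time


def poll_prediction(fetch, prediction_id, interval=2.0, timeout=300.0, sleep=time.sleep):
    """Call fetch(prediction_id) repeatedly until the status is 'success'.

    fetch is supplied by the caller and returns the decoded JSON response
    as a dict. The 'status' field and its values are assumptions.
    """
    deadline = time.monotonic() + timeout
    while True:
        result = fetch(prediction_id)
        status = result.get("status")
        if status == "success":
            return result
        if status in ("error", "failed"):
            raise RuntimeError(f"prediction {prediction_id} failed: {result}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"prediction {prediction_id} not ready after {timeout} s")
        # Wait before the next check to avoid hammering the endpoint.
        sleep(interval)
```

With an average run time of about 45 seconds, an interval of a few seconds keeps the number of requests modest while still returning the result soon after it is ready.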