OmniHuman

OmniHuman is an image-to-video generation model that creates realistic videos or animations from an image and performs lip sync with audio.

Partner Model · Fast Inference · REST API

Model Information

Response Time: ~200 sec
Status: Active
Version: 0.0.1
Updated: 3 days ago

Prerequisites

  • Create an API Key from the Eachlabs Console
  • Install the required dependencies for your chosen language (e.g., requests for Python)

API Integration Steps

1. Create a Prediction

Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.

import requests
import time

API_KEY = "YOUR_API_KEY"  # Replace with your API key

HEADERS = {
    "X-API-Key": API_KEY,
    "Content-Type": "application/json"
}

def create_prediction():
    response = requests.post(
        "https://api.eachlabs.ai/v1/prediction/",
        headers=HEADERS,
        json={
            "model": "omnihuman",
            "version": "0.0.1",
            "input": {
                "mode": "normal",
                "audio_url": "https://storage.googleapis.com/magicpoint/inputs/omnihuman_audio.mp3",
                "image_url": "https://storage.googleapis.com/magicpoint/models/women.png"
            }
        }
    )
    prediction = response.json()
    if prediction["status"] != "success":
        raise Exception(f"Prediction failed: {prediction}")
    return prediction["predictionID"]

2. Get Prediction Result

Poll the prediction endpoint with the prediction ID until the result is ready. The API uses long-polling, so you'll need to repeatedly check until you receive a success status.

def get_prediction(prediction_id):
    while True:
        result = requests.get(
            f"https://api.eachlabs.ai/v1/prediction/{prediction_id}",
            headers=HEADERS
        ).json()
        if result["status"] == "success":
            return result
        elif result["status"] == "error":
            raise Exception(f"Prediction failed: {result}")
        time.sleep(1)  # Wait before polling again

3. Complete Example

Here's a complete example that puts it all together, including error handling and result processing. This shows how to create a prediction and wait for the result in a production environment.

try:
    # Create prediction
    prediction_id = create_prediction()
    print(f"Prediction created: {prediction_id}")

    # Get result
    result = get_prediction(prediction_id)
    print(f"Output URL: {result['output']}")
    print(f"Processing time: {result['metrics']['predict_time']}s")
except Exception as e:
    print(f"Error: {e}")

Additional Information

  • The API uses a two-step process: create prediction and poll for results
  • Response time: ~200 seconds
  • Rate limit: 60 requests/minute
  • Concurrent requests: 10 maximum
  • Use long-polling to check prediction status until completion (a rate-limit-friendly polling variant is sketched below)
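
Given the ~200-second response time and the 60 requests/minute rate limit above, polling every second (as in the integration example) works but generates many requests. The following is a minimal sketch of a gentler polling loop with exponential backoff; it reuses requests and HEADERS from the integration example, and the delay schedule, cap, and timeout values are illustrative assumptions rather than documented requirements.

import time

def get_prediction_with_backoff(prediction_id, max_wait=600):
    # Illustrative variant of get_prediction() above: backs off between
    # checks to stay well under the 60 requests/minute rate limit.
    # The delay values and max_wait are assumptions, not API requirements.
    delay = 2
    waited = 0
    while waited < max_wait:
        result = requests.get(
            f"https://api.eachlabs.ai/v1/prediction/{prediction_id}",
            headers=HEADERS
        ).json()
        if result["status"] == "success":
            return result
        if result["status"] == "error":
            raise Exception(f"Prediction failed: {result}")
        time.sleep(delay)
        waited += delay
        delay = min(delay * 2, 30)  # cap the polling interval at 30 seconds
    raise TimeoutError(f"Prediction {prediction_id} not finished after {max_wait}s")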

Overview

OmniHuman is an advanced technology developed by ByteDance researchers that creates highly realistic human videos from a single image and a motion signal, such as audio or video. It can animate portraits, half-body, or full-body images with natural movements and lifelike gestures. By combining different inputs, like images and sound, OmniHuman brings still images to life with remarkable detail and realism.

Technical Specifications

  • Modes:
    • Normal: Standard output generation with balanced processing speed and accuracy.
    • Dynamic: More flexible and adaptive response with a focus on contextual awareness.
  • Input Handling: Supports multiple formats and performs pre-processing for enhanced output quality.
  • Output Generation: Generates coherent and high-fidelity human-like responses based on the provided inputs.
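
The mode is selected through the mode field of the request input; everything else in the payload stays the same. A minimal sketch of the two variants, using the sample assets from the integration example above:

# Example input payloads; only the "mode" value differs between the two.
normal_input = {
    "mode": "normal",   # balanced speed and accuracy, keeps the original aspect ratio
    "audio_url": "https://storage.googleapis.com/magicpoint/inputs/omnihuman_audio.mp3",
    "image_url": "https://storage.googleapis.com/magicpoint/models/women.png"
}

dynamic_input = {
    "mode": "dynamic",  # more adaptive motion, output cropped to 512 x 512
    "audio_url": "https://storage.googleapis.com/magicpoint/inputs/omnihuman_audio.mp3",
    "image_url": "https://storage.googleapis.com/magicpoint/models/women.png"
}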

Key Considerations

  • High-resolution images yield better performance compared to low-quality images.
  • Background noise in audio files can impact accuracy.
  • Dynamic mode may require more processing time but offers better adaptability.
  • The model is optimized for human faces; images of other subjects may lead to unexpected results.
  • Ensure URLs are accessible and not restricted by security settings.

Tips & Tricks

  • Mode Selection:
    • Use normal mode for standard, structured responses.
    • Use dynamic mode for more adaptive and nuanced outputs.
  • Audio Input (audio_url):
    • Prefer lossless formats (e.g., WAV) over compressed formats (e.g., MP3) for better clarity.
    • Keep audio length within a reasonable range to avoid processing delays.
    • Ensure the speech is clear, with minimal background noise.
    • Audio Normal Mode Length Limit: In normal mode, the maximum supported audio length is 180 seconds.
    • Audio Dynamic Mode Length Limit: In dynamic mode, the maximum audio length supported for pets is 90 seconds, and for real-person images, it is 180 seconds. (A quick local length check is sketched after this list.)
  • Image Input (image_url):
    • Use high-resolution, well-lit, front-facing images.
    • Avoid extreme facial angles or obstructions (e.g., sunglasses, masks) for best results.
    • Images with neutral expressions tend to produce more reliable outputs.
    • Supported Normal Mode Input Types: Normal mode can drive all picture types, including real people, anime characters, and pets.
    • Supported Dynamic Mode Input Types: Dynamic mode can drive all picture types, including real people, anime characters, and pets.
  • Output:
    • Normal Mode Output Feature: The output keeps the original image's aspect ratio.
    • Dynamic Mode Output Feature: The original image is cropped to a fixed 1:1 aspect ratio, producing output at a resolution of 512 × 512.
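
Because requests with over-length audio run into the limits listed above, it can help to check the duration locally before submitting. The sketch below uses Python's standard wave module and therefore only handles uncompressed WAV files (the format recommended above); the file name is a placeholder.

import wave

MAX_AUDIO_SECONDS = 180  # normal mode limit from the notes above

def wav_duration_seconds(path):
    # Works for uncompressed WAV only; compressed formats such as MP3
    # would need a third-party library (not shown here).
    with wave.open(path, "rb") as wf:
        return wf.getnframes() / float(wf.getframerate())

duration = wav_duration_seconds("speech.wav")  # placeholder file name
if duration > MAX_AUDIO_SECONDS:
    raise ValueError(f"Audio is {duration:.0f}s; the limit is {MAX_AUDIO_SECONDS}s")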

Capabilities

  • Processes both audio and image inputs to generate human-like responses.
  • Adapts to different scenarios using configurable modes.
  • Supports real-time and batch processing (a simple batch sketch follows this list).
  • Handles a variety of input formats for flexible usage.
  • Ensures coherence between audio and image-based outputs.
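
Batch processing, mentioned in the list above, can be approximated client-side by submitting several predictions first and then collecting the results. A minimal sketch under these assumptions: the input URLs are hypothetical placeholders, HEADERS and get_prediction() come from the API Integration Steps, and the 10-item cap mirrors the documented concurrency limit.

# Hypothetical batch: each entry is an (image_url, audio_url) pair.
batch_inputs = [
    ("https://example.com/portrait1.png", "https://example.com/speech1.wav"),
    ("https://example.com/portrait2.png", "https://example.com/speech2.wav"),
]

prediction_ids = []
for image_url, audio_url in batch_inputs[:10]:  # stay within the 10-concurrent cap
    response = requests.post(
        "https://api.eachlabs.ai/v1/prediction/",
        headers=HEADERS,
        json={
            "model": "omnihuman",
            "version": "0.0.1",
            "input": {"mode": "normal", "image_url": image_url, "audio_url": audio_url}
        }
    )
    prediction = response.json()
    if prediction["status"] != "success":
        raise Exception(f"Prediction failed: {prediction}")
    prediction_ids.append(prediction["predictionID"])

# Collect results after all jobs have been submitted.
outputs = [get_prediction(pid)["output"] for pid in prediction_ids]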

What can I use it for?

  • Voice and facial recognition-based response systems.
  • Interactive AI-driven conversational agents.
  • Enhanced multimedia content creation.
  • Automated dubbing and voice sync applications.
  • Contextually aware AI-based character simulation.

Things to be aware of

  • Experiment with different image angles to observe variations in output.
  • Use high-quality audio inputs to test response accuracy.
  • Compare normal and dynamic modes for different response behaviors (see the comparison sketch after this list).
  • Process multiple inputs to evaluate consistency in generated outputs.
  • Try combining varied voice tones and facial expressions to analyze adaptability.
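
To compare the two modes, as suggested above, submit the same image and audio once per mode and inspect both outputs. A minimal sketch reusing the normal_input and dynamic_input payloads from the Technical Specifications section and the helpers from the API Integration Steps:

for label, payload in (("normal", normal_input), ("dynamic", dynamic_input)):
    response = requests.post(
        "https://api.eachlabs.ai/v1/prediction/",
        headers=HEADERS,
        json={"model": "omnihuman", "version": "0.0.1", "input": payload}
    )
    prediction = response.json()
    if prediction["status"] != "success":
        raise Exception(f"Prediction failed: {prediction}")
    result = get_prediction(prediction["predictionID"])
    print(f"{label} mode output: {result['output']}")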

Limitations

  • Performance may vary based on the quality of input data.
  • Complex or noisy backgrounds in images can lead to inaccurate outputs.
  • Poor audio quality may result in misinterpretations.
  • Processing time may increase for larger files or complex scenarios.
  • The model is primarily trained on human faces; other objects may yield unexpected results.
  • Audio Normal Mode Length Limit: In normal mode, the maximum supported audio length is 180 seconds.
  • Audio Dynamic Mode Length Limit: In dynamic mode, the maximum audio length supported for pets is 90 seconds, and for real-person images, it is 180 seconds.

Output Format: MP4
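
Since the model returns an MP4, the output URL in the prediction result can be saved to disk directly. A small sketch, assuming the URL in result["output"] is publicly downloadable as in the complete example above; the local file name is a placeholder.

video = requests.get(result["output"])
video.raise_for_status()
with open("omnihuman_output.mp4", "wb") as f:  # placeholder file name
    f.write(video.content)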

Related AI Models

  • Kling v1.6 Image to Video (kling-ai-image-to-video) – Image to Video
  • Magic Animate (magic-animate) – Image to Video
  • Live Portrait (live-portrait) – Image to Video
  • SadTalker (sadtalker) – Image to Video