LatentSync

A video-to-video model, LatentSync generates accurate lip sync from audio for natural, high-quality results

Avg Run Time: 45.000s

Model Slug: latentsync

Category: Video to Video

Input

Video: Enter a URL or choose a file from your computer.

Audio: Enter a URL or choose a file from your computer.

Output

Preview and download your result.

Pricing: a flat $0.20 covers output up to 40 seconds; beyond that, each additional second of output duration is billed at $0.005.
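
For example, a 60-second output costs the flat $0.20 for the first 40 seconds plus 20 × $0.005 = $0.10 in overage, $0.30 in total. A minimal sketch of that calculation, using the rates stated above:

def estimate_cost(output_seconds, flat_fee=0.20, included_seconds=40.0, overage_rate=0.005):
    """Estimate the price of a run from the output duration in seconds."""
    overage = max(0.0, output_seconds - included_seconds)
    return flat_fee + overage * overage_rate

print(f"{estimate_cost(30):.2f}")  # 0.20 -- within the flat 40s allowance
print(f"{estimate_cost(60):.2f}")  # 0.30 -- 0.20 plus 20s of overage at 0.005/s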

Create a Prediction

Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.
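
A minimal sketch of the create call in Python using the requests library; the endpoint URL, header name, and payload/response field names below are assumptions for illustration, so check the Eachlabs API reference for the exact values:

import requests

API_KEY = "YOUR_API_KEY"                                  # your Eachlabs API key
CREATE_URL = "https://api.eachlabs.ai/v1/prediction/"     # placeholder endpoint

payload = {
    "model": "latentsync",                                # model slug from this page
    "input": {
        "video": "https://example.com/source.mp4",        # video whose lips will be re-synced
        "audio": "https://example.com/speech.wav",        # driving audio track
    },
}

response = requests.post(
    CREATE_URL,
    json=payload,
    headers={"X-API-Key": API_KEY},                       # header name is an assumption
    timeout=30,
)
response.raise_for_status()
prediction_id = response.json()["predictionID"]           # field name is an assumption
print("Created prediction:", prediction_id)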

Get Prediction Result

Poll the prediction endpoint with the prediction ID until the result is ready. The API uses long-polling, so you'll need to repeatedly check until you receive a success status.
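
A matching polling sketch; again, the endpoint, header name, and status/field names are assumptions rather than confirmed API details:

import time
import requests

API_KEY = "YOUR_API_KEY"
RESULT_URL = "https://api.eachlabs.ai/v1/prediction/{id}"    # placeholder endpoint

def wait_for_result(prediction_id, poll_interval=2.0):
    """Poll the prediction endpoint until it reports success or failure."""
    while True:
        resp = requests.get(
            RESULT_URL.format(id=prediction_id),
            headers={"X-API-Key": API_KEY},
            timeout=30,
        )
        resp.raise_for_status()
        result = resp.json()
        status = result.get("status")                        # field name is an assumption
        if status == "success":
            return result                                    # should contain the output video URL
        if status in ("error", "failed"):
            raise RuntimeError(f"Prediction failed: {result}")
        time.sleep(poll_interval)                            # still processing; check again

result = wait_for_result("PREDICTION_ID")                    # ID returned by the create step
print("Output:", result.get("output"))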

Table of Contents
Overview
Technical Specifications
Key Considerations
Tips & Tricks
Capabilities
What Can I Use It For?
Things to Be Aware Of
Limitations

Overview

LatentSync is a state-of-the-art video-to-video AI model developed by ByteDance, designed specifically for generating highly accurate and natural lip synchronization in videos based on input audio. The model leverages advanced diffusion-based generative techniques, integrating stable diffusion with a novel temporal alignment mechanism called TREPA (Temporal REPresentation Alignment). This combination allows LatentSync to produce dynamic, high-resolution video outputs where the lip movements of subjects are precisely matched to the provided audio, resulting in realistic and expressive facial animations.

Key features of LatentSync include its ability to directly model complex audio-visual correlations, ensuring that generated lip movements are both temporally consistent and visually convincing. The model is engineered to address common shortcomings of previous diffusion-based lip sync methods, particularly in maintaining smooth transitions and coherence across video frames. TREPA, the core innovation, utilizes temporal representations from large-scale self-supervised video models to align generated frames with ground truth, significantly improving temporal consistency without sacrificing lip-sync accuracy.
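
As a rough, unofficial illustration of that idea, a TREPA-style objective can be sketched as a distance between temporal representations of generated and ground-truth frame sequences; feature_extractor below is a stand-in for a large self-supervised video model and is not part of any published API:

import torch
import torch.nn.functional as F

def temporal_alignment_loss(generated_frames, ground_truth_frames, feature_extractor):
    """Conceptual sketch of a temporal representation alignment objective.

    Both inputs are frame sequences shaped (batch, time, channels, height, width).
    feature_extractor is assumed to map a frame sequence to temporal features;
    minimizing the distance between the two feature sets encourages temporally
    consistent generations without constraining per-frame lip-sync accuracy.
    """
    with torch.no_grad():
        target_features = feature_extractor(ground_truth_frames)   # no gradient through the target
    generated_features = feature_extractor(generated_frames)
    return F.mse_loss(generated_features, target_features)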

LatentSync stands out for its high-quality outputs, versatility across various video and audio formats, and its robust handling of challenging lip sync scenarios. Its architecture is optimized for both professional and creative applications, making it a preferred choice for content creators, animators, and researchers seeking advanced audio-driven video generation capabilities.

Technical Specifications

  • Architecture: Diffusion-based generative model with Stable Diffusion backbone and TREPA (Temporal REPresentation Alignment) module
  • Parameters: Not explicitly stated in public sources, but described as large-scale
  • Resolution: High-resolution output; specific resolutions not detailed, but supports dynamic and realistic video generation
  • Input/Output formats: Accepts mp4 for video input; supports mp3, aac, wav, and m4a for audio input; outputs high-quality video with synchronized audio
  • Performance metrics: Notable improvements in temporal consistency and lip-sync accuracy; specific metrics such as FID, CSIM, and SSIM are referenced in related models, but not directly published for LatentSync

Key Considerations

  • Clean, isolated vocal tracks yield the best lip sync results; background noise or music can degrade accuracy (a minimal audio-cleanup sketch follows this list)
  • Proper alignment of audio and video input is crucial for optimal synchronization
  • For long videos, ensure sufficient computational resources to maintain quality and temporal consistency
  • Using high-quality reference frames improves facial realism and identity preservation
  • There is a trade-off between generation speed and output quality; lower step counts accelerate inference but may reduce fidelity
  • Batch processing and chunking strategies can help manage memory usage for extended video sequences
  • Prompt engineering (e.g., specifying desired expressions or speaking styles) can enhance expressiveness and realism
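
One way to do the audio cleanup mentioned in the first item (and in the Tips & Tricks below) is a single ffmpeg pass that drops the video stream, downmixes to mono, and applies light band-limiting and denoising; the filter choices are illustrative defaults, not settings recommended by the model authors:

import subprocess

def clean_audio(input_path, output_path="speech_clean.wav"):
    """Extract a denoised, speech-focused mono track with ffmpeg."""
    subprocess.run(
        [
            "ffmpeg", "-y", "-i", input_path,
            "-vn",                                         # drop any video stream
            "-ac", "1",                                    # downmix to mono
            "-af", "highpass=f=80,lowpass=f=8000,afftdn",  # trim non-speech bands, light denoise
            output_path,
        ],
        check=True,
    )
    return output_path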

Tips & Tricks

  • Use clean, speech-only audio files to maximize lip sync accuracy and reduce artifacts
  • For best results, preprocess audio to remove background music or noise before input
  • Adjust the number of diffusion steps: higher steps improve quality but increase processing time; use lower steps for previews
  • When generating long videos, split audio and video into manageable chunks and stitch outputs for seamless results (see the chunk-and-stitch sketch after this list)
  • Experiment with different reference frames to achieve desired facial expressions or character consistency
  • Leverage the TREPA module settings (if exposed) to fine-tune temporal alignment for smoother transitions
  • Iteratively refine outputs by adjusting input parameters and reviewing intermediate results before final rendering
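
A minimal sketch of the chunk-and-stitch approach for long inputs: split the source video into fixed-length segments with ffmpeg, run each segment through the model, then concatenate the processed chunks. Segment length and file naming here are arbitrary choices, not model requirements:

import glob
import subprocess

def split_video(input_path, chunk_seconds=30):
    """Split a video into fixed-length chunks using ffmpeg's segment muxer."""
    subprocess.run(
        [
            "ffmpeg", "-y", "-i", input_path,
            "-c", "copy", "-map", "0",
            "-f", "segment", "-segment_time", str(chunk_seconds),
            "-reset_timestamps", "1",
            "chunk_%03d.mp4",
        ],
        check=True,
    )
    return sorted(glob.glob("chunk_*.mp4"))

def stitch_videos(chunk_paths, output_path="stitched.mp4"):
    """Concatenate processed chunks back into one file via ffmpeg's concat demuxer."""
    with open("chunks.txt", "w") as f:
        for path in chunk_paths:
            f.write(f"file '{path}'\n")
    subprocess.run(
        ["ffmpeg", "-y", "-f", "concat", "-safe", "0",
         "-i", "chunks.txt", "-c", "copy", output_path],
        check=True,
    )
    return output_path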

Capabilities

  • Generates highly accurate and natural lip synchronization from audio for video subjects
  • Maintains strong temporal consistency across frames, reducing jitter and unnatural transitions
  • Supports high-resolution video output with detailed facial expressions and mouth movements
  • Adapts to various input formats and is robust to different speaker identities and languages
  • Excels at handling complex audio-visual correlations, producing expressive and realistic results
  • Suitable for both short clips and longer video sequences without significant loss of quality

What Can I Use It For?

  • Professional dubbing and localization of video content, ensuring precise lip sync for different languages
  • Animation and VFX workflows where realistic speech-driven facial animation is required
  • Virtual avatars, digital humans, and character-driven narratives in games or interactive media
  • Content creation for social media, marketing, and entertainment, such as music videos or dialogue scenes
  • Accessibility applications, such as generating synchronized sign language or lip-reading aids
  • Research in human-computer interaction, speech synthesis, and audiovisual communication

Things to Be Aware Of

  • Some users report that background noise or music in the audio input can cause lip sync inaccuracies or artifacts
  • The model requires substantial computational resources, especially for high-resolution or long-duration videos
  • Temporal consistency is significantly improved over previous diffusion-based methods, but may still show minor artifacts in challenging scenarios
  • Users highlight the importance of clean reference frames for maintaining identity and realism
  • Positive feedback centers on the model’s naturalness, expressiveness, and adaptability to various use cases
  • Negative feedback occasionally mentions longer processing times for high-quality outputs and the need for careful audio preprocessing
  • Experimental features, such as advanced temporal alignment settings, may require tuning for optimal results

Limitations

  • High computational and memory requirements may limit accessibility for users without powerful hardware
  • Performance may degrade with poor-quality audio or non-speech inputs, resulting in less accurate lip sync
  • Not optimal for real-time applications or scenarios requiring ultra-fast inference