# LatentSync A video-to-video model, LatentSync generates accurate lip sync from audio for natural, high-quality results ## API Information - **Model Slug:** latentsync - **Branded URL:** https://www.eachlabs.ai/alibaba/latentsync/latentsync - **Provider:** Alibaba - **Category:** Video to Video - **Output Type:** video - **Status:** active - **Version:** 0.0.1 - **Base Cost:** Flat $0.20 up to 40s, then $0.005 per second overage from output duration - **Estimated Processing Time:** 45 seconds - **Last Updated:** 2026-04-06 - **Interactive Demo:** https://www.eachlabs.ai/ai-models/latentsync ## Pricing - **Charge Type:** dynamic - **Estimated Price (default example):** $0.2000 - **Pricing Details:** Flat $0.20 up to 40s, then $0.005 per second overage from output duration ### Pricing Rules | Rule | Condition | Price | | --- | --- | --- | | tiered_duration_from_output | - | - | ## Input Schema | Parameter | Type | Required | Default | Constraints | Description | |-----------|------|----------|---------|-------------|-------------| | video_url | string | Yes | - | - | The URL of the video to generate the lip sync for. | | audio_url | string | Yes | - | - | The URL of the audio to generate the lip sync for. | | guidance_scale | number | No | 1 | 1–2 | Guidance scale for the model inference | | seed | integer | No | - | - | Random seed for generation. If None, a random seed will be used. | | loop_mode | string | No | - | pingpong,loop | Video loop mode when audio is longer than video. Options: pingpong, loop | ## Example Request ```bash curl -X POST https://api.eachlabs.ai/v1/prediction/ \ -H "X-API-Key: YOUR_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "latentsync", "input": { "video_url": "https://storage.googleapis.com/magicpoint/inputs/latentsync-input-video.mp4", "audio_url": "https://storage.googleapis.com/magicpoint/inputs/latentsync-input-audio.mp3" } }' ``` ## Output Schema Response returned by `GET /v1/prediction/{id}` when the job completes: ```json { "status": "success", "predictionID": "string", "output": "string (URL of generated video)", "metrics": { "predict_time": "number (seconds)" } } ``` ## Polling ```bash curl https://api.eachlabs.ai/v1/prediction/{PREDICTION_ID} \ -H "X-API-Key: YOUR_API_KEY" ``` | Status | Meaning | |--------|---------| | `processing` | Still running — poll again | | `success` | Done — read `output` | | `error` | Failed — read `message` / `details` | ## Webhook (alternative to polling) Pass `"webhook_url": "https://your.host/path"` in the create request. Eachlabs POSTs this payload when the job ends: ```json { "exec_id": "prediction-uuid", "status": "succeeded", "output": "https://...", "error": "" } ``` `status` is `"succeeded"` or `"failed"`. `exec_id` equals the `predictionID` from create. Return 2xx within 30 seconds. ## Errors Error body: `{ "status": "error", "message": "...", "details": "..." }` | Code | Meaning | |------|---------| | `400` | Invalid input | | `401` | Missing / invalid `X-API-Key` | | `404` | Unknown model or prediction id | | `429` | Rate limit — 100 creates / min, 10 concurrent per key | | `5xx` | Retry with backoff | ## Overview **latentsync — Video-to-Video AI Model** latentsync, developed by Alibaba as part of the latentsync family, delivers precise lip synchronization for video-to-video generation, transforming input videos and audio into natural, high-fidelity outputs with accurate facial movements. This Alibaba video-to-video model excels at creating realistic lip sync from audio inputs, solving the challenge of mismatched mouth movements in AI-generated talking head videos. Users searching for **video-to-video AI model** with superior audio-visual alignment find latentsync ideal for professional-grade results without manual editing. Built on advanced diffusion technology, latentsync supports seamless integration of audio-driven expressions, making it a go-to for creators needing **Alibaba latentsync API** capabilities in dynamic video production. ## Usage Notes - API Base URL: `https://api.eachlabs.ai/v1` - Authentication: send `X-API-Key: YOUR_API_KEY`. Generate a key from the Eachlabs dashboard at https://www.eachlabs.ai/dashboard/api-keys. - File-typed parameters (`*_url`, `image_url`, `video_url`, `audio_url`, etc.) accept publicly-reachable HTTPS URLs only. Upload your asset first (GCS / S3 / your CDN) and pass the resulting URL. Data-URIs and localhost URLs are rejected. - For structured parameters (arrays / objects) send real JSON values, not stringified payloads. - Monetary values are reported in USD; per-token / per-megapixel rates may be billed in micro-cents internally. - Prefer `webhook_url` over polling for long-running predictions — see the Webhook Callback section.