# Kling | v3 | Standard | Text to Video Creates AI videos from text prompts using Kling O3 Standard a faster, cost-efficient option for generating cinematic clips up to 15 seconds with native audio generation. ## API Information - **Model Slug:** kling-v3-standard-text-to-video - **Branded URL:** https://www.eachlabs.ai/kling/kling-v3/kling-v3-standard-text-to-video - **Provider:** Kling - **Category:** Text to Video - **Output Type:** video - **Status:** active - **Version:** 0.0.1 - **Estimated Processing Time:** 50 seconds - **Last Updated:** 2026-06-08 - **Interactive Demo:** https://www.eachlabs.ai/ai-models/kling-v3-standard-text-to-video ## Pricing Pricing information not available. ## Input Schema | Parameter | Type | Required | Default | Constraints | Description | |-----------|------|----------|---------|-------------|-------------| | prompt | string | No | - | - | Text prompt for video generation. Either prompt or multi_prompt must be provided, but not both. | | duration | string | No | 5 | 3,4,5,6,7,8,9,10,11,12,13,14,15 | The duration of the generated video in seconds | | multi_prompt | array | No | - | 0–5 | List of prompts for multi-shot video generation. If provided, overrides the single prompt and divides the video into multiple shots with specified prompts and durations. | | generate_audio | boolean | No | true | - | Whether to generate native audio for the video. Supports Chinese and English voice output. Other languages are automatically translated to English. For English speech, use lowercase letters; for acronyms or proper nouns, use uppercase. | | voice_ids | array | No | - | 0–2 | Optional Voice IDs for video generation. Reference voices in your prompt with <<>> and <<>> (maximum 2 voices per task). Get voice IDs from the kling video create-voice endpoint: https://www.eachlabs.ai/kling/kling-voice/kling-voice-create | | shot_type | string | No | customize | customize,intelligent | The type of multi-shot video generation | | aspect_ratio | string | No | 16:9 | 16:9,9:16,1:1 | The aspect ratio of the generated video frame | | negative_prompt | string | No | blur, distort, and low quality | - | - | | cfg_scale | number | No | 0.5 | 0–1 | The CFG (Classifier Free Guidance) scale is a measure of how close you want the model to stick to your prompt. | ## Example Request ```bash curl -X POST https://api.eachlabs.ai/v1/prediction/ \ -H "X-API-Key: YOUR_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "model": "kling-v3-standard-text-to-video", "input": {} }' ``` ## Output Schema Response returned by `GET /v1/prediction/{id}` when the job completes: ```json { "status": "success", "predictionID": "string", "output": "string (URL of generated video)", "metrics": { "predict_time": "number (seconds)" } } ``` ## Polling ```bash curl https://api.eachlabs.ai/v1/prediction/{PREDICTION_ID} \ -H "X-API-Key: YOUR_API_KEY" ``` | Status | Meaning | |--------|---------| | `processing` | Still running — poll again | | `success` | Done — read `output` | | `error` | Failed — read `message` / `details` | ## Webhook (alternative to polling) Pass `"webhook_url": "https://your.host/path"` in the create request. Eachlabs POSTs this payload when the job ends: ```json { "exec_id": "prediction-uuid", "status": "succeeded", "output": "https://...", "error": "" } ``` `status` is `"succeeded"` or `"failed"`. `exec_id` equals the `predictionID` from create. Return 2xx within 30 seconds. ## Errors Error body: `{ "status": "error", "message": "...", "details": "..." }` | Code | Meaning | |------|---------| | `400` | Invalid input | | `401` | Missing / invalid `X-API-Key` | | `404` | Unknown model or prediction id | | `429` | Rate limit — 100 creates / min, 10 concurrent per key | | `5xx` | Retry with backoff | ## Overview Kling | v3 | Standard | Text to Video, from provider Kling in the kling-v3 family, transforms text prompts or reference images into high-quality video clips with synchronized native audio. This model solves the challenge of creating cinematic, multi-shot videos efficiently, balancing quality, speed, and cost for creators needing consistent motion and sound. Its primary differentiator is structured multi-prompt support for up to six sequential shots in a single generation, enabling seamless scene transitions without manual editing. Ideal for narrative clips, social content, and product demos, Kling | v3 | Standard | Text to Video delivers temporally stable outputs with dialogue, ambient sound, and clear character tracking. Available via the Kling | v3 | Standard | Text to Video API on platforms like each::labs, it supports both text-to-video and image-to-video workflows for versatile Kling text-to-video production. ## Usage Notes - API Base URL: `https://api.eachlabs.ai/v1` - Authentication: send `X-API-Key: YOUR_API_KEY`. Generate a key from the Eachlabs dashboard at https://www.eachlabs.ai/dashboard/api-keys. - File-typed parameters (`*_url`, `image_url`, `video_url`, `audio_url`, etc.) accept publicly-reachable HTTPS URLs only. Upload your asset first (GCS / S3 / your CDN) and pass the resulting URL. Data-URIs and localhost URLs are rejected. - For structured parameters (arrays / objects) send real JSON values, not stringified payloads. - Monetary values are reported in USD; per-token / per-megapixel rates may be billed in micro-cents internally. - Prefer `webhook_url` over polling for long-running predictions — see the Webhook Callback section.