WAN-2.7

Wan 2.7 Video Edit applies instruction-based edits, reference-image-based edits, or style transfer to existing videos. It supports 720P and 1080P resolutions, can preserve or regenerate audio, and accepts input videos 2 to 10 seconds long.

Avg Run Time: 300.000s

Model Slug: alibaba-wan-2-7-video-edit

Release Date: April 3, 2026

Input

Prompt

Negative Prompt

Video URL* (URL or file upload, max 50MB)

Reference Image 1 (URL or file upload, max 50MB)

Reference Image 2 (URL or file upload, max 50MB)

Reference Image 3 (URL or file upload, max 50MB)

Resolution

Aspect Ratio

Duration

Audio Setting

Prompt Extend

Watermark

Seed
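The limits stated on this page (input videos of 2 to 10 seconds, files up to 50MB, 720P or 1080P) can be checked before a job is submitted. The following is a minimal sketch, not the service's own validation: `EditRequest` and `validate` are hypothetical names, "50MB" is assumed to mean 50 MiB, and 1080P is assumed to be the default resolution because the pricing marks it as default.

```python
from dataclasses import dataclass

# Limits taken from this page; the byte interpretation of "50MB" is an assumption.
MAX_FILE_BYTES = 50 * 1024 * 1024
MIN_SECONDS, MAX_SECONDS = 2.0, 10.0
RESOLUTIONS = ("720P", "1080P")


@dataclass
class EditRequest:
    """Hypothetical container for the values checked below."""
    video_seconds: float
    video_bytes: int
    resolution: str = "1080P"  # assumed default


def validate(req: EditRequest) -> list[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if not MIN_SECONDS <= req.video_seconds <= MAX_SECONDS:
        problems.append("input video must be 2 to 10 seconds long")
    if req.video_bytes > MAX_FILE_BYTES:
        problems.append("input video exceeds 50MB")
    if req.resolution not in RESOLUTIONS:
        problems.append("resolution must be 720P or 1080P")
    return problems
```

Returning a list of problems rather than raising on the first one lets a caller report every issue at once.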

Pricing Type: Dynamic

Current Pricing

1080P pricing: $0.15/sec (default)

Pricing Rules

Condition	Pricing
`resolution matches "720P"`	720P pricing: $0.10/sec
`Rule 2` (Active)	1080P pricing: $0.15/sec (default)
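As a worked illustration of the rules above, the per-second rates can be turned into a cost estimate. This is a sketch under stated assumptions: the page does not say which duration is billed or how partial seconds are rounded, so the function simply multiplies the rate by the seconds passed in and rounds to cents. The name `estimate_cost` is hypothetical, and the "720P" match is assumed to be an exact string comparison.

```python
from decimal import Decimal

# Per-second rates from the pricing rules; any resolution other than
# "720P" falls through to the default (1080P) rule.
RATES_PER_SECOND = {"720P": Decimal("0.10")}
DEFAULT_RATE = Decimal("0.15")


def estimate_cost(resolution: str, seconds: float) -> Decimal:
    """Estimate the price in USD for `seconds` of video (assumed billing basis)."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    rate = RATES_PER_SECOND.get(resolution, DEFAULT_RATE)
    # Convert via str() to avoid binary floating-point artifacts.
    return (rate * Decimal(str(seconds))).quantize(Decimal("0.01"))
```

For example, `estimate_cost("720P", 5)` gives `Decimal("0.50")` and `estimate_cost("1080P", 10)` gives `Decimal("1.50")`.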

Related AI Models

Video to Video

Generates high-quality, realistic lip-sync animations from audio using the state-of-the-art Sync Lipsync 2 Pro model, preserving natural teeth, unique facial features, and lifelike expressions.

Sync | Lipsync | v2 | Pro

220 s

Video to Video

PixVerse Swap replaces a subject or object in an existing video with a reference image. Provide a video and the new image, and Swap automatically targets the primary detected subject (face, body, or object). v1 caveat: the first detected subject (mask_info[0]) is auto-picked. Up to 720p; the source video codec must be h.264 or h.265.

PixVerse Swap

160 s

Video to Video

PixVerse Lip Sync v2 synchronizes mouth movements in videos with provided audio or text-to-speech, supporting multiple built-in voices or custom audio input.

PixVerse Lip Sync v2

80 s

Video to Video

When your footage isn't long enough, use veo3-1-extend-video to seamlessly extend the duration without breaking the scene's context or narrative flow.