MINIMAX-MUSIC
MiniMax Music v1.5 turns simple text prompts into high-quality, expressive music. It offers a wide range of styles and moods, creating melodies and rhythms that feel natural and engaging.
Avg Run Time: 40.000s
Model Slug: minimax-music-v1-5
Playground
Input
Output
Example Result
Preview and download your result.
API & SDK
Create a Prediction
Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.
Get Prediction Result
Poll the prediction endpoint with the prediction ID until the result is ready. The API uses long-polling, so you'll need to repeatedly check until you receive a success status.
Readme
Overview
minimax-music-v1.5 — Text-to-Audio AI Model
Developed by Minimax as part of the minimax-music family, minimax-music-v1.5 transforms simple text prompts into complete, professional-grade songs up to 4 minutes long, solving the challenge of rapid music creation for creators without instruments or studios. This text-to-voice AI model excels in generating natural vocals, rich instrumentation, and structured tracks with lyrics, making it ideal for Minimax text-to-voice applications in content production. Users input a music prompt describing style, mood, and scenario—plus optional structured lyrics—and receive high-fidelity audio in formats like MP3 or WAV within 1-2 minutes.
Technical Specifications
What Sets minimax-music-v1.5 Apart
minimax-music-v1.5 stands out in the text-to-audio landscape with its support for detailed song structures using tags like [Verse], [Chorus], and [Bridge], enabling precise control over composition that many competitors lack. This allows users to craft full songs with logical flow, from intro to outro, maintaining musical coherence. It also delivers natural vocals and multi-instrument layers up to 4 minutes, surpassing shorter-clip models while offering rapid generation in 60-120 seconds.
- Enhanced lyrical section control: Specify [Intro], [Verse 1], [Chorus] in prompts for structured songs, enabling songwriters to prototype demos with professional arrangement.
- Rich style and emotion customization: Describe genres like indie pop or electronic with mood tags, producing expressive tracks with clear vocals and transients—ideal for minimax-music-v1.5 API integrations.
- Multi-format output up to 4 minutes: Supports MP3, WAV, FLAC with customizable duration, perfect for video soundtracks or podcasts without length limitations of basic generators.
Key Considerations
Lyric Length: Stay within 10-600 character limit for lyrics to ensure quality output
Style Clarity: Vague style descriptions may result in generic or unpredictable musical arrangements
Cultural Context: Traditional music styles may require specific cultural or regional terminology
Genre Mixing: Complex multi-genre requests may produce inconsistent musical results
Tempo Specifications: Include specific tempo or rhythm descriptions for better control
Vocal Style: Specify desired vocal characteristics (male/female, age, energy level)
Content Guidelines: Follow appropriate content policies for lyrical material
Legal Information for Minimax Music V1.5
By using this Minimax Music V1.5, you agree to:
- Minimax Privacy
- Minimax SERVICE AGREEMENT
Tips & Tricks
How to Use minimax-music-v1.5 on Eachlabs
Access minimax-music-v1.5 seamlessly through Eachlabs' Playground for instant testing, API for production apps, or SDK for custom integrations. Provide a text prompt with style, mood, and optional structured lyrics using [Verse], [Chorus] tags, plus duration up to 4 minutes; receive high-quality MP3/WAV outputs with natural vocals and instrumentation in 1-2 minutes. Eachlabs delivers reliable, scalable access to this Minimax powerhouse.
---Capabilities
Complete Song Generation: Creates full musical compositions with lyrics, melody, harmony, and arrangement
Multi-Genre Mastery: Handles diverse musical styles from traditional to contemporary genres
Bilingual Processing: Supports both English and Chinese lyrics with appropriate musical styling
Structural Intelligence: Automatically arranges songs with proper verse-chorus-bridge organization
Vocal Synthesis: Generates realistic vocal performances that match lyrical content and musical style
Instrumental Variety: Incorporates wide range of instruments including traditional and modern options
Emotional Expression: Translates emotional descriptions into appropriate musical elements
Cultural Authenticity: Accurately represents traditional music styles and cultural elements
What Can I Use It For?
Use Cases for minimax-music-v1.5
Content creators producing YouTube videos or podcasts use minimax-music-v1.5 to generate custom background tracks; for example, input a prompt like "upbeat indie folk with acoustic guitar, soft vocals, [Verse] about chasing dreams, [Chorus] uplifting melody" to get a 2-minute song with natural flow in under 2 minutes.
Songwriters and musicians prototype ideas via the minimax-music-v1.5 API, feeding structured lyrics and style descriptors like "punk rock energy with driving bass" to create full demos up to 4 minutes, streamlining the transition from concept to polished track.
Marketers crafting brand jingles input scenario prompts such as "energetic electronic theme for tech startup, [Bridge] building tension," leveraging its vocal nuance and instrumental clarity for memorable, original audio assets tailored to campaigns.
Developers building AI music generator apps integrate this model for user-facing tools, combining emotion control and section tags to enable real-time song creation from text, enhancing apps for educators teaching composition or gamers needing dynamic soundtracks.
Things to Be Aware Of
Beginner Compositions
- Simple Pop Song: "[verse] Walking down the street, feeling so free [chorus] This is my moment, just let me be"
- Acoustic Ballad: "[verse] Quiet nights and starlit skies [chorus] In your eyes I see forever"
- Upbeat Dance: "[verse] Move your body to the beat [chorus] Dance like nobody's watching"
- Country Folk: "[verse] Old dirt roads and summer days [chorus] Take me home to simpler ways"
Advanced Musical Concepts
- Genre Fusion: Combine classical orchestra with modern electronic elements
- Cultural Blending: Mix traditional Chinese instruments with Western pop structures
- Complex Arrangements: Create multi-section songs with intro, multiple verses, bridge, and outro
- Emotional Journeys: Develop songs that progress from melancholy verses to uplifting choruses
Professional Content
- Commercial Jingles: Short, catchy musical phrases for brand recognition
- Film Scoring: Dramatic instrumental pieces for video content and presentations
- Theme Development: Create recurring musical themes for series or ongoing projects
- Mood Music: Generate ambient and atmospheric music for various professional contexts
Experimental Projects
- Instrumental Focus: Create purely instrumental pieces highlighting specific instruments
- Vocal Harmony: Develop songs featuring complex vocal arrangements and harmonies
- Tempo Variations: Experiment with songs that change tempo throughout the composition
- Cultural Exploration: Generate authentic traditional music from various global cultures
Educational Content
- Music Theory Examples: Create songs demonstrating specific musical concepts and structures
- Historical Styles: Generate music in the style of different historical periods
- Language Practice: Produce songs in different languages for pronunciation and vocabulary practice
- Rhythm Studies: Create compositions focusing on specific rhythmic patterns and time signatures
Limitations
Duration Constraint: Maximum output length limited to 240 seconds (4 minutes)
Lyric Length: Restricted to 600 characters maximum for input lyrics
Language Limitation: Currently supports only English and Chinese languages
Style Mixing: Very complex multi-genre combinations may produce inconsistent results
Real-time Generation: Not suitable for live performance or real-time music creation
Personalization: Cannot learn individual user preferences or create custom vocal styles
Copyright Restrictions: Cannot reproduce copyrighted melodies or lyrics accurately
Technical Complexity: Very specific music theory requests may not be interpreted correctly
Output Format: MP3
Pricing
Pricing Detail
This model runs at a cost of $0.030 per execution.
Pricing Type: Fixed
The cost remains the same regardless of which model you use or how long it runs. There are no variables affecting the price. It is a set, fixed amount per run, as the name suggests. This makes budgeting simple and predictable because you pay the same fee every time you execute the model.
Related AI Models
You can seamlessly integrate advanced AI capabilities into your applications without the hassle of managing complex infrastructure.
