ELEVENLABS
ElevenLabs Speech-to-Text Scribe v2 is a high-accuracy speech recognition model that converts audio into text with strong precision and multilingual support.
Avg Run Time: 20.000s
Model Slug: elevenlabs-speech-to-text-scribe-v2
API & SDK
Create a Prediction
Send a POST request to create a new prediction. This will return a prediction ID that you'll use to check the result. The request should include your model inputs and API key.
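As a sketch of that flow, the snippet below assembles the creation request. The endpoint URL, the `X-API-Key` header name, and the input field names are assumptions for illustration; check the Eachlabs API reference for the exact schema.

```python
import json
import urllib.request

# Hypothetical endpoint -- confirm against the Eachlabs API docs.
API_URL = "https://api.eachlabs.ai/v1/prediction/"

def build_create_request(api_key: str, audio_url: str, keyterms=None):
    """Assemble the POST request that creates a new prediction."""
    body = {
        "model": "elevenlabs-speech-to-text-scribe-v2",
        "input": {"audio": audio_url},
    }
    if keyterms:  # optional vocabulary biasing
        body["input"]["keyterms"] = keyterms
    headers = {
        "Content-Type": "application/json",
        "X-API-Key": api_key,  # assumed header name
    }
    return urllib.request.Request(
        API_URL, data=json.dumps(body).encode(), headers=headers, method="POST"
    )

req = build_create_request(
    "YOUR_API_KEY",
    "https://example.com/meeting.mp3",
    keyterms=["CRISPR", "quantum entanglement"],
)
# urllib.request.urlopen(req) would return JSON containing the prediction ID.
```

The network call itself is left to you so the payload construction can be inspected and tested in isolation.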
Get Prediction Result
Poll the prediction endpoint with the prediction ID until the result is ready. The API uses long-polling, so you'll need to repeatedly check until you receive a success status.
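A minimal polling loop might look like the following. The status names ("success", "error") are assumptions based on typical prediction APIs; the fetch callable is injected so any HTTP client can be plugged in.

```python
import time

def poll_prediction(fetch, prediction_id, interval=2.0, timeout=120.0):
    """Repeatedly fetch the prediction until it succeeds, fails, or times out.

    `fetch` is any callable that takes a prediction ID and returns the
    decoded JSON status dict (wrap your HTTP client of choice).
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        result = fetch(prediction_id)
        status = result.get("status")
        if status == "success":
            return result
        if status == "error":
            raise RuntimeError(result.get("error", "prediction failed"))
        time.sleep(interval)
    raise TimeoutError(f"prediction {prediction_id} not ready after {timeout}s")
```

Using `time.monotonic()` for the deadline keeps the timeout correct even if the system clock is adjusted mid-poll.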
Readme
Overview
elevenlabs-speech-to-text-scribe-v2 — Voice-to-Text AI Model
elevenlabs-speech-to-text-scribe-v2, ElevenLabs' official Scribe v2 model, delivers the most accurate batch speech-to-text transcription for long-form audio, achieving the lowest word error rate on industry benchmarks across 90+ languages. This voice-to-text AI model excels at handling complex real-world conditions like pauses, tone changes, diverse accents, and extended silences, making it ideal for subtitling, captioning, and large-scale audio processing. Developers seeking ElevenLabs voice-to-text solutions for multilingual workflows find unmatched reliability in elevenlabs-speech-to-text-scribe-v2, which automates precise text conversion from audio inputs without manual segmentation.
Technical Specifications
What Sets elevenlabs-speech-to-text-scribe-v2 Apart
elevenlabs-speech-to-text-scribe-v2 stands out in the voice-to-text AI model landscape with its optimization for batch processing of long, complex recordings, outperforming predecessors in stability and accuracy for diverse speakers and noisy environments. This enables enterprises to transcribe hours of audio reliably, scaling subtitling for media libraries or compliance reviews without quality drops.
Keyterm prompting allows up to 100 custom words or phrases, using transcript context for precise insertion in technical domains like brand names or jargon—far beyond basic custom vocabulary. Users gain accurate handling of industry-specific terms, streamlining transcription for research or training content.
Native entity detection across 56 categories, including PII, health data, and payment details with exact timestamps, supports secure audio analysis. This facilitates automated redaction and compliance in global workflows, a critical edge for developers building ElevenLabs speech-to-text API integrations.
- Smart multi-language detection transcribes mixed-language audio files automatically, supporting 90+ languages with low WER for high-accuracy-tier languages such as Hindi, Mandarin, and Spanish.
- Multichannel support for up to 5 channels assigns speaker IDs independently, ideal for meetings or podcasts.
- Output includes timed words, spacing, and audio events like laughter, in JSON format for easy parsing.
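To make the output format concrete, here is a small parsing sketch. The field names (`words`, `start`, `type`, `speaker_id`) are assumptions about the JSON shape described above; inspect a real response before relying on them.

```python
# Hypothetical response fragment matching the description above.
sample = {
    "words": [
        {"text": "Hello", "start": 0.00, "end": 0.42, "type": "word", "speaker_id": "speaker_0"},
        {"text": " ",     "start": 0.42, "end": 0.50, "type": "spacing", "speaker_id": "speaker_0"},
        {"text": "there", "start": 0.50, "end": 0.90, "type": "word", "speaker_id": "speaker_0"},
        {"text": "(laughter)", "start": 1.10, "end": 1.80, "type": "audio_event", "speaker_id": "speaker_1"},
    ]
}

def plain_text(result):
    """Join word and spacing tokens, skipping non-speech audio events."""
    return "".join(w["text"] for w in result["words"] if w["type"] != "audio_event")

def events(result):
    """Collect (start_time, label) pairs for audio events like laughter."""
    return [(w["start"], w["text"]) for w in result["words"] if w["type"] == "audio_event"]

print(plain_text(sample))  # -> Hello there
print(events(sample))      # -> [(1.1, '(laughter)')]
```

Because every token carries timestamps and a speaker ID, the same word list can drive subtitles, diarized transcripts, or event indexes without re-running the model.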
Key Considerations
- Ensure high-quality audio input for optimal word error rate performance, as noisy environments may impact accuracy
- Use keyterm prompting for domain-specific vocabulary like medical or brand terms to improve precision
- Prefer batch processing over the realtime variant when latency is not critical, as batch mode delivers higher accuracy in non-live scenarios
- Test across languages early, as automatic multi-language detection works best with clear speaker separation
- Avoid common pitfalls like overloading prompts with too many keyterms, which can dilute focus
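Since the model caps keyterm prompts at 100 entries and overloaded prompts dilute focus, it can help to sanitize the list client-side before submitting. This is an illustrative helper, not part of any official SDK:

```python
MAX_KEYTERMS = 100  # documented limit for keyterm prompting

def prepare_keyterms(terms):
    """Strip whitespace, drop case-insensitive duplicates, enforce the cap."""
    cleaned, seen = [], set()
    for t in terms:
        t = t.strip()
        if t and t.lower() not in seen:
            seen.add(t.lower())
            cleaned.append(t)
    if len(cleaned) > MAX_KEYTERMS:
        raise ValueError(
            f"keyterm list has {len(cleaned)} entries; limit is {MAX_KEYTERMS}"
        )
    return cleaned

print(prepare_keyterms(["CRISPR", " crispr ", "BrandX"]))  # -> ['CRISPR', 'BrandX']
```

Failing fast on an oversized list is cheaper than discovering a rejected or degraded transcription after the job runs.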
Tips & Tricks
How to Use elevenlabs-speech-to-text-scribe-v2 on Eachlabs
Access elevenlabs-speech-to-text-scribe-v2 seamlessly on Eachlabs via the Playground for instant testing, API for batch integrations, or SDK for custom apps. Upload audio files, set parameters like keyterms, entity detection categories, or multichannel mode, and receive JSON outputs with timed transcripts, words, and events in high-accuracy text.
Capabilities
- High-accuracy transcription with lowest industry benchmark word error rates
- Realtime processing at sub-150ms latency with 93.5% accuracy for conversational AI
- Multilingual support for 90+ languages with automatic detection in single files
- Keyterm prompting for customized vocabulary biasing (e.g., product names, medical terms)
- Entity detection across 56 categories for structured output
- Versatile for both live and batch transcription with strong precision
What Can I Use It For?
Use Cases for elevenlabs-speech-to-text-scribe-v2
Media teams managing video libraries use elevenlabs-speech-to-text-scribe-v2 for batch subtitling, feeding long-form content with mixed accents and pauses to generate precise, timestamped transcripts across 90+ languages—perfect for global distribution without manual edits.
Developers building compliance tools leverage its entity detection, uploading audio files to automatically flag and timestamp sensitive data like SSNs or medical terms, enabling secure redaction in enterprise pipelines via the elevenlabs-speech-to-text-scribe-v2 API.
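A redaction step downstream of entity detection might look like this sketch. The span shape (`start`/`end` character offsets plus a `category`) is an assumption for illustration; the real response carries timestamps and categories as described above.

```python
def redact(text, entities):
    """Replace detected entity spans with their category tag.

    `entities` is a list of {"start": int, "end": int, "category": str}
    spans over `text`. Spans are applied right-to-left so earlier
    character offsets remain valid as the string shrinks or grows.
    """
    for e in sorted(entities, key=lambda e: e["start"], reverse=True):
        text = text[: e["start"]] + f"[{e['category']}]" + text[e["end"]:]
    return text

line = "My SSN is 123-45-6789 and I take metformin."
spans = [
    {"start": 10, "end": 21, "category": "US_SSN"},
    {"start": 33, "end": 42, "category": "MEDICATION"},
]
print(redact(line, spans))
# -> My SSN is [US_SSN] and I take [MEDICATION].
```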
Researchers transcribing interviews apply keyterm prompting; for example, provide an audio file with the prompt specifying terms like "quantum entanglement" or "CRISPR variants," and the model contextually inserts them accurately, handling technical jargon in multilingual discussions.
Marketers creating multilingual campaigns process podcast episodes with smart language detection, converting diverse speaker audio into structured text for captioning, supporting use cases like automated content localization for international audiences.
Things to Be Aware Of
- Realtime variant excels in low-latency scenarios but may require optimized conditions for 30-80ms performance
- Strong positive feedback on benchmark-leading accuracy across 90+ languages in recent announcements
- Users note reliable entity detection for 56 categories, enhancing structured outputs
- Resource needs are efficient for realtime use, suitable for conversational applications
- Community feedback highlights that it consistently outperforms established models like Whisper
- Some discussions emphasize testing in noisy real-world audio for edge case robustness
Limitations
- Specific parameter counts and full architectural details not publicly available, limiting custom fine-tuning insights
- Performance in extremely noisy or accented speech may vary, though benchmarks show overall superiority; real-world testing recommended
- Primarily optimized for transcription accuracy and speed, with less emphasis on advanced post-processing features in current docs
Pricing
Pricing Type: Dynamic
$0.22/hour base rate
Current Pricing
Pricing Rules
| Condition | Pricing |
|---|---|
| keyterms and entity_detection both enabled | $0.33/hour (base $0.22 + 20% keyterm premium + 30% entity premium, additive) |
| entity_detection enabled | $0.286/hour (base $0.22 + 30% entity premium) |
| keyterms enabled | $0.264/hour (base $0.22 + 20% keyterm premium) |
| Default (Rule 4, active) | $0.22/hour base rate |
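The additive premium structure can be sketched as a small rate calculator, using only the figures from the pricing rules above:

```python
BASE_RATE = 0.22  # $/hour base rate

def hourly_rate(keyterms=False, entity_detection=False):
    """Effective $/hour: premiums are additive, +20% for keyterms
    and +30% for entity detection, applied to the base rate."""
    multiplier = 1.0
    if keyterms:
        multiplier += 0.20
    if entity_detection:
        multiplier += 0.30
    return round(BASE_RATE * multiplier, 3)

print(hourly_rate())                                       # 0.22
print(hourly_rate(keyterms=True))                          # 0.264
print(hourly_rate(entity_detection=True))                  # 0.286
print(hourly_rate(keyterms=True, entity_detection=True))   # 0.33
```

Note the premiums stack additively (base × 1.5), not multiplicatively (base × 1.2 × 1.3), which is why the combined rate is $0.33 rather than $0.3432.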
Dev questions, real answers.
ElevenLabs Scribe v2 is an advanced speech-to-text model by ElevenLabs that delivers highly accurate audio transcription across multiple languages. It supports speaker diarization, punctuation, and timestamp generation, producing clean, structured transcripts suitable for professional and enterprise use.
ElevenLabs Scribe v2 is accessible through the Eachlabs unified API. Submit an audio file; the model returns a structured JSON transcript with speaker labels and timestamps. Billing is pay-as-you-go through Eachlabs; no separate ElevenLabs subscription is required.
ElevenLabs Scribe v2 is best suited for podcast transcription, meeting notes generation, and multilingual audio indexing. Its speaker diarization capability makes it particularly valuable for interview transcription and multi-speaker content where attribution accuracy is important.
