transcription 12

AI transcription tools convert spoken audio and video into text, supporting use cases from meeting notes and podcast captions to accessibility compliance and research. With 76 tools in this category, there is significant variation in accuracy, language support, and specialization.

Jammable

transcription

Create music covers with AI voice models

Free 42 · 43,722 votes

Convert audio and video to text online for free

transcription

Transcribe audio and video to text

Free 40 · 31,951 votes

Speechforms

transcription

Speech development screening quiz

Free 38 · 63,238 votes

1forAll

transcription

Natural-sounding text-to-speech for videos and learning

Free 37 · 44,030 votes

MiiTel

transcription

Voice analytics and Smart PBX for sales teams

Free 32 · 29,291 votes

Voicesend AI

transcription

Ringless voicemail drops at scale

Free 32 · 52,801 votes

Bắn Cá Đổi Thưởng

transcription

Vietnamese arcade fishing game with rewards

Free 31 · 44,084 votes

MagicLoop

transcription

Voice AI for revenue growth and lead qualification

Free 31 · 28,390 votes

NotesNudge

transcription

Daily reminders of your past insights

Free 30 · 21,781 votes

The co-pilot for your product team

transcription

Aggregate customer feedback to drive decisions

Free 30 · 19,537 votes

Voicesii

transcription

Dutch social audio platform for voice messages

Free 28 · 8,521 votes

Rain

transcription

Creative and performance marketing agency

Free 28 · 7,024 votes

Accuracy is the primary differentiator, and it varies by language, accent, audio quality, and domain vocabulary. General transcription tools like WhisperClip and Whisper Notes are built on open-source Whisper models and handle a wide range of languages, while specialized tools focus on specific contexts such as medical, legal, or broadcast media. Apptek, for instance, targets enterprise and broadcast workflows. When choosing, check three things first: whether the tool supports your language and accent well, whether it can handle multiple speakers, and whether the output is editable in the interface before export. Turnaround time also matters for live or near-live use cases. Pricing models range from per-minute charges to monthly minute allowances, so estimate cost from your actual recording hours rather than comparing feature lists.
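To make the pricing comparison concrete, here is a minimal sketch of the arithmetic in Python. Every number in it (the per-minute rate, the monthly fee, the included minutes, the overage rate) is a hypothetical placeholder, not the price of any tool listed here, and it assumes an allowance plan that bills overage per minute. Amounts are in cents to keep the math exact.

```python
def per_minute_cost(minutes: int, rate_cents: int) -> int:
    """Monthly cost in cents under pure per-minute pricing."""
    return minutes * rate_cents


def allowance_cost(minutes: int, fee_cents: int,
                   included_minutes: int, overage_cents: int) -> int:
    """Monthly cost in cents under a flat fee with included minutes.

    Minutes beyond the allowance are billed at the overage rate
    (an assumption; some plans may block usage instead).
    """
    extra_minutes = max(0, minutes - included_minutes)
    return fee_cents + extra_minutes * overage_cents


# Hypothetical usage: 20 hours of recordings per month.
monthly_minutes = 20 * 60

# Hypothetical plans, for illustration only.
pay_as_you_go = per_minute_cost(monthly_minutes, rate_cents=10)
subscription = allowance_cost(monthly_minutes, fee_cents=3000,
                              included_minutes=600, overage_cents=12)

print(f"Per-minute plan: ${pay_as_you_go / 100:.2f}")
print(f"Allowance plan:  ${subscription / 100:.2f}")
```

Substitute your own monthly recording time and the published rates of the tools you are comparing. The cheaper plan can flip as usage changes: in this sketch, a light user who stays far below the allowance would still pay the full flat fee.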