Otter.ai
transcription
Live meeting transcription with automatic summaries and action items.
AI audio tools cover a wide range of sound-related tasks: voice synthesis, music generation, transcription, podcast editing, and audio enhancement. The 434 tools in this category serve musicians, podcasters, content creators, and developers who need programmatic audio processing.
Listen to any text at up to 4.5x speed on any device
AI platform for generating original music tracks
Text-to-speech in 100+ languages
Generate royalty-free music with custom settings
Podcast summaries, transcripts, and note-taking
AI music generation from style and parameter inputs
AI music tools for remixing, splitting, and detecting
Create videos and podcasts by chatting with AI
Voice agents for customer service and task automation
Generate singing voices from text
Extract and process data from invoices automatically
Voice-assisted form filling with text and file input
AI agents for customer service and sales
Enterprise platform for sales, partner, and leadership training
Build custom AI companions and conversational avatars
AI platform for audio-visual creation and analysis
Generate stories, characters, and build fictional worlds
AI speech and video software for podcast creation
AI dubbing with phrase-level control in 140+ languages
Telegram CRM and outreach automation
Speech-to-text API supporting 140+ languages
Get live AI hints during meetings and presentations
Real-time speech-to-text and text-to-speech with semantic accuracy
The audio category divides into several distinct subcategories. Music generation tools like Beatopia and Music Eleven AI create original compositions from prompts or chord progressions. Transcription and speech tools handle voice-to-text, text-to-speech, and language processing. Podcast tools like Podstash and Podbrews focus on summarization, chapter generation, and content repurposing. Audio enhancement tools improve recording quality, remove background noise, or adjust acoustics.
When choosing between them, the quality of the underlying voice model matters most for speech synthesis, especially for commercial use. Music generation tools differ in style flexibility and in whether you retain rights to the output. Licensing terms for generated audio vary and should be checked carefully if you plan to monetize content. Some tools require a DAW or plugin environment, while others work entirely in the browser.