Otter.ai
transcription
Live meeting transcription with automatic summaries and action items.
AI audio tools cover a wide range of sound-related tasks: voice synthesis, music generation, transcription, podcast editing, and audio enhancement. The 434 tools in this category serve musicians, podcasters, content creators, and developers who need programmatic audio processing.
Text-to-speech tool that reads documents, PDFs, and web pages aloud with AI voices
Natural-sounding text-to-speech for games, audiobooks, and e-learning
Generate royalty-free music from text prompts
Convert text to lifelike speech
Text-to-speech in 100+ languages
AI tool for generating creative briefs
Remove background noise and edit podcasts with AI
AI voice conversion and cover creation
AI voice synthesis with 1000+ voices in 142+ languages
AI music generation from style and parameter inputs
AI music tools for remixing, splitting, and detecting
Voice agents for customer service and task automation
Speech development screening quiz
AI automation for sales workflows
Generate singing voices from text
Redesign rooms using photos and style preferences
Voice-assisted form filling with text and file input
Enterprise platform for sales, partner, and leadership training
AI music generator powered by Grammy-nominated producers
Interactive music platform on Roblox
AI dubbing with phrase-level control in 140+ languages
Natural-sounding text-to-speech for videos and learning
Edit podcasts and add effects in minutes
The audio category divides into several distinct subcategories. Music generation tools like Beatopia and Music Eleven AI create original compositions from prompts or chord progressions. Transcription and speech tools handle voice-to-text, text-to-speech, and language processing. Podcast tools like Podstash and Podbrews focus on summarization, chapter generation, and content repurposing. Audio enhancement tools improve recording quality, remove background noise, or adjust acoustics.
When choosing among these tools, the quality of the underlying voice model matters most for speech synthesis, especially for commercial use. Music generation tools differ in style flexibility and in whether you retain rights to the output. Licensing terms for generated audio vary from tool to tool, so check them carefully if you plan to monetize content. Some tools require a DAW or plugin environment, while others work entirely in the browser.