AI audio tools cover a wide range of sound-related tasks: voice synthesis, music generation, transcription, podcast editing, and audio enhancement. The 434 tools in this category serve musicians, podcasters, content creators, and developers who need programmatic audio processing.
Peer review tool for writers
Transcribe audio notes to structured text
Create music, split stems, and design covers
Resell Vapi.ai voice agents under your brand
AI music generator by Meta using language models
Dutch social audio platform for voice messages
Automate construction invoice processing
Creative and performance marketing agency
Stem separation and audio processing
Audio storytelling app for podcasts and audiobooks
Flashcard app with spaced repetition and language tools
Text-to-speech for dyslexic readers
Daily podcast of top Hacker News stories
Convert voice into social media posts for LinkedIn and Instagram
Deploy voice and conversational AI agents in days, no-code
AI that understands real conversations better than language models
Dub videos into 140+ languages with AI, no actors or studios needed
Enterprise AI, security, and infrastructure for workflow automation
Real-time voice translation between languages
Open source machine learning platform
Remove vocals or extract instrumental audio
Convert text and articles to natural-sounding audio
Record, edit, and publish podcasts with studio-quality audio
Find music for videos instantly using AI suggestions
The audio category divides into several distinct subcategories. Music generation tools like Beatopia and Music Eleven AI create original compositions from prompts or chord progressions. Transcription and speech tools handle voice-to-text, text-to-speech, and language processing. Podcast tools like Podstash and Podbrews focus on summarization, chapter generation, and content repurposing. Audio enhancement tools improve recording quality, remove background noise, or adjust acoustics.
When choosing between them, the quality of the underlying voice model matters most for speech synthesis, especially for commercial use. Music generation tools differ in style flexibility and whether you retain rights to the output. Licensing terms for generated audio vary and should be checked carefully if you plan to monetize content. Some tools require a DAW or plugin environment while others work entirely in the browser.