Audio AI tools
AI tools for voice generation, speech synthesis, audio editing and voiceovers. Explore 82 curated tools in this category.
ElevenLabs
Audio
Realistic AI voice and speech synthesis
ElevenLabs provides state-of-the-art AI voice generation with natural-sounding text-to-speech, voice cloning, and multilingual support for creators, publishers, and developers.
Murf AI
Audio
Studio-quality AI voiceover generator
Murf AI is a voice generation platform offering a library of lifelike AI voices for creating voiceovers for videos, presentations, podcasts, and e-learning content.
Descript
Audio
AI-powered podcast and video editor
Descript is an AI-driven audio and video editing platform that lets users edit media by editing text transcripts, with features like overdub voice cloning and automatic filler word removal.
Krisp
Audio
AI noise cancellation for calls
Krisp is an AI-powered noise cancellation app that removes background noise, echo, and voices from calls in real time, improving audio quality for remote workers and podcasters.
Voicemod
Audio
Real-time AI voice changer and effects
Voicemod is an AI-powered real-time voice changer and soundboard app for gamers, streamers, and content creators, offering hundreds of voice effects and custom voice creation.
Resemble AI
Audio
AI voice cloning and synthesis platform
Resemble AI enables developers and enterprises to create custom AI voices, clone existing voices, and generate realistic speech for games, virtual assistants, and media.
Soundraw
Audio
AI music generator for creators
Soundraw is an AI music generation platform that creates royalty-free, customizable music tracks for videos, podcasts, and social media based on mood, genre, and length.
Aiva
Audio
AI music composition assistant
Aiva is an AI music composition tool that generates original soundtracks for films, games, ads, and other media in a wide variety of styles and emotional moods.
Podcastle
Audio
AI podcast recording and editing platform
Podcastle is an AI-powered podcast creation platform with studio-quality recording, automatic transcription, AI voice cloning, and one-click audio enhancement.
Fliki
Audio
AI text-to-video with voiceover
Fliki converts text scripts and blog posts into engaging videos using realistic AI voiceovers and a rich media library, perfect for social media and marketing content.
Synthflow AI
Audio
No-code AI voice agent builder
Synthflow AI enables businesses to build and deploy AI voice agents for phone calls, customer support, and appointment booking without any coding required.
Speechify
Audio
AI text-to-speech reading assistant
Speechify converts any written content including PDFs, articles, emails, and books into high-quality audio using AI voices, helping users consume content faster.
Whisper AI
Audio
OpenAI's speech recognition model
Whisper is an open-source automatic speech recognition system by OpenAI trained on large-scale multilingual data, offering highly accurate transcription across dozens of languages.
AssemblyAI
Audio
AI speech recognition API for developers
AssemblyAI provides a powerful speech-to-text API with speaker diarization, sentiment analysis, content moderation, and auto-highlights for developers building audio-powered apps.
Deepgram
Audio
Enterprise AI speech recognition API
Deepgram is an AI-powered speech recognition platform offering real-time and batch transcription APIs with high accuracy, low latency, and custom model training for enterprises.
Rask AI
Audio
AI video translation and dubbing tool
Rask AI translates and dubs videos into 130+ languages using AI voice cloning and lip-sync technology, helping creators and businesses reach global audiences instantly.
Papercup
Audio
AI video dubbing for global reach
Papercup is an AI dubbing platform that translates video content into multiple languages with realistic AI voices, preserving the speaker's tone and personality.
Dubverse
Audio
AI multilingual video dubbing platform
Dubverse is an AI-powered video dubbing solution that helps content creators and enterprises translate and dub videos in multiple languages with natural-sounding AI voices.
Typecast
Audio
AI voice actor and video creation tool
Typecast is an AI voice and video content platform with a library of AI voice actors for creating narrated videos, podcasts, e-learning content, and audiobooks.
Play.ht
Audio
AI voice generator and podcast creator
Play.ht is an AI voice generation platform offering ultra-realistic text-to-speech voices in 900+ styles, used for podcasts, audiobooks, e-learning, and accessibility.
Lovo AI
Audio
AI voice generator with 500+ voices
Lovo AI is a voice synthesis platform with over 500 AI voices in 100+ languages, offering voice cloning, text-to-speech, and AI video creation for media professionals.
Coqui AI
Audio
Open-source AI voice cloning
Coqui AI provides open-source text-to-speech and voice cloning technology for developers, offering state-of-the-art models for building custom voice applications.
Uberduck
Audio
AI voice synthesis for music and media
Uberduck is an AI voice synthesis platform with thousands of expressive AI voices used for music production, rap vocals, meme creation, and creative content.
Suno AI
Audio
AI music generation from text prompts
Suno AI generates complete, high-quality songs with vocals, instruments, and lyrics from simple text prompts, making music creation accessible to everyone.
Udio
Audio
AI music composition and generation
Udio is an AI music generation platform that creates original, professional-quality songs across any genre from text descriptions, with fine-grained style and mood controls.
Boomy
Audio
AI music creator for instant songs
Boomy enables anyone to create original AI-generated songs in seconds and submit them to streaming platforms to earn royalties, democratizing music production.
Loudly
Audio
AI music generator for content creators
Loudly is an AI music generation platform that creates royalty-free background music for videos, podcasts, and social media based on genre, tempo, and mood selection.
Ecrett Music
Audio
AI royalty-free music for video creators
Ecrett Music uses AI to generate royalty-free background music for videos and content by selecting scene type, mood, and genre, making music production simple and affordable.
Beatoven AI
Audio
AI background music composer for videos
Beatoven AI creates unique, mood-based royalty-free music for videos and podcasts by composing tracks that adapt to the emotional needs of different content segments.
Loudpocket
Audio
AI stem separation for audio tracks
Loudpocket uses AI to separate audio stems from mixed tracks, isolating vocals, drums, bass, and instruments for remixing, sampling, and music production.
Lalal AI
Audio
AI vocal and music stem splitter
Lalal AI is an AI-powered audio stem separation tool that extracts vocals, accompaniment, drums, bass, piano, and other stems from any audio or video file with high precision.
Moises App
Audio
AI music practice and stem separation
Moises App uses AI to separate audio stems, detect chords and beats, and change pitch and tempo, helping musicians practice, learn songs, and create remixes.
Adobe Podcast AI
Audio
AI audio enhancement for podcasters
Adobe Podcast AI offers AI-powered audio recording, transcription, and enhancement tools that remove background noise and improve voice quality for podcasters and creators.
Cleanvoice AI
Audio
AI podcast audio cleaning tool
Cleanvoice AI automatically removes filler sounds, mouth noises, stuttering, and dead air from podcast recordings, delivering clean, professional-sounding audio files.
Auphonic
Audio
AI audio post-production tool
Auphonic is an AI audio post-production service that automatically levels loudness, reduces noise, filters music, and encodes audio for podcasts, radio, and video.
Descript Overdub
Audio
AI voice cloning for podcast editing
Descript Overdub allows podcasters and video creators to clone their own voice and fix recording mistakes by typing corrections, making post-production seamless and fast.
Altered AI
Audio
Professional AI voice changer studio
Altered AI is a professional voice transformation platform that lets creators and actors change their voice to any AI voice in real time or post-production for media projects.
Wellsaid Labs
Audio
Enterprise AI voiceover platform
WellSaid Labs provides studio-quality AI voiceovers for enterprises, offering natural-sounding AI voices for e-learning, marketing, and product experiences.
Speechelo
Audio
AI text-to-speech for video creators
Speechelo transforms text into natural-sounding voiceovers with AI voices in multiple languages and accents, designed for YouTube creators, marketers, and educators.
Narakeet
Audio
AI narration and video maker from scripts
Narakeet creates narrated videos and audio files from scripts and presentations using realistic AI voices, ideal for e-learning and tutorial creators.
Verbatik
Audio
AI text-to-speech with neural voices
Verbatik is a cloud-based text-to-speech platform with 600+ neural AI voices across 142 languages, supporting SSML and commercial usage for creators and developers.
VoiceOverMaker
Audio
Online AI voiceover generator
VoiceOverMaker is an online AI text-to-speech tool that creates professional voiceovers from text with natural voices for videos, ads, and presentations.
Replica Studios
Audio
AI voice actors for games and media
Replica Studios provides AI voice acting technology for game developers and media creators, enabling realistic character voices and rapid dialogue generation for productions.
Kits AI
Audio
AI voice conversion for music production
Kits AI is a music-focused AI voice platform for converting vocals to different AI singing voices, creating custom voice models, and producing music with AI tools.
Voicify AI
Audio
AI singing voice cover creator
Voicify AI enables users to create AI-generated song covers by applying famous artist voice models to any audio track for creative and entertainment purposes.
Listnr
Audio
AI podcast and audio content platform
Listnr is an AI audio platform that converts blog posts into podcasts using realistic AI voices, creates custom radio shows, and distributes audio content across platforms.
Podcastle AI
Audio
AI studio for podcast production
Podcastle AI Studio provides browser-based remote podcast recording with AI noise removal, voice enhancement, automatic transcription, and editing for independent podcasters.
Hume AI
Audio
Emotionally intelligent AI voice API
Hume AI builds emotion AI models that understand and express human emotions through voice, enabling empathic voice interfaces and emotional intelligence in AI applications.
Ava AI
Audio
AI captioning for deaf and hard-of-hearing
Ava AI provides real-time AI-powered captioning for conversations, meetings, and events, making communication accessible for deaf and hard-of-hearing individuals.
Happy Scribe
Audio
AI transcription and subtitle generator
Happy Scribe is an AI-powered transcription and subtitle platform supporting 120+ languages, used by journalists, podcasters, and video creators for accurate captions.
Sonix AI
Audio
AI automated transcription platform
Sonix is an AI-powered automated transcription, translation, and subtitling service used by media professionals, researchers, and legal teams for accurate, fast results.
Rev AI
Audio
AI speech recognition and transcription API
Rev AI offers enterprise-grade speech recognition and transcription APIs with human review options, captions, and audio intelligence for developers and businesses.
Verbit AI
Audio
AI transcription for legal and education
Verbit AI is an AI-powered transcription and captioning platform specialized for legal, education, and media industries with human-in-the-loop quality assurance.
Gladia AI
Audio
Real-time AI speech recognition API
Gladia AI provides a fast, accurate speech transcription API with speaker diarization, word-level timestamps, and real-time streaming for developers building voice apps.
Neuphonic
Audio
Ultra-low latency AI speech synthesis
Neuphonic provides ultra-low latency text-to-speech API for real-time conversational AI applications, enabling natural voice interactions with minimal delay.
Cartesia AI
Audio
Real-time AI voice generation API
Cartesia AI offers a real-time voice generation platform with state-space model architecture for ultra-low latency, expressive, and customizable AI voices for applications.
Speechgen AI
Audio
AI text-to-speech with 150 voices
Speechgen AI is a text-to-speech platform with 150+ realistic AI voices across 50+ languages for creating voiceovers for videos, audiobooks, and e-learning courses.
Eleven Reader
Audio
AI reading app with ElevenLabs voices
Eleven Reader is an AI reading app powered by ElevenLabs that converts articles, PDFs, and RSS feeds into high-quality audio using natural-sounding AI voices.
Natural Reader
Audio
AI text-to-speech reading tool
NaturalReader is an AI text-to-speech tool that converts written text from documents, PDFs, and web pages into natural-sounding audio for learning and accessibility.
Balabolka
Audio
Free AI text-to-speech desktop tool
Balabolka is a free desktop text-to-speech application that converts text from various file formats into spoken audio using AI voices for accessibility and productivity.
Pimsleur AI
Audio
AI-enhanced audio language learning
Pimsleur uses AI-powered speech recognition to evaluate pronunciation and personalize language learning through its proven audio-based method for 51 languages.
Elsa Speak
Audio
AI English pronunciation coach
ELSA Speak is an AI-powered English pronunciation app that uses speech recognition to identify pronunciation errors and provide personalized accent coaching.
Bandwidth AI
Audio
AI voice and messaging network platform
Bandwidth AI is a cloud communications platform with AI-powered voice intelligence, call transcription, and real-time conversation analytics for enterprise applications.
Speechmatics
Audio
Enterprise AI speech recognition API
Speechmatics provides enterprise-grade automatic speech recognition with high accuracy across 50+ languages and accents for real-time and batch transcription needs.
Kaldi AI
Audio
Open-source speech recognition toolkit
Kaldi is a state-of-the-art open-source speech recognition toolkit written in C++ widely used by researchers and developers for building custom speech recognition systems.
Mozilla DeepSpeech
Audio
Open-source AI speech recognition engine
Mozilla DeepSpeech is an open-source speech-to-text engine based on Baidu's Deep Speech research, enabling developers to build offline speech recognition applications.
Vosk AI
Audio
Offline speech recognition toolkit
Vosk is an offline open-source speech recognition toolkit supporting 20+ languages with small model sizes suitable for mobile, IoT, and embedded device deployment.
Picovoice
Audio
On-device AI voice recognition platform
Picovoice is an on-device voice AI platform that provides wake word detection, speech recognition, and natural language understanding for privacy-first voice applications.
Speechbrain
Audio
Open-source AI speech processing toolkit
SpeechBrain is an open-source PyTorch-based speech processing toolkit for building speech recognition, speaker verification, speech enhancement, and synthesis systems.
Whisper JAX
Audio
Accelerated Whisper transcription model
Whisper JAX is an optimized implementation of OpenAI's Whisper speech recognition model using JAX, providing up to 70x faster transcription for audio processing.
Camb AI
Audio
AI video dubbing and translation tool
Camb AI provides AI-powered video dubbing and translation services with voice cloning, lip sync, and multilingual support for content creators reaching global audiences.
AutoPod AI
Audio
AI automated podcast video editor
AutoPod AI is a set of Adobe Premiere Pro plugins that automate podcast video editing including multi-camera switching, social clip creation, and sequence editing.
Uncut AI
Audio
AI podcast video clipping tool
Uncut AI analyzes podcast episodes and automatically finds the most engaging highlights to create short clips optimized for social media with captions.
Spext AI
Audio
AI podcast editing by text
Spext AI is an intelligent audio and video editing tool that lets you edit recordings by editing the transcript text, with automatic filler word and silence removal.
Alitu AI
Audio
AI podcast maker tool
Alitu AI is an all-in-one podcast maker that automates audio cleanup, transcript creation, chapter markers, and episode publishing for independent podcast creators.
Zencastr AI
Audio
AI podcast recording and editing platform
Zencastr AI is a podcast production platform with AI audio enhancement, automatic transcription, video recording, and one-click audiogram creation for podcasters.
Buzzsprout AI
Audio
AI podcast hosting and promotion platform
Buzzsprout AI adds artificial intelligence to podcast hosting with automated transcription, chapter generation, and AI-powered episode optimization for discoverability.
Transistor AI
Audio
AI podcast hosting with analytics
Transistor AI is a podcast hosting platform with AI transcription, smart show notes generation, and advanced analytics for professional podcast publishers.
Deciphr AI
Audio
AI content repurposer for podcasters
Deciphr AI transforms podcast audio into timestamps, show notes, blog posts, and social media content automatically using AI transcription and summarization.
Podsqueeze AI
Audio
AI podcast content repurposing tool
Podsqueeze AI generates show notes, timestamps, newsletters, tweets, and blog posts from podcast episodes automatically using AI transcription and content generation.
Swell AI
Audio
AI content repurposing for podcasters
Swell AI automatically repurposes podcast and video content into written articles, social posts, transcripts, and newsletters to maximize content reach and ROI.