Audio AI Tools Directory

Audio

Audio AI tools

AI tools for voice generation, speech synthesis, audio editing and voiceovers. Explore 82 curated tools in this category.

ElevenLabs

Audio

Freemium

Realistic AI voice and speech synthesis

ElevenLabs provides state-of-the-art AI voice generation with natural-sounding text-to-speech, voice cloning, and multilingual support for creators, publishers, and developers.

#AI Voice Generator #Text to Speech

No reviewLaunched 2022

View details

Murf AI

Audio

Freemium

Studio-quality AI voiceover generator

Murf AI is a voice generation platform offering a library of lifelike AI voices for creating voiceovers for videos, presentations, podcasts, and e-learning content.

#AI Voice Generator #Text to Speech

No reviewLaunched 2020

View details

Descript

Audio

Freemium

AI-powered podcast and video editor

Descript is an AI-driven audio and video editing platform that lets users edit media by editing text transcripts, with features like overdub voice cloning and automatic filler word removal.

#AI Voice Generator #Speech to Text #AI Video Generator

No reviewLaunched 2017

View details

Krisp

Audio

Freemium

AI noise cancellation for calls

Krisp is an AI-powered noise cancellation app that removes background noise, echo, and voices from calls in real time, improving audio quality for remote workers and podcasters.

#AI Productivity

No reviewLaunched 2017

View details

Voicemod

Audio

Freemium

Real-time AI voice changer and effects

Voicemod is an AI-powered real-time voice changer and soundboard app for gamers, streamers, and content creators, offering hundreds of voice effects and custom voice creation.

#AI Voice Generator

No reviewLaunched 2014

View details

Resemble AI

Audio

Freemium

AI voice cloning and synthesis platform

Resemble AI enables developers and enterprises to create custom AI voices, clone existing voices, and generate realistic speech for games, virtual assistants, and media.

#AI Voice Generator #Text to Speech

No reviewLaunched 2019

View details

Soundraw

Audio

Paid

AI music generator for creators

Soundraw is an AI music generation platform that creates royalty-free, customizable music tracks for videos, podcasts, and social media based on mood, genre, and length.

#AI Productivity

No reviewLaunched 2020

View details

Aiva

Audio

Freemium

AI music composition assistant

Aiva is an AI music composition tool that generates original soundtracks for films, games, ads, and other media in a wide variety of styles and emotional moods.

#AI Productivity

No reviewLaunched 2016

View details

Podcastle

Audio

Freemium

AI podcast recording and editing platform

Podcastle is an AI-powered podcast creation platform with studio-quality recording, automatic transcription, AI voice cloning, and one-click audio enhancement.

#AI Voice Generator #Speech to Text

No reviewLaunched 2019

View details

Fliki

Audio

Freemium

AI text-to-video with voiceover

Fliki converts text scripts and blog posts into engaging videos using realistic AI voiceovers and a rich media library, perfect for social media and marketing content.

#Text to Video #AI Video Generator #AI Voice Generator

No reviewLaunched 2021

View details

Synthflow AI

Audio

Freemium

No-code AI voice agent builder

Synthflow AI enables businesses to build and deploy AI voice agents for phone calls, customer support, and appointment booking without any coding required.

#AI Voice Generator #AI Agents #Conversational AI

No reviewLaunched 2023

View details

Speechify

Audio

Freemium

AI text-to-speech reading assistant

Speechify converts any written content including PDFs, articles, emails, and books into high-quality audio using AI voices, helping users consume content faster.

#Text to Speech #AI Productivity

No reviewLaunched 2017

View details

Whisper AI

Audio

Free

OpenAI's speech recognition model

Whisper is an open-source automatic speech recognition system by OpenAI trained on large-scale multilingual data, offering highly accurate transcription across dozens of languages.

#Speech to Text

No reviewLaunched 2022

View details

AssemblyAI

Audio

Freemium

AI speech recognition API for developers

AssemblyAI provides a powerful speech-to-text API with speaker diarization, sentiment analysis, content moderation, and auto-highlights for developers building audio-powered apps.

#Speech to Text #AI Research Assistant

No reviewLaunched 2017

View details

Deepgram

Audio

Freemium

Enterprise AI speech recognition API

Deepgram is an AI-powered speech recognition platform offering real-time and batch transcription APIs with high accuracy, low latency, and custom model training for enterprises.

#Speech to Text

No reviewLaunched 2015

View details

Rask AI

Audio

Paid

AI video translation and dubbing tool

Rask AI translates and dubs videos into 130+ languages using AI voice cloning and lip-sync technology, helping creators and businesses reach global audiences instantly.

#AI Video Generator #AI Voice Generator

No reviewLaunched 2023

View details

Papercup

Audio

Paid

AI video dubbing for global reach

Papercup is an AI dubbing platform that translates video content into multiple languages with realistic AI voices, preserving the speaker's tone and personality.

#AI Voice Generator #Text to Speech

No reviewLaunched 2017

View details

Dubverse

Audio

Freemium

AI multilingual video dubbing platform

Dubverse is an AI-powered video dubbing solution that helps content creators and enterprises translate and dub videos in multiple languages with natural-sounding AI voices.

#AI Voice Generator #AI Video Generator

No reviewLaunched 2021

View details

Typecast

Audio

Freemium

AI voice actor and video creation tool

Typecast is an AI voice and video content platform with a library of AI voice actors for creating narrated videos, podcasts, e-learning content, and audiobooks.

#AI Voice Generator #Text to Speech #AI Video Generator

No reviewLaunched 2020

View details

Play.ht

Audio

Freemium

AI voice generator and podcast creator

Play.ht is an AI voice generation platform offering ultra-realistic text-to-speech voices in 900+ styles, used for podcasts, audiobooks, e-learning, and accessibility.

#AI Voice Generator #Text to Speech

No reviewLaunched 2016

View details

Lovo AI

Audio

Freemium

AI voice generator with 500+ voices

Lovo AI is a voice synthesis platform with over 500 AI voices in 100+ languages, offering voice cloning, text-to-speech, and AI video creation for media professionals.

#AI Voice Generator #Text to Speech

No reviewLaunched 2019

View details

Coqui AI

Audio

Free

Open-source AI voice cloning

Coqui AI provides open-source text-to-speech and voice cloning technology for developers, offering state-of-the-art models for building custom voice applications.

#AI Voice Generator #Text to Speech

No reviewLaunched 2021

View details

Uberduck

Audio

Freemium

AI voice synthesis for music and media

Uberduck is an AI voice synthesis platform with thousands of expressive AI voices used for music production, rap vocals, meme creation, and creative content.

#AI Voice Generator #Text to Speech

No reviewLaunched 2021

View details

Suno AI

Audio

Freemium

AI music generation from text prompts

Suno AI generates complete, high-quality songs with vocals, instruments, and lyrics from simple text prompts, making music creation accessible to everyone.

#AI Productivity

No reviewLaunched 2023

View details

Udio

Audio

Freemium

AI music composition and generation

Udio is an AI music generation platform that creates original, professional-quality songs across any genre from text descriptions, with fine-grained style and mood controls.

#AI Productivity

No reviewLaunched 2024

View details

Boomy

Audio

Freemium

AI music creator for instant songs

Boomy enables anyone to create original AI-generated songs in seconds and submit them to streaming platforms to earn royalties, democratizing music production.

#AI Productivity

No reviewLaunched 2018

View details

Loudly

Audio

Freemium

AI music generator for content creators

Loudly is an AI music generation platform that creates royalty-free background music for videos, podcasts, and social media based on genre, tempo, and mood selection.

#AI Productivity

No reviewLaunched 2014

View details

Ecrett Music

Audio

Paid

AI royalty-free music for video creators

Ecrett Music uses AI to generate royalty-free background music for videos and content by selecting scene type, mood, and genre, making music production simple and affordable.

#AI Productivity

No reviewLaunched 2019

View details

Beatoven AI

Audio

Freemium

AI background music composer for videos

Beatoven AI creates unique, mood-based royalty-free music for videos and podcasts by composing tracks that adapt to the emotional needs of different content segments.

#AI Productivity

No reviewLaunched 2021

View details

Loudpocket

Audio

Freemium

AI stem separation for audio tracks

Loudpocket uses AI to separate audio stems from mixed tracks, isolating vocals, drums, bass, and instruments for remixing, sampling, and music production.

#AI Productivity

No reviewLaunched 2022

View details

Lalal AI

Audio

Freemium

AI vocal and music stem splitter

Lalal AI is an AI-powered audio stem separation tool that extracts vocals, accompaniment, drums, bass, piano, and other stems from any audio or video file with high precision.

#AI Productivity

No reviewLaunched 2020

View details

Moises App

Audio

Freemium

AI music practice and stem separation

Moises App uses AI to separate audio stems, detect chords and beats, and change pitch and tempo, helping musicians practice, learn songs, and create remixes.

#AI Productivity

No reviewLaunched 2019

View details

Adobe Podcast AI

Audio

Free

AI audio enhancement for podcasters

Adobe Podcast AI offers AI-powered audio recording, transcription, and enhancement tools that remove background noise and improve voice quality for podcasters and creators.

#Speech to Text #AI Productivity

No reviewLaunched 2022

View details

Cleanvoice AI

Audio

Paid

AI podcast audio cleaning tool

Cleanvoice AI automatically removes filler sounds, mouth noises, stuttering, and dead air from podcast recordings, delivering clean, professional-sounding audio files.

#AI Productivity

No reviewLaunched 2021

View details

Auphonic

Audio

Freemium

AI audio post-production tool

Auphonic is an AI audio post-production service that automatically levels loudness, reduces noise, filters music, and encodes audio for podcasts, radio, and video.

#AI Productivity

No reviewLaunched 2011

View details

Descript Overdub

Audio

Paid

AI voice cloning for podcast editing

Descript Overdub allows podcasters and video creators to clone their own voice and fix recording mistakes by typing corrections, making post-production seamless and fast.

#AI Voice Generator #Text to Speech

No reviewLaunched 2020

View details

Altered AI

Audio

Freemium

Professional AI voice changer studio

Altered AI is a professional voice transformation platform that lets creators and actors change their voice to any AI voice in real time or post-production for media projects.

#AI Voice Generator #Text to Speech

No reviewLaunched 2018

View details

Wellsaid Labs

Audio

Paid

Enterprise AI voiceover platform

WellSaid Labs provides studio-quality AI voiceovers for enterprises, offering natural-sounding AI voices for e-learning, marketing, and product experiences.

#AI Voice Generator #Text to Speech

No reviewLaunched 2018

View details

Speechelo

Audio

Paid

AI text-to-speech for video creators

Speechelo transforms text into natural-sounding voiceovers with AI voices in multiple languages and accents, designed for YouTube creators, marketers, and educators.

#Text to Speech #AI Voice Generator

No reviewLaunched 2020

View details

Narakeet

Audio

Freemium

AI narration and video maker from scripts

Narakeet creates narrated videos and audio files from scripts and presentations using realistic AI voices, ideal for e-learning and tutorial creators.

#Text to Speech #AI Voice Generator #AI Video Generator

No reviewLaunched 2020

View details

Verbatik

Audio

Paid

AI text-to-speech with neural voices

Verbatik is a cloud-based text-to-speech platform with 600+ neural AI voices across 142 languages, supporting SSML and commercial usage for creators and developers.

#Text to Speech #AI Voice Generator

No reviewLaunched 2021

View details

VoiceOverMaker

Audio

Freemium

Online AI voiceover generator

VoiceOverMaker is an online AI text-to-speech tool that creates professional voiceovers from text with natural voices for videos, ads, and presentations.

#Text to Speech #AI Voice Generator

No reviewLaunched 2020

View details

Replica Studios

Audio

Freemium

AI voice actors for games and media

Replica Studios provides AI voice acting technology for game developers and media creators, enabling realistic character voices and rapid dialogue generation for productions.

#AI Voice Generator #Text to Speech

No reviewLaunched 2018

View details

Kits AI

Audio

Freemium

AI voice conversion for music production

Kits AI is a music-focused AI voice platform for converting vocals to different AI singing voices, creating custom voice models, and producing music with AI tools.

#AI Voice Generator

No reviewLaunched 2022

View details

Voicify AI

Audio

Paid

AI singing voice cover creator

Voicify AI enables users to create AI-generated song covers by applying famous artist voice models to any audio track for creative and entertainment purposes.

#AI Voice Generator

No reviewLaunched 2023

View details

Listnr

Audio

Freemium

AI podcast and audio content platform

Listnr is an AI audio platform that converts blog posts into podcasts using realistic AI voices, creates custom radio shows, and distributes audio content across platforms.

#Text to Speech #AI Voice Generator

No reviewLaunched 2021

View details

Podcastle AI

Audio

Freemium

AI studio for podcast production

Podcastle AI Studio provides browser-based remote podcast recording with AI noise removal, voice enhancement, automatic transcription, and editing for independent podcasters.

#Speech to Text #AI Voice Generator

No reviewLaunched 2021

View details

Hume AI

Audio

Freemium

Emotionally intelligent AI voice API

Hume AI builds emotion AI models that understand and express human emotions through voice, enabling empathic voice interfaces and emotional intelligence in AI applications.

#AI Voice Generator #AI Research Assistant

No reviewLaunched 2021

View details

Ava AI

Audio

Freemium

AI captioning for deaf and hard-of-hearing

Ava AI provides real-time AI-powered captioning for conversations, meetings, and events, making communication accessible for deaf and hard-of-hearing individuals.

#Speech to Text #AI Productivity

No reviewLaunched 2014

View details

Happy Scribe

Audio

Freemium

AI transcription and subtitle generator

Happy Scribe is an AI-powered transcription and subtitle platform supporting 120+ languages, used by journalists, podcasters, and video creators for accurate captions.

#Speech to Text #AI Productivity

No reviewLaunched 2017

View details

Sonix AI

Audio

Paid

AI automated transcription platform

Sonix is an AI-powered automated transcription, translation, and subtitling service used by media professionals, researchers, and legal teams for accurate, fast results.

#Speech to Text #AI Productivity

No reviewLaunched 2017

View details

Rev AI

Audio

Paid

AI speech recognition and transcription API

Rev AI offers enterprise-grade speech recognition and transcription APIs with human review options, captions, and audio intelligence for developers and businesses.

#Speech to Text

No reviewLaunched 2010

View details

Verbit AI

Audio

Paid

AI transcription for legal and education

Verbit AI is an AI-powered transcription and captioning platform specialized for legal, education, and media industries with human-in-the-loop quality assurance.

#Speech to Text

No reviewLaunched 2017

View details

Gladia AI

Audio

Freemium

Real-time AI speech recognition API

Gladia AI provides a fast, accurate speech transcription API with speaker diarization, word-level timestamps, and real-time streaming for developers building voice apps.

#Speech to Text

No reviewLaunched 2022

View details

Neuphonic

Audio

Freemium

Ultra-low latency AI speech synthesis

Neuphonic provides ultra-low latency text-to-speech API for real-time conversational AI applications, enabling natural voice interactions with minimal delay.

#Text to Speech #AI Voice Generator

No reviewLaunched 2023

View details

Cartesia AI

Audio

Freemium

Real-time AI voice generation API

Cartesia AI offers a real-time voice generation platform with state-space model architecture for ultra-low latency, expressive, and customizable AI voices for applications.

#Text to Speech #AI Voice Generator

No reviewLaunched 2023

View details

Speechgen AI

Audio

Freemium

AI text-to-speech with 150 voices

Speechgen AI is a text-to-speech platform with 150+ realistic AI voices across 50+ languages for creating voiceovers for videos, audiobooks, and e-learning courses.

#Text to Speech #AI Voice Generator

No reviewLaunched 2021

View details

Eleven Reader

Audio

Free

AI reading app with ElevenLabs voices

Eleven Reader is an AI reading app powered by ElevenLabs that converts articles, PDFs, and RSS feeds into high-quality audio using natural-sounding AI voices.

#Text to Speech #AI Productivity

No reviewLaunched 2024

View details

Natural Reader

Audio

Freemium

AI text-to-speech reading tool

NaturalReader is an AI text-to-speech tool that converts written text from documents, PDFs, and web pages into natural-sounding audio for learning and accessibility.

#Text to Speech #AI Voice Generator

No reviewLaunched 2003

View details

Balabolka

Audio

Free

Free AI text-to-speech desktop tool

Balabolka is a free desktop text-to-speech application that converts text from various file formats into spoken audio using AI voices for accessibility and productivity.

#Text to Speech

No reviewLaunched 2007

View details

Pimsleur AI

Audio

Paid

AI-enhanced audio language learning

Pimsleur uses AI-powered speech recognition to evaluate pronunciation and personalize language learning through its proven audio-based method for 51 languages.

#AI Assistant #Speech to Text

No reviewLaunched 2013

View details

Elsa Speak

Audio

Freemium

AI English pronunciation coach

ELSA Speak is an AI-powered English pronunciation app that uses speech recognition to identify pronunciation errors and provide personalized accent coaching.

#AI Assistant #Speech to Text

No reviewLaunched 2015

View details

Bandwidth AI

Audio

Paid

AI voice and messaging network platform

Bandwidth AI is a cloud communications platform with AI-powered voice intelligence, call transcription, and real-time conversation analytics for enterprise applications.

#Speech to Text #AI Voice Generator

No reviewLaunched 2001

View details

Speechmatics

Audio

Paid

Enterprise AI speech recognition API

Speechmatics provides enterprise-grade automatic speech recognition with high accuracy across 50+ languages and accents for real-time and batch transcription needs.

#Speech to Text

No reviewLaunched 2006

View details

Kaldi AI

Audio

Free

Open-source speech recognition toolkit

Kaldi is a state-of-the-art open-source speech recognition toolkit written in C++ widely used by researchers and developers for building custom speech recognition systems.

#Speech to Text #AI Research Assistant

No reviewLaunched 2009

View details

Mozilla DeepSpeech

Audio

Free

Open-source AI speech recognition engine

Mozilla DeepSpeech is an open-source speech-to-text engine based on Baidu's Deep Speech research, enabling developers to build offline speech recognition applications.

#Speech to Text #AI Code Assistant

No reviewLaunched 2017

View details

Vosk AI

Audio

Free

Offline speech recognition toolkit

Vosk is an offline open-source speech recognition toolkit supporting 20+ languages with small model sizes suitable for mobile, IoT, and embedded device deployment.

#Speech to Text #AI Code Assistant

No reviewLaunched 2019

View details

Picovoice

Audio

Freemium

On-device AI voice recognition platform

Picovoice is an on-device voice AI platform that provides wake word detection, speech recognition, and natural language understanding for privacy-first voice applications.

#Speech to Text #AI Voice Generator

No reviewLaunched 2016

View details

Speechbrain

Audio

Free

Open-source AI speech processing toolkit

SpeechBrain is an open-source PyTorch-based speech processing toolkit for building speech recognition, speaker verification, speech enhancement, and synthesis systems.

#Speech to Text #AI Research Assistant

No reviewLaunched 2021

View details

Whisper JAX

Audio

Free

Accelerated Whisper transcription model

Whisper JAX is an optimized implementation of OpenAI's Whisper speech recognition model using JAX, providing up to 70x faster transcription for audio processing.

#Speech to Text #AI Code Assistant

No reviewLaunched 2023

View details

Camb AI

Audio

Freemium

AI video dubbing and translation tool

Camb AI provides AI-powered video dubbing and translation services with voice cloning, lip sync, and multilingual support for content creators reaching global audiences.

#AI Video Generator #AI Voice Generator

No reviewLaunched 2023

View details

AutoPod AI

Audio

Paid

AI automated podcast video editor

AutoPod AI is a set of Adobe Premiere Pro plugins that automate podcast video editing including multi-camera switching, social clip creation, and sequence editing.

#AI Video Generator

No reviewLaunched 2022

View details

Uncut AI

Audio

Freemium

AI podcast video clipping tool

Uncut AI analyzes podcast episodes and automatically finds the most engaging highlights to create short clips optimized for social media with captions.

#AI Video Generator

No reviewLaunched 2023

View details

Spext AI

Audio

Freemium

AI podcast editing by text

Spext AI is an intelligent audio and video editing tool that lets you edit recordings by editing the transcript text, with automatic filler word and silence removal.

#Speech to Text #AI Video Generator

No reviewLaunched 2019

View details

Alitu AI

Audio

Paid

AI podcast maker tool

Alitu AI is an all-in-one podcast maker that automates audio cleanup, transcript creation, chapter markers, and episode publishing for independent podcast creators.

#Speech to Text #AI Productivity

No reviewLaunched 2015

View details

Zencastr AI

Audio

Freemium

AI podcast recording and editing platform

Zencastr AI is a podcast production platform with AI audio enhancement, automatic transcription, video recording, and one-click audiogram creation for podcasters.

#Speech to Text #AI Productivity

No reviewLaunched 2015

View details

Buzzsprout AI

Audio

Freemium

AI podcast hosting and promotion platform

Buzzsprout AI adds artificial intelligence to podcast hosting with automated transcription, chapter generation, and AI-powered episode optimization for discoverability.

#Speech to Text #AI Productivity

No reviewLaunched 2009

View details

Transistor AI

Audio

Paid

AI podcast hosting with analytics

Transistor AI is a podcast hosting platform with AI transcription, smart show notes generation, and advanced analytics for professional podcast publishers.

#Speech to Text #AI Productivity

No reviewLaunched 2018

View details

Deciphr AI

Audio

Freemium

AI content repurposer for podcasters

Deciphr AI transforms podcast audio into timestamps, show notes, blog posts, and social media content automatically using AI transcription and summarization.

#Speech to Text #AI Writing

No reviewLaunched 2022

View details

Podsqueeze AI

Audio

Freemium

AI podcast content repurposing tool

Podsqueeze AI generates show notes, timestamps, newsletters, tweets, and blog posts from podcast episodes automatically using AI transcription and content generation.

#Speech to Text #AI Writing #AI Content Generator

No reviewLaunched 2023

View details

Swell AI

Audio

Freemium

AI content repurposing for podcasters

Swell AI automatically repurposes podcast and video content into written articles, social posts, transcripts, and newsletters to maximize content reach and ROI.

#Speech to Text #AI Writing

No reviewLaunched 2022

View details

Castmagic AI

Audio

Paid

AI audio to content tool for creators

Castmagic AI converts podcast and video recordings into ready-to-publish content including show notes, blog posts, social clips, and newsletters with one click.

#Speech to Text #AI Writing #AI Content Generator

No reviewLaunched 2022

View details