Audio & Speech Models
Explore 12+ AI models for speech recognition, voice synthesis, and audio generation across multiple languages.
Whisper
OpenAI
Pricing
$0.006 per minute
Language Support
99 languages supported
Key Capabilities
Best Use Case
Transcription services, multilingual content, accessibility
ElevenLabs
ElevenLabs
Pricing
$5/month Starter, $22/month Creator
Language Support
29 languages supported
Key Capabilities
Best Use Case
Voice-overs, audiobooks, interactive applications
Azure Speech
Microsoft
Pricing
$1 per hour (Standard voices)
Language Support
100+ languages and variants
Key Capabilities
Best Use Case
Enterprise applications, accessibility, customer service
Bark
Suno AI
Pricing
Open Source (hosting costs apply)
Language Support
Multilingual with accent support
Key Capabilities
Best Use Case
Creative projects, research, custom voice applications
MusicLM
Pricing
Research access
Language Support
Text prompts in English
Key Capabilities
Best Use Case
Music composition, creative applications, research
Murf AI
Murf
Pricing
$19/month Basic, $26/month Pro
Language Support
20+ languages
Key Capabilities
Best Use Case
Professional voice-overs, presentations, e-learning