
Audio & Speech Models

Explore 12+ AI models for speech recognition, voice synthesis, and audio generation across multiple languages.

Whisper

OpenAI

Generally Available

Pricing

$0.006 per minute

Language Support

99 languages supported

Key Capabilities

Speech-to-text, translation, timestamps, multiple formats

Best Use Case

Transcription services, multilingual content, accessibility
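
Per-minute pricing such as the $0.006 rate listed for Whisper is easy to turn into a budget estimate. The sketch below is only an illustration: the function name and the assumption that billing is pro-rated by the second are hypothetical, and the rate should be checked against the provider's current price list.

```python
from decimal import Decimal

# Rate copied from the listing above; confirm against current provider pricing.
WHISPER_RATE_PER_MINUTE = Decimal("0.006")


def estimate_transcription_cost(
    duration_seconds: int,
    rate_per_minute: Decimal = WHISPER_RATE_PER_MINUTE,
) -> Decimal:
    """Estimate the cost in USD of transcribing audio of the given length.

    Assumes (hypothetically) that usage is pro-rated by the second.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    # Multiply before dividing so common durations stay exact in Decimal.
    cost = Decimal(duration_seconds) * rate_per_minute / Decimal(60)
    # Round to a hundredth of a cent for display.
    return cost.quantize(Decimal("0.0001"))


if __name__ == "__main__":
    # A 90-minute recording (5400 seconds).
    print(estimate_transcription_cost(5400))
```

Decimal is used instead of float so that small per-minute rates do not pick up binary rounding noise when summed over many files.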

ElevenLabs

ElevenLabs

Generally Available

Pricing

$5/month Starter, $22/month Creator

Language Support

29 languages supported

Key Capabilities

Voice cloning, real-time synthesis, emotion control, custom voices

Best Use Case

Voice-overs, audiobooks, interactive applications

Azure Speech

Microsoft

Generally Available

Pricing

$1 per hour (Standard voices)

Language Support

100+ languages and variants

Key Capabilities

Neural voices, SSML support, custom models, real-time streaming

Best Use Case

Enterprise applications, accessibility, customer service

Bark

Suno AI

Open Source

Pricing

Free to use (self-hosting costs apply)

Language Support

Multilingual with accent support

Key Capabilities

Realistic speech, multiple speakers, sound effects, emotional speech

Best Use Case

Creative projects, research, custom voice applications

MusicLM

Google

Research

Pricing

Research access

Language Support

Text prompts in English

Key Capabilities

Text-to-music, style transfer, long-form generation, conditional generation

Best Use Case

Music composition, creative applications, research

Murf AI

Murf

Generally Available

Pricing

$19/month Basic, $26/month Pro

Language Support

20+ languages

Key Capabilities

Studio-quality voices, voice changer, background music, team collaboration

Best Use Case

Professional voice-overs, presentations, e-learning