Whisper
OpenAI's open-source speech recognition model
Voice & AudioFree (open-source)★ 72,000Works with OpenClaw
About Whisper
Whisper is OpenAI's open-source automatic speech recognition model. It can transcribe and translate audio in 99 languages with remarkable accuracy, and can be run locally on consumer hardware.
Features
Speech-to-text
Multi-language
Translation
Local running
High accuracy
The tally
FOR
- +Best open-source speech recognition
- +99 language support
- +Translation capability
- +Free and open-source
- +Runs locally
AGAINST
- −Slower than commercial APIs
- −Requires GPU for real-time
- −No speaker diarization
- −Large model file sizes
Related concepts
Kept nearby
Resemble AI
Voice cloning with deepfake detection built in
From $29/mo
ElevenLabs
AI voice synthesis and cloning platform
Free + from $6/mo
Descript
Edit audio and video by editing the transcript
Free + from $16/mo
Cartesia
Ultra-low-latency realtime voice AI (Sonic)
Free tier + usage-based
Browse all Voice & Audio tools →