Whisper

OpenAI's open-source speech recognition model
Voice & AudioFree (open-source)72,000Works with OpenClaw

About Whisper

Whisper is OpenAI's open-source automatic speech recognition model. It can transcribe and translate audio in 99 languages with remarkable accuracy, and can be run locally on consumer hardware.

Features

Speech-to-text
Multi-language
Translation
Local running
High accuracy

The tally

FOR
  • +Best open-source speech recognition
  • +99 language support
  • +Translation capability
  • +Free and open-source
  • +Runs locally
AGAINST
  • Slower than commercial APIs
  • Requires GPU for real-time
  • No speaker diarization
  • Large model file sizes

Related concepts

Kept nearby

Browse all Voice & Audio tools →

Featured in