Cartesia
Ultra-low-latency realtime voice AI (Sonic)
Voice & Audio · Free tier + usage-based
About Cartesia
Cartesia’s Sonic models deliver text-to-speech with sub-100ms latency for realtime voice agents, along with instant voice cloning and on-device deployment options.
Features
Sub-100ms TTS
Instant voice cloning
Realtime API
On-device models
Kept nearby
Resemble AI
Voice cloning with deepfake detection built in
From $29/mo
ElevenLabs
AI voice synthesis and cloning platform
Free + from $6/mo
Descript
Edit audio and video by editing the transcript
Free + from $16/mo
Whisper
OpenAI's open-source speech recognition model
Free (open-source) · ★ 72,000