
WhisperX
Fast, word-level ASR with speaker diarization and 70× realtime speed
- Stars
- 19,013
- License
- BSD-2-Clause
- Last commit
- 1 month ago
Explore leading tools in the Speech-to-Text & Dictation category, including open-source options and SaaS products. Compare features, use cases, and find the best fit for your workflow.
7 open-source projects · 3 SaaS products
These projects are active, self-hostable choices for knowledge management teams evaluating alternatives to SaaS tools.

Fast, word-level ASR with speaker diarization and 70× realtime speed

Instantly transcribe speech to any active window with a keystroke

Dictate anywhere, get instant AI-powered transcription with privacy options
Fast, word-level ASR with speaker diarization and 70× realtime speed
VoiceInk delivers near‑instant, 99% accurate transcription on macOS, fully offline for privacy, with smart context awareness, custom dictionaries, global shortcuts, and AI assistant features.
Expect a strong TypeScript presence among maintained projects.
Understand the commercial incumbents teams migrate from and how many open-source alternatives exist for each product.
AI meeting assistant for transcription and automated note-taking
Real-time transcription and translation API
Voice AI and speech recognition technology
Otter.ai provides real-time transcription, meeting summaries, and action items with up to 95% accuracy. It integrates with video conferencing platforms and CRM systems.
Frequently replaced when teams want private deployments and lower TCO.
Browse neighbouring categories in Applications to widen your evaluation.