Speechmatics and Thoth are both AI meeting assistants for recording, transcription, and summaries, compared here on pricing, features, and workflow fit. Speechmatics: Speech-to-text and voice AI provider offering real-time transcription and live captioning APIs. Thoth: Privacy-first macOS app that records, transcribes, and summarizes meetings entirely on-device with no cloud or data leaving the Mac. They overlap on ai-meeting-assistants, ai-transcription, so the right pick depends on team size, budget, and which meeting workflows you automate.
For ai-meeting-assistants, ai-transcription workflows, shortlist Speechmatics when adding live captions to broadcasts, sports, and events matters most, and Thoth when professionals handling confidential or regulated information matters most. Both record across Zoom, Google Meet, and Microsoft Teams; trial each on real meetings before committing.
Speech-to-text and voice AI provider offering real-time transcription and live captioning APIs.
APIs for embedding transcription into other applicationsLive captioning for events, broadcasts, and streamsLow-latency real-time processing for live use
Speechmatics is a free tier with paid upgrades (freemium); Thoth is a free tier with paid upgrades (freemium). Always confirm current pricing on each vendor's site before buying.
Speech-to-text transcription for recorded and real-time audio
100% local, on-device recording, transcription, and summarization
Standout feature
Low-latency real-time processing for live use
Dual-channel capture of microphone and system audio
Team usage
Live captioning for events, broadcasts, and streams
On-device speaker detection with color-coded transcripts
Integrations
Multi-speaker and multilingual support across many languages
Whisper-based transcription supporting many languages
Languages & capture
APIs for embedding transcription into other applications
On-device anonymization to redact sensitive content
Best-fit workflow
Speech-to-text transcription for recorded and real-time audio
Export to PDF, Word, Markdown, and timestamped JSON
Best for
Speechmatics
Choose Speechmatics if you need adding live captions to broadcasts, sports, and events — strengths include real-time, low-latency transcription suitable for live captioning.
Thoth
Choose Thoth if you need professionals handling confidential or regulated information — strengths include fully local processing with no cloud requirement and no user database.
Pros & cons
Speechmatics
+ Real-time, low-latency transcription suitable for live captioning
+ Broad language and multi-speaker coverage
- Primarily a developer-facing engine rather than a ready-made app
Thoth
+ Fully local processing with no cloud requirement and no user database
+ Captures both sides of online meetings and works offline
- Requires a Mac with Apple Silicon and sufficient unified memory
FAQ
Is Speechmatics or Thoth better for AI meeting notes?
It depends on your workflow. Speechmatics is strong for adding live captions to broadcasts, sports, and events, while Thoth is strong for professionals handling confidential or regulated information. Both transcribe and summarize meetings.
How do Speechmatics and Thoth compare on price?
Speechmatics is a free tier with paid upgrades and Thoth is a free tier with paid upgrades. Check each vendor's pricing page for the latest plans and free-tier limits.
Can I use both Speechmatics and Thoth?
Yes. Many teams run more than one meeting assistant when the workflows are complementary and the budget is justified.