Willow Voice and joinly are both AI meeting assistants for recording, transcription, and summaries, compared here on pricing, features, and workflow fit. Willow Voice: Willow Voice is a system-wide AI dictation app for Mac, Windows, and iPhone that converts speech into cleaned-up, formatted text and inserts it at the cursor in any application. joinly: Open-source, self-hostable connector that lets AI agents join Google Meet, Zoom, and Microsoft Teams calls to transcribe, listen, and act in real time via MCP. They overlap on ai-meeting-assistants, so the right pick depends on team size, budget, and which meeting workflows you automate.
For ai-meeting-assistants workflows, shortlist Willow Voice when drafting emails, slack messages, and chat replies by speaking instead of typing matters most, and joinly when building custom ai meeting agents that answer questions and run tasks during live calls matters most. Both record across Zoom, Google Meet, and Microsoft Teams; trial each on real meetings before committing.
Willow Voice is a system-wide AI dictation app for Mac, Windows, and iPhone that converts speech into cleaned-up, formatted text and inserts it at the cursor in any application.
AI mode that expands brief spoken notes into complete, polished messagesAutomatic cleanup that removes filler words, fixes punctuation, and applies formattingCross-platform availability on Mac, Windows, and iPhone with support for many languages
Open-source, self-hostable connector that lets AI agents join Google Meet, Zoom, and Microsoft Teams calls to transcribe, listen, and act in real time via MCP.
Cross-platform support for Google Meet, Zoom, Microsoft Teams, and browser-based callsDocker-based self-hosting with optional CUDA GPU imageMCP server that exposes meeting tools (join/leave, transcript, chat, audio control, snapshots) to AI agents
Willow Voice is a free tier with paid upgrades (freemium); joinly is a free tier with paid upgrades (freemium). Always confirm current pricing on each vendor's site before buying.
System-wide dictation that inserts transcribed text at the cursor in any app via a hotkey
MCP server that exposes meeting tools (join/leave, transcript, chat, audio control, snapshots) to AI agents
Standout feature
Automatic cleanup that removes filler words, fixes punctuation, and applies formatting
Real-time transcription with timestamps and speaker information, subscribable for live updates
Team usage
Style and tone matching that learns and adapts to your writing across apps
Cross-platform support for Google Meet, Zoom, Microsoft Teams, and browser-based calls
Integrations
AI mode that expands brief spoken notes into complete, polished messages
Modular speech-to-text and text-to-speech backends (Whisper, Deepgram, Kokoro, ElevenLabs)
Languages & capture
Voice commands for formatting (e.g. new line, bullet point) and an auto-learning dictionary for names and terms
Model-agnostic: works with OpenAI, Anthropic, and local LLMs via Ollama
Best-fit workflow
Cross-platform availability on Mac, Windows, and iPhone with support for many languages
Docker-based self-hosting with optional CUDA GPU image
Best for
Willow Voice
Choose Willow Voice if you need drafting emails, slack messages, and chat replies by speaking instead of typing — strengths include works everywhere you type instead of being locked to a single app or note interface.
joinly
Choose joinly if you need building custom ai meeting agents that answer questions and run tasks during live calls — strengths include fully open source (mit) and self-hostable for complete data control.
Pros & cons
Willow Voice
+ Works everywhere you type instead of being locked to a single app or note interface
+ Cleans up and formats speech automatically, producing usable text rather than raw transcripts
- It is a cursor-based dictation tool, not a meeting or call recorder, so it does not record or summarize live conversations
joinly
+ Fully open source (MIT) and self-hostable for complete data control
+ Agents can actively participate by voice and chat, not just passively transcribe
- Developer-oriented framework that requires setup and engineering effort rather than a ready-made app
FAQ
Is Willow Voice or joinly better for AI meeting notes?
It depends on your workflow. Willow Voice is strong for drafting emails, slack messages, and chat replies by speaking instead of typing, while joinly is strong for building custom ai meeting agents that answer questions and run tasks during live calls. Both transcribe and summarize meetings.
How do Willow Voice and joinly compare on price?
Willow Voice is a free tier with paid upgrades and joinly is a free tier with paid upgrades. Check each vendor's pricing page for the latest plans and free-tier limits.
Can I use both Willow Voice and joinly?
Yes. Many teams run more than one meeting assistant when the workflows are complementary and the budget is justified.