Tongyi Tingwu and joinly are both AI meeting assistants for recording, transcription, and summaries, compared here on pricing, features, and workflow fit. Tongyi Tingwu: Alibaba Cloud's AI assistant that transcribes, translates, and summarizes meetings and lectures in real time. joinly: Open-source, self-hostable connector that lets AI agents join Google Meet, Zoom, and Microsoft Teams calls to transcribe, listen, and act in real time via MCP. They overlap on ai-meeting-assistants, so the right pick depends on team size, budget, and which meeting workflows you automate.
For ai-meeting-assistants workflows, shortlist Tongyi Tingwu when chinese professionals transcribing and summarizing meetings in real time matters most, and joinly when building custom ai meeting agents that answer questions and run tasks during live calls matters most. Both record across Zoom, Google Meet, and Microsoft Teams; trial each on real meetings before committing.
Open-source, self-hostable connector that lets AI agents join Google Meet, Zoom, and Microsoft Teams calls to transcribe, listen, and act in real time via MCP.
Cross-platform support for Google Meet, Zoom, Microsoft Teams, and browser-based calls
Docker-based self-hosting with optional CUDA GPU image
MCP server that exposes meeting tools (join/leave, transcript, chat, audio control, snapshots) to AI agents
Tongyi Tingwu is a free tier with paid upgrades (freemium); joinly is a free tier with paid upgrades (freemium). Always confirm current pricing on each vendor's site before buying.
MCP server that exposes meeting tools (join/leave, transcript, chat, audio control, snapshots) to AI agents
Standout feature
Speech translation across multiple languages
Real-time transcription with timestamps and speaker information, subscribable for live updates
Team usage
LLM-based summarization of audio and video content
Cross-platform support for Google Meet, Zoom, Microsoft Teams, and browser-based calls
Integrations
Speaker separation for attributed transcripts
Modular speech-to-text and text-to-speech backends (Whisper, Deepgram, Kokoro, ElevenLabs)
Languages & capture
Slide/PPT extraction from video content
Model-agnostic: works with OpenAI, Anthropic, and local LLMs via Ollama
Best-fit workflow
Web interface and browser extension access
Docker-based self-hosting with optional CUDA GPU image
Best for
Tongyi Tingwu
Choose Tongyi Tingwu if you need chinese professionals transcribing and summarizing meetings in real time — strengths include backed by alibaba cloud with llm-powered summarization.
joinly
Choose joinly if you need building custom ai meeting agents that answer questions and run tasks during live calls — strengths include fully open source (mit) and self-hostable for complete data control.
Pros & cons
Tongyi Tingwu
+ Backed by Alibaba Cloud with LLM-powered summarization
+ Real-time transcription and translation with speaker separation
- Primarily oriented to the Chinese market and Chinese/English content
joinly
+ Fully open source (MIT) and self-hostable for complete data control
+ Agents can actively participate by voice and chat, not just passively transcribe
- Developer-oriented framework that requires setup and engineering effort rather than a ready-made app
FAQ
Is Tongyi Tingwu or joinly better for AI meeting notes?
It depends on your workflow. Tongyi Tingwu is strong for chinese professionals transcribing and summarizing meetings in real time, while joinly is strong for building custom ai meeting agents that answer questions and run tasks during live calls. Both transcribe and summarize meetings.
How do Tongyi Tingwu and joinly compare on price?
Tongyi Tingwu is a free tier with paid upgrades and joinly is a free tier with paid upgrades. Check each vendor's pricing page for the latest plans and free-tier limits.
Can I use both Tongyi Tingwu and joinly?
Yes. Many teams run more than one meeting assistant when the workflows are complementary and the budget is justified.
Tongyi Tingwu vs joinly: Pricing, Features & Recommendation | Hosiqo