Microsoft launches VibeVoice

Microsoft Research has launched VibeVoice, an open-source text-to-speech (TTS) model for expressive, multi-speaker dialogue. It can generate conversations of up to 90 minutes with as many as four speakers, and it uses continuous speech tokenisers running at 7.5 Hz for high-quality audio. The model is built on the Qwen2.5-1.5B language model paired with a 123M-parameter diffusion head. VibeVoice supports English and Chinese, embeds watermarks in its output as a safety measure, and is available on GitHub under the MIT License for research use.
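To put the 7.5 Hz figure in perspective, the short sketch below works out how many tokeniser frames a full-length session would span. It assumes one frame per tokeniser step, which the announcement does not spell out, and the function name `latent_frames` is purely illustrative and not part of VibeVoice.

```python
def latent_frames(duration_minutes: float, frame_rate_hz: float = 7.5) -> int:
    """Estimate the number of tokeniser frames for a clip.

    Assumption: one frame per tokeniser step at the stated frame rate.
    """
    seconds = duration_minutes * 60
    return round(seconds * frame_rate_hz)


# A 90-minute conversation at 7.5 Hz (both figures from the announcement)
print(latent_frames(90))  # 90 * 60 * 7.5 = 40500
```

Under that assumption, a 90-minute dialogue comes to about 40,500 frames, or 450 frames per minute of audio.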
