Microsoft Research has released VibeVoice, an open-source text-to-speech (TTS) model for expressive, multi-speaker dialogue. It can generate conversations up to 90 minutes long with up to four speakers, using continuous speech tokenisers that run at 7.5 Hz to keep audio quality high. Built on Qwen2.5-1.5B with a 123M-parameter diffusion head, VibeVoice supports English and Chinese, embeds watermarks for safety, and is available on GitHub under the MIT License for research use.
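To give a sense of scale for the 7.5 Hz figure, the short sketch below works out how many tokeniser frames a conversation of a given length would occupy. It assumes one frame per tokeniser step and ignores text tokens and any other overhead; the function name `frames_for_duration` is hypothetical and not part of the VibeVoice code.

```python
# Back-of-the-envelope frame count for a speech tokeniser running at 7.5 Hz.
# Assumption: one frame per tokeniser step; text tokens are not counted.

FRAME_RATE_HZ = 7.5   # tokeniser frame rate quoted in the article
MAX_MINUTES = 90      # maximum conversation length quoted in the article


def frames_for_duration(minutes: float, frame_rate_hz: float = FRAME_RATE_HZ) -> int:
    """Return the number of tokeniser frames needed for `minutes` of audio."""
    seconds = minutes * 60
    return round(seconds * frame_rate_hz)


if __name__ == "__main__":
    total = frames_for_duration(MAX_MINUTES)
    print(f"{MAX_MINUTES} minutes at {FRAME_RATE_HZ} Hz = {total} frames")
```

Under these assumptions, a full 90-minute conversation comes to 40,500 frames, and one minute of audio to 450.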