Microsoft VibeVoice: The Open-Source Frontier Voice AI That’s Changing the Game
Microsoft releases VibeVoice, an open-source frontier voice AI with 31,000+ stars on GitHub, bringing state-of-the-art conversational AI to developers worldwide.
Microsoft releases VibeVoice, an open-source frontier voice AI with 31,000+ stars on GitHub, bringing state-of-the-art conversational AI to developers worldwide.
Mistral AI releases Voxtral TTS, a free open-weight text-to-speech model that beats ElevenLabs in human evaluations. Enterprise voice AI just got a lot more interesting.
Deep-Live-Cam lets users perform real-time face swaps on video calls and recordings using just a single image. The open-source tool is powerful, accessible, and raising serious ethical questions about consent and deepfake…
Anthropic Claude can now control your Mac, click buttons, open apps, and execute tasks on your behalf 鈥?but early tests suggest it works about half the time.
Google TurboQuant algorithm achieves 6x KV cache compression with zero accuracy loss 鈥?and runs up to 8x faster on H100 GPUs.
New research technique xMemory replaces flat RAG with a four-level semantic hierarchy, cutting token usage by 48% for multi-session AI agents.
SakanaAI unveils AI-Scientist-v2, an autonomous research system that has produced the first AI-written paper accepted through peer review. Here's what it means for science.
Deep-Live-Cam enables real-time face swapping with just one image, raising questions about open-source AI ethics and the future of deepfake technology.
Google's new TurboQuant algorithm can reduce AI memory usage by 6x with zero accuracy loss, potentially democratizing access to powerful language models.
xMemory introduces a four-level semantic hierarchy that cuts AI agent token usage nearly in half, promising to revolutionize long conversation and multi-session AI task handling.