IndexCache: Tsinghua Researchers Achieve 1.82x Speed Boost for Long-Context AI Inference
Researchers at Tsinghua University have developed IndexCache, a new technique that delivers a 1.82x inference speedup for long-context AI models by eliminating redundant sparse-attention computations.
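To make the core idea concrete, here is a minimal conceptual sketch of caching sparse-attention index selections across decoding steps. This is not the paper's implementation: the class name `IndexCacheSketch`, the single-entry cache keyed on query similarity, the `sim_threshold` parameter, and the block-level top-k selection are all illustrative assumptions about how such a cache might avoid re-scoring every KV block on each step.

```python
# Conceptual sketch only -- the actual IndexCache design, cache keying,
# and invalidation policy in the paper may differ substantially.
import numpy as np

def topk_block_indices(query, block_keys, k):
    """Score each KV block against the query and keep the top-k indices."""
    scores = block_keys @ query          # (num_blocks,)
    return np.argsort(scores)[-k:]       # k highest-scoring blocks

class IndexCacheSketch:
    """Caches a top-k block selection so similar queries skip re-selection."""
    def __init__(self, k, sim_threshold=0.95):
        self.k = k
        self.sim_threshold = sim_threshold  # assumed reuse criterion
        self.last_query = None
        self.last_indices = None

    def select(self, query, block_keys):
        # If the new query is close to the cached one, reuse the cached
        # indices instead of re-scoring every block -- this re-scoring is
        # the "redundant sparse-attention computation" being eliminated.
        if self.last_query is not None:
            sim = query @ self.last_query / (
                np.linalg.norm(query) * np.linalg.norm(self.last_query) + 1e-9)
            if sim >= self.sim_threshold:
                return self.last_indices
        indices = topk_block_indices(query, block_keys, self.k)
        self.last_query, self.last_indices = query.copy(), indices
        return indices

# Toy usage: 64 KV blocks of dimension 128, select 8 blocks per step.
rng = np.random.default_rng(0)
block_keys = rng.standard_normal((64, 128))
cache = IndexCacheSketch(k=8)
q1 = rng.standard_normal(128)
q2 = q1 + 0.01 * rng.standard_normal(128)  # nearly identical next-step query
idx1 = cache.select(q1, block_keys)
idx2 = cache.select(q2, block_keys)        # served from cache, no re-scoring
assert np.array_equal(idx1, idx2)
```

In this toy version, consecutive decoding steps with near-identical queries hit the cache and skip the full block-scoring pass; the speedup in the real system would depend on how often selections can be safely reused.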