AI & Machine Learning
132 articles RSS
Google Launches Gemini 3.1 Flash Live, a Real-Time Voice Model That Detects Emotion and Powers Agents Across 200 Countries
Google's new Gemini 3.1 Flash Live model brings acoustic awareness, doubled context length, and tool-use capabilities to real-time voice AI, rolling out globally via Gemini Live and Search Live.
Google and Anthropic Launch Competing Import Tools as AI Chatbot Data Portability Race Heats Up
Google Gemini and Anthropic Claude both ship tools to import rival chatbot data, signaling that user lock-in is now a front in the AI platform war.
Anthropic Data Leak Reveals Claude Mythos, a New AI Model the Company Says Poses Unprecedented Cybersecurity Risks
Nearly 3,000 unpublished assets exposed through an unsecured content management system reveal Anthropic's next-generation AI model and its own warnings about its offensive cyber capabilities.
Google Stitch Adds Voice Canvas and MCP Integration in Major Update, Sending Figma Stock Down 12 Percent
Google's AI design tool gains voice commands, multi-screen generation, and developer tool integrations, triggering a two-day selloff in Figma shares.
Apple Will Open Siri to Rival AI Chatbots in iOS 27, Ending OpenAI's Exclusive Integration
A new Extensions system in iOS 27 will let users route Siri queries to Claude, Gemini, and other third-party AI services, replacing the current ChatGPT-only arrangement.
Nvidia's Nemotron-Cascade 2 Achieves Gold-Medal AI Reasoning with 20x Fewer Parameters Than Its Nearest Rival
Nvidia open-sources a 30B mixture-of-experts model that activates only 3B parameters yet matches DeepSeek's 671B model on olympiad-level math and coding benchmarks.
Mistral Releases Voxtral TTS, a 4-Billion-Parameter Open-Weight Text-to-Speech Model That Rivals ElevenLabs
The French AI startup ships its first text-to-speech model under open weights, offering enterprises voice cloning from three seconds of audio and 70-millisecond latency across nine languages.
Google's TurboQuant Compresses AI Memory by 6x With No Accuracy Loss, Triggering a Selloff in Memory Chip Stocks
Google Research unveils TurboQuant, a training-free compression algorithm that reduces LLM key-value cache memory by 6x and boosts inference throughput up to 8x, sending SK Hynix and Samsung shares down 5-6 percent.
OpenAI Shuts Down Sora Video App After Six Months as Disney Pulls $1 Billion Investment and Resources Shift to Spud Model
OpenAI discontinues its Sora video generation app, ends a three-year Disney licensing deal, and redirects compute to a new model codenamed Spud ahead of a potential IPO.
SoftBank to Build 10 GW Data Center at Former Ohio Uranium Enrichment Site With $33.3 Billion in Japanese Funding
The U.S. Department of Energy announced a partnership with SoftBank subsidiary SB Energy to develop a 10 GW data center and matching power generation complex at the decommissioned Portsmouth uranium enrichment plant in Pike County, Ohio, backed by $33.3 billion in Japanese investment.
Google Expands Universal Commerce Protocol With Cart, Catalog, and Identity Linking as Agentic Shopping Takes Shape
Google added Cart, Catalog, and Identity Linking capabilities to its Universal Commerce Protocol, expanding the open standard that lets AI agents complete purchases across retailer platforms.
IBM, Red Hat, and Google Donate Kubernetes LLM Inference Framework llm-d to the CNCF
IBM Research, Red Hat, and Google Cloud donated llm-d, a Kubernetes-native LLM inference framework, to the CNCF as a sandbox project at KubeCon Europe 2026, backed by NVIDIA, CoreWeave, AMD, and other partners.