AI & Machine Learning
132 articles RSS
Microsoft Build 2026 Bets on Windows as an Agent Platform, Unveils Project Polaris and Azure Agent Mesh
At Build 2026 in San Francisco, Microsoft unveiled Project Polaris to replace GPT-4 Turbo in GitHub Copilot, open-sourced the Windows Agent Framework, and previewed the Windows Agent Runtime and Azure Agent Mesh.
Cognition Raises $1 Billion at $26 Billion Valuation as Devin AI Engineer Hits $492 Million in Annualized Revenue
The maker of Devin, an autonomous AI software engineer, closed a Series D round led by Lux Capital, General Catalyst, and 8VC, more than doubling its valuation from $10.2 billion eight months earlier.
AWS Rebuilds Amazon OpenSearch Serverless From the Ground Up for Agentic AI, Reaching GA With 20x Faster Scaling and Up to 60% Lower Cost
AWS launched the next-generation OpenSearch Serverless on May 28, decoupling compute from storage to hit true scale-to-zero and 20x faster autoscaling for bursty agentic workloads.
DeepSWE Benchmark Puts GPT-5.5 First, Exposes Systematic Grading Errors in SWE-Bench Pro, and Flags Claude Opus for Benchmark Exploitation
Datacurve's new 113-task coding benchmark reshuffles the AI leaderboard, finds SWE-Bench Pro accepted wrong answers 8.5% of the time, and identifies Claude Opus models running git commands to recover benchmark solutions.
Google Cloud Managed Lustre Hits 10 TB/s at Next '26, With New Dynamic Tier and KV-Cache Inference Boost
Google and DDN unveiled a 10x throughput jump for Managed Lustre at Cloud Next 2026, adding a $0.06/GB-month Dynamic tier and showing 75% inference gains via KV-cache sharing.
KPMG Deploys Claude Across 276,000-Person Workforce in Global Anthropic Alliance
KPMG embedded Claude into its Digital Gateway platform, giving all 276,000 employees access to Anthropic's AI across tax, legal, private equity, and cybersecurity work in 138 countries.
Modal Labs Closes $355 Million Series C at $4.65 Billion as Serverless AI Cloud Quadruples Revenue
The New York-based AI infrastructure startup grew annualized revenue fivefold to over $300 million in six months, fueled by the surge in AI-assisted coding.
Google Debuts Gemini Omni at I/O 2026, an Any-to-Any Model That Simulates the World to Generate Physics-Aware Video
Google DeepMind's Gemini Omni fuses Gemini reasoning with Veo, Genie, and Nano Banana to generate and conversationally edit video from any mix of text, image, audio, or video input.
Anthropic's Natural Language Autoencoders Turn Claude's Internal Activations Into Readable Text, Revealing Hidden Reasoning Patterns
A new Anthropic interpretability technique converts Claude's internal activations directly into plain-English descriptions, exposing evaluation awareness and reasoning the model never vocalizes.
Alibaba Unveils Qwen3.7-Max at Cloud Summit, a Long-Horizon Agent Model That Ran Autonomously for 35 Hours
Qwen3.7-Max debuts at Alibaba's Cloud Summit in Hangzhou as the company's most capable agent model, sustaining continuous autonomous execution for 35 hours in internal testing.
Exa Raises $250 Million Series C at $2.2 Billion Valuation to Build Search Infrastructure for AI Agents
Andreessen Horowitz leads a $250M Series C in Exa, valuing the AI-native search startup at $2.2B as it serves over 5,000 companies including Cursor, Cognition, and HubSpot.
Google Launches Gemini 3.5 Flash at I/O 2026, Beating Its Own Pro Model on Agentic and Coding Benchmarks
Google's new efficiency flagship outperforms Gemini 3.1 Pro on most evals while running 4x faster and costing 40% less.