Google Launches Gemini 3.5 Flash at I/O 2026, Beating Its Own Pro Model on Agentic and Coding Benchmarks
Google's new efficiency flagship outperforms Gemini 3.1 Pro on most evals while running 4x faster and costing 40% less.
Source
4 articles citing this source
Google's new efficiency flagship outperforms Gemini 3.1 Pro on most evals while running 4x faster and costing 40% less.
Figure AI's Helix-02-powered Figure 03 humanoids ran a public, multi-day sorting livestream from May 13, processing more than 100,000 packages in 81 hours with no teleoperation — though a leading roboticist calls the demo more 'science project' than commercial proof.
Alibaba's new flagship tops SWE-bench Pro, Terminal-Bench 2.0 and four other coding leaderboards, but ships as a proprietary hosted model rather than an open-weight release.
OpenAI unveiled GPT-5.4-Cyber on April 14, a cyber-permissive variant with binary reverse engineering capabilities, limited to vetted defenders through an expanded Trusted Access for Cyber program.