Models
37 articles RSS
Google Debuts Gemini Omni at I/O 2026, an Any-to-Any Model That Simulates the World to Generate Physics-Aware Video
Google DeepMind's Gemini Omni fuses Gemini reasoning with Veo, Genie, and Nano Banana to generate and conversationally edit video from any mix of text, image, audio, or video input.
Google Launches Gemini 3.5 Flash at I/O 2026, Beating Its Own Pro Model on Agentic and Coding Benchmarks
Google's new efficiency flagship outperforms Gemini 3.1 Pro on most evals while running 4x faster and costing 40% less.
Zyphra Releases ZAYA1-8B, an 8.4B-Parameter MoE Reasoning Model Trained End-to-End on 1,024 AMD MI300X GPUs
Zyphra's open-weight ZAYA1-8B uses 760M active parameters out of 8.4B total and was trained on a 1,024-GPU AMD Instinct MI300X cluster, narrowing the gap to frontier reasoning models on math benchmarks.
Miami Startup Subquadratic Emerges From Stealth With $29M and a 12-Million-Token Model It Says Beats Frontier Compute by 1,000x
Subquadratic launched SubQ on May 5, 2026 with a 12 million token context window and benchmarks it claims undercut Claude Opus by orders of magnitude. AI researchers are split between fascination and accusations of vaporware.
Mistral Medium 3.5 Folds Chat, Reasoning, and Coding Into a Single 128-Billion-Parameter Open-Weight Flagship
Mistral released Medium 3.5, a dense 128B open-weight model with a 256k context window that consolidates Medium 3.1, Magistral, and Devstral 2 under a Modified MIT license, with a per-query reasoning toggle.
Tencent Open-Sources HY-World 2.0, the First Foundation Model to Output Game-Engine-Ready 3D Worlds Instead of Video
Tencent's Hunyuan team released HY-World 2.0 on April 16, an open-source multi-modal foundation model that converts text or images into editable 3D assets importable into Unity, Unreal, and Isaac Sim — a sharp break from video-only world models like Google's Genie 3.
DeepSeek Releases V4 Under MIT License, Putting a 1.6-Trillion-Parameter Open Model Within Three to Six Months of the Frontier
DeepSeek's V4-Pro and V4-Flash arrive with 1M-token context, three reasoning modes, and pricing that undercuts frontier rivals by up to 8x.
OpenAI Releases GPT-5.5, the First Fully Retrained Base Model Since GPT-4.5, With 1M-Token Context and State-of-the-Art Agentic Benchmarks
OpenAI's GPT-5.5 arrives as a ground-up retrain with a 922K-token context window, 82.7% on Terminal-Bench 2.0, and two-tier pricing starting at $5/$30 per million tokens.
Alibaba Unveils Qwen3.6-Max-Preview, Topping Six Coding Benchmarks and Cementing a Pivot to Closed Weights
Alibaba's new flagship tops SWE-bench Pro, Terminal-Bench 2.0 and four other coding leaderboards, but ships as a proprietary hosted model rather than an open-weight release.
Moonshot AI Open-Sources Kimi K2.6, a Trillion-Parameter Model That Runs 300-Agent Swarms for Hours
Moonshot released Kimi K2.6 under a modified MIT license, claiming parity with GPT-5.4 and Claude Opus 4.6 on coding benchmarks while orchestrating agent swarms that run for half a day unattended.
QUT and Baker Lab Turn AI-Designed Proteins Into Molecular Switches That Work Inside Living Cells
QUT and Baker-lab researchers built AI-designed allosteric switches that turn on in the presence of small molecules, peptides, or whole proteins and work inside bacteria or on electrodes.
Anthropic Releases Claude Opus 4.7 as It Tests Safer Cyber Guardrails Ahead of Mythos
Opus 4.7 lands across Claude, Bedrock, Vertex AI, and Foundry with unchanged pricing, while Anthropic uses it to trial new cyber safeguards before any broader Mythos rollout.