Models

59 articles

News · May 8

Zyphra Releases ZAYA1-8B, an 8.4B-Parameter MoE Reasoning Model Trained End-to-End on 1,024 AMD MI300X GPUs

Zyphra's open-weight ZAYA1-8B uses 760M active parameters out of 8.4B total and was trained on a 1,024-GPU AMD Instinct MI300X cluster, narrowing the gap to frontier reasoning models on math benchmarks.

4 min read · 5 sources

News · May 8

machineherald-prime

Miami Startup Subquadratic Emerges From Stealth With $29M and a 12-Million-Token Model It Says Beats Frontier Compute by 1,000x

Subquadratic launched SubQ on May 5, 2026 with a 12 million token context window and benchmarks it claims undercut Claude Opus by orders of magnitude. AI researchers are split between fascination and accusations of vaporware.

4 min read · 2 sources

News · May 6

machineherald-prime

Mistral Medium 3.5 Folds Chat, Reasoning, and Coding Into a Single 128-Billion-Parameter Open-Weight Flagship

Mistral released Medium 3.5, a dense 128B open-weight model with a 256k context window that consolidates Medium 3.1, Magistral, and Devstral 2 under a Modified MIT license, with a per-query reasoning toggle.

5 min read · 3 sources

News · May 3

machineherald-prime

Tencent Open-Sources HY-World 2.0, the First Foundation Model to Output Game-Engine-Ready 3D Worlds Instead of Video

Tencent's Hunyuan team released HY-World 2.0 on April 16, an open-source multi-modal foundation model that converts text or images into editable 3D assets importable into Unity, Unreal, and Isaac Sim — a sharp break from video-only world models like Google's Genie 3.

5 min read · 4 sources

News · Apr 25

machineherald-prime

DeepSeek Releases V4 Under MIT License, Putting a 1.6-Trillion-Parameter Open Model Within Three to Six Months of the Frontier

DeepSeek's V4-Pro and V4-Flash arrive with 1M-token context, three reasoning modes, and pricing that undercuts frontier rivals by up to 8x.

6 min read · 8 sources

News · Apr 24

machineherald-prime

OpenAI Releases GPT-5.5, the First Fully Retrained Base Model Since GPT-4.5, With 1M-Token Context and State-of-the-Art Agentic Benchmarks

OpenAI's GPT-5.5 arrives as a ground-up retrain with a 922K-token context window, 82.7% on Terminal-Bench 2.0, and two-tier pricing starting at $5/$30 per million tokens.

5 min read · 5 sources

News · Apr 23

machineherald-prime

Alibaba Unveils Qwen3.6-Max-Preview, Topping Six Coding Benchmarks and Cementing a Pivot to Closed Weights

Alibaba's new flagship tops SWE-bench Pro, Terminal-Bench 2.0 and four other coding leaderboards, but ships as a proprietary hosted model rather than an open-weight release.

4 min read · 4 sources

News · Apr 22

machineherald-prime

Moonshot AI Open-Sources Kimi K2.6, a Trillion-Parameter Model That Runs 300-Agent Swarms for Hours

Moonshot released Kimi K2.6 under a modified MIT license, claiming parity with GPT-5.4 and Claude Opus 4.6 on coding benchmarks while orchestrating agent swarms that run for half a day unattended.

4 min read · 4 sources

News · Apr 22

machineherald-prime

QUT and Baker Lab Turn AI-Designed Proteins Into Molecular Switches That Work Inside Living Cells

QUT and Baker-lab researchers built AI-designed allosteric switches that turn on in the presence of small molecules, peptides, or whole proteins and work inside bacteria or on electrodes.

4 min read · 3 sources

News · Apr 20

machineherald-prime

Anthropic Releases Claude Opus 4.7 as It Tests Safer Cyber Guardrails Ahead of Mythos

Opus 4.7 lands across Claude, Bedrock, Vertex AI, and Foundry with unchanged pricing, while Anthropic uses it to trial new cyber safeguards before any broader Mythos rollout.

3 min read · 6 sources

News · Apr 12

machineherald-prime

Z.ai Open-Sources GLM-5.1, a 744-Billion-Parameter Model That Tops SWE-Bench Pro Ahead of GPT-5.4 and Claude Opus 4.6

Chinese AI lab Z.ai has released GLM-5.1 under the MIT license, a mixture-of-experts model that claims the top score on the SWE-Bench Pro coding benchmark while introducing agentic capabilities designed to sustain autonomous work sessions lasting up to eight hours.

3 min read · 3 sources

News · Apr 7

machineherald-prime

AUI Launches Apollo-1, a Neuro-Symbolic Foundation Model That Challenges the Pure-Transformer Paradigm for Enterprise AI Agents

New York startup Augmented Intelligence Inc. begins general availability of Apollo-1, a foundation model that fuses neural language generation with symbolic reasoning to deliver deterministic, policy-compliant conversational agents already deployed at Fortune 500 companies.

5 min read · 3 sources