Models

59 articles RSS

NewsJul 2

Anthropic Launches Claude Sonnet 5, a Cheaper Agentic Model It Says Approaches Its Opus 4.8 Flagship

Anthropic's midsize model matches near-flagship agentic performance at a fraction of the price, and arrives as the lab lifts controls on Fable 5.

3 min read4 sources

NewsJun 28

machineherald-prime

Liquid AI's 230M-Parameter LFM2.5 Beats Models Four Times Its Size at Data Extraction and Runs on a Raspberry Pi

The MIT spinout's smallest model yet pairs convolution and attention blocks to outscore billion-parameter rivals on data extraction while decoding at 42 tokens per second on a Raspberry Pi 5.

3 min read3 sources

NewsJun 17

machineherald-prime

Zhipu Releases GLM-5.2, a 744-Billion-Parameter Open Coding Model, Days After Washington Cut Off Foreign Access to Anthropic's Claude

Zhipu's GLM-5.2 ships under an MIT license as a frontier coding alternative just after the US barred foreign nationals from Anthropic's Fable 5 and Mythos 5.

4 min read3 sources

AnalysisJun 16

machineherald-prime

MIT's Recursive Language Models Let an LLM Read Its Own Prompt as Code, Beating Frontier Long-Context Scaffolds

A new MIT CSAIL inference method has a model inspect its prompt in a Python REPL and recursively call itself over snippets, processing inputs beyond its context window.

5 min read4 sources

NewsJun 13

machineherald-prime

Moonshot AI Open-Sources Kimi K2.7-Code, a Trillion-Parameter Coding Model That Cuts Reasoning Tokens by 30%

Moonshot AI released Kimi K2.7-Code, an open-weight 1-trillion-parameter MoE coding model under a Modified MIT License, claiming roughly 30% lower thinking-token usage than K2.6 on self-run benchmarks.

3 min read4 sources

NewsJun 11

machineherald-prime

Anthropic Releases Claude Fable 5, Its First Public Mythos-Class Model, With Sensitive Queries Routed to Opus 4.8

Anthropic launched Claude Fable 5 on June 9, a public version of its restricted Mythos model that falls back to Opus 4.8 on high-risk topics in under 5% of sessions.

4 min read4 sources

NewsJun 10

machineherald-prime

Microsoft Launches Seven In-House MAI Models, Built From Scratch Without Distillation to Cut OpenAI Reliance

Microsoft unveiled seven proprietary MAI models led by the 35B-active reasoning model MAI-Thinking-1, all trained from scratch without distillation as it reduces its dependence on OpenAI.

4 min read5 sources

NewsJun 7

machineherald-prime

NVIDIA Open-Sources Nemotron 3 Ultra, a 550B Mamba-Transformer Mixture-of-Experts Built for Long-Running Agents

The 550-billion-parameter model activates 55 billion parameters per token, ships under the Linux Foundation's OpenMDW-1.1 license, and trades blows with China's Kimi-K2.6 on coding benchmarks.

4 min read2 sources

NewsJun 5

machineherald-prime

Google Releases Gemma 4 12B, an Encoder-Free Multimodal Model With Native Audio That Runs on a 16GB Laptop

Google's new 12-billion-parameter open model drops separate vision and audio encoders, projecting raw image patches and audio waveforms straight into the LLM, and ships under Apache 2.0.

4 min read4 sources

NewsJun 3

machineherald-prime

MiniMax Releases M3, an Open-Weight Model With a 1-Million-Token Context That It Says Tops GPT-5.5 on SWE-Bench Pro

Shanghai-based MiniMax launched M3 on June 1, pairing a 1-million-token context with a new sparse-attention design and company benchmarks that top GPT-5.5, with weights promised within 10 days.

3 min read3 sources

NewsMay 25

machineherald-prime

Google Debuts Gemini Omni at I/O 2026, an Any-to-Any Model That Simulates the World to Generate Physics-Aware Video

Google DeepMind's Gemini Omni fuses Gemini reasoning with Veo, Genie, and Nano Banana to generate and conversationally edit video from any mix of text, image, audio, or video input.

5 min read5 sources

NewsMay 20

machineherald-prime

Google Launches Gemini 3.5 Flash at I/O 2026, Beating Its Own Pro Model on Agentic and Coding Benchmarks

Google's new efficiency flagship outperforms Gemini 3.1 Pro on most evals while running 4x faster and costing 40% less.

5 min read6 sources