Models
37 articles RSS
Z.ai Open-Sources GLM-5.1, a 744-Billion-Parameter Model That Tops SWE-Bench Pro Ahead of GPT-5.4 and Claude Opus 4.6
Chinese AI lab Z.ai has released GLM-5.1 under the MIT license, a mixture-of-experts model that claims the top score on the SWE-Bench Pro coding benchmark while introducing agentic capabilities designed to sustain autonomous work sessions lasting up to eight hours.
AUI Launches Apollo-1, a Neuro-Symbolic Foundation Model That Challenges the Pure-Transformer Paradigm for Enterprise AI Agents
New York startup Augmented Intelligence Inc. begins general availability of Apollo-1, a foundation model that fuses neural language generation with symbolic reasoning to deliver deterministic, policy-compliant conversational agents already deployed at Fortune 500 companies.
MIT's VibeGen Model Designs Proteins by Their Motion Rather Than Shape, Opening a New Axis for Drug and Materials Engineering
MIT engineers have built an AI system called VibeGen that generates entirely novel protein sequences optimized for specific vibrational and flexing patterns, inverting the traditional structure-first approach to protein design.
PrismML Exits Stealth With First Commercially Viable 1-Bit Large Language Models, Fitting an 8B-Parameter Model in 1.15 GB
A Caltech spinout backed by Khosla Ventures has released a family of fully 1-bit language models under Apache 2.0 that compress an 8-billion-parameter model to 1.15 GB while matching full-precision rivals on standard benchmarks.
Google Releases Gemma 4 Under Apache 2.0 License, Making Its Most Capable Open Models Fully Redistributable for the First Time
Google DeepMind shipped four Gemma 4 model variants on April 2 under the OSI-approved Apache 2.0 license, replacing the restrictive Gemma Terms of Use and removing all barriers to commercial deployment, modification, and redistribution.
Google Launches Gemini 3.1 Flash Live, a Real-Time Voice Model That Detects Emotion and Powers Agents Across 200 Countries
Google's new Gemini 3.1 Flash Live model brings acoustic awareness, doubled context length, and tool-use capabilities to real-time voice AI, rolling out globally via Gemini Live and Search Live.
Google and Anthropic Launch Competing Import Tools as AI Chatbot Data Portability Race Heats Up
Google Gemini and Anthropic Claude both ship tools to import rival chatbot data, signaling that user lock-in is now a front in the AI platform war.
Google Stitch Adds Voice Canvas and MCP Integration in Major Update, Sending Figma Stock Down 12 Percent
Google's AI design tool gains voice commands, multi-screen generation, and developer tool integrations, triggering a two-day selloff in Figma shares.
Nvidia's Nemotron-Cascade 2 Achieves Gold-Medal AI Reasoning with 20x Fewer Parameters Than Its Nearest Rival
Nvidia open-sources a 30B mixture-of-experts model that activates only 3B parameters yet matches DeepSeek's 671B model on olympiad-level math and coding benchmarks.
Mistral Releases Voxtral TTS, a 4-Billion-Parameter Open-Weight Text-to-Speech Model That Rivals ElevenLabs
The French AI startup ships its first text-to-speech model under open weights, offering enterprises voice cloning from three seconds of audio and 70-millisecond latency across nine languages.
OpenAI Shuts Down Sora Video App After Six Months as Disney Pulls $1 Billion Investment and Resources Shift to Spud Model
OpenAI discontinues its Sora video generation app, ends a three-year Disney licensing deal, and redirects compute to a new model codenamed Spud ahead of a potential IPO.
Xiaomi Unmasked as Creator of Mystery Hunter Alpha AI Model After Week of DeepSeek Speculation
An anonymous trillion-parameter AI model that topped OpenRouter usage charts and triggered widespread speculation about a stealth DeepSeek V4 launch turned out to be Xiaomi's MiMo-V2-Pro, signaling the smartphone and EV maker's escalating ambitions in the AI model race.