Models
22 articles RSS
PrismML Exits Stealth With First Commercially Viable 1-Bit Large Language Models, Fitting an 8B-Parameter Model in 1.15 GB
A Caltech spinout backed by Khosla Ventures has released a family of fully 1-bit language models under Apache 2.0 that compress an 8-billion-parameter model to 1.15 GB while matching full-precision rivals on standard benchmarks.
Google Releases Gemma 4 Under Apache 2.0 License, Making Its Most Capable Open Models Fully Redistributable for the First Time
Google DeepMind shipped four Gemma 4 model variants on April 2 under the OSI-approved Apache 2.0 license, replacing the restrictive Gemma Terms of Use and removing all barriers to commercial deployment, modification, and redistribution.
Google Launches Gemini 3.1 Flash Live, a Real-Time Voice Model That Detects Emotion and Powers Agents Across 200 Countries
Google's new Gemini 3.1 Flash Live model brings acoustic awareness, doubled context length, and tool-use capabilities to real-time voice AI, rolling out globally via Gemini Live and Search Live.
Google and Anthropic Launch Competing Import Tools as AI Chatbot Data Portability Race Heats Up
Google Gemini and Anthropic Claude both ship tools to import rival chatbot data, signaling that user lock-in is now a front in the AI platform war.
Google Stitch Adds Voice Canvas and MCP Integration in Major Update, Sending Figma Stock Down 12 Percent
Google's AI design tool gains voice commands, multi-screen generation, and developer tool integrations, triggering a two-day selloff in Figma shares.
Nvidia's Nemotron-Cascade 2 Achieves Gold-Medal AI Reasoning with 20x Fewer Parameters Than Its Nearest Rival
Nvidia open-sources a 30B mixture-of-experts model that activates only 3B parameters yet matches DeepSeek's 671B model on olympiad-level math and coding benchmarks.
Mistral Releases Voxtral TTS, a 4-Billion-Parameter Open-Weight Text-to-Speech Model That Rivals ElevenLabs
The French AI startup ships its first text-to-speech model under open weights, offering enterprises voice cloning from three seconds of audio and 70-millisecond latency across nine languages.
OpenAI Shuts Down Sora Video App After Six Months as Disney Pulls $1 Billion Investment and Resources Shift to Spud Model
OpenAI discontinues its Sora video generation app, ends a three-year Disney licensing deal, and redirects compute to a new model codenamed Spud ahead of a potential IPO.
Xiaomi Unmasked as Creator of Mystery Hunter Alpha AI Model After Week of DeepSeek Speculation
An anonymous trillion-parameter AI model that topped OpenRouter usage charts and triggered widespread speculation about a stealth DeepSeek V4 launch turned out to be Xiaomi's MiMo-V2-Pro, signaling the smartphone and EV maker's escalating ambitions in the AI model race.
Meta Delays Avocado AI Model to May After Internal Testing Reveals Performance Gaps with Google, OpenAI and Anthropic
Meta postpones its next-generation Avocado AI model by at least two months after benchmarks show it trailing rivals, raising questions about the company's $135 billion AI investment.
Alibaba Qwen 3.5 Small Models Pack Multimodal Intelligence Into Sub-10-Billion Parameters for Edge Devices
Alibaba releases four open-source Qwen 3.5 Small models ranging from 0.8 to 9 billion parameters, delivering multimodal reasoning on consumer hardware while the 9B variant outperforms models up to 13 times its size on key benchmarks.
OpenAI Launches GPT-5.4 With Native Computer Use and Financial Plugins as It Fights to Regain User Trust
OpenAI releases its most capable model yet with autonomous computer operation and enterprise integrations, but the launch is overshadowed by Pentagon deal fallout and a reported 1.5 million user exodus.