Models
37 articles RSS
Meta Delays Avocado AI Model to May After Internal Testing Reveals Performance Gaps with Google, OpenAI and Anthropic
Meta postpones its next-generation Avocado AI model by at least two months after benchmarks show it trailing rivals, raising questions about the company's $135 billion AI investment.
Alibaba Qwen 3.5 Small Models Pack Multimodal Intelligence Into Sub-10-Billion Parameters for Edge Devices
Alibaba releases four open-source Qwen 3.5 Small models ranging from 0.8 to 9 billion parameters, delivering multimodal reasoning on consumer hardware while the 9B variant outperforms models up to 13 times its size on key benchmarks.
OpenAI Launches GPT-5.4 With Native Computer Use and Financial Plugins as It Fights to Regain User Trust
OpenAI releases its most capable model yet with autonomous computer operation and enterprise integrations, but the launch is overshadowed by Pentagon deal fallout and a reported 1.5 million user exodus.
Yann LeCun's AMI Labs Raises $1.03 Billion in Record European Seed Round to Build World Models That Challenge the LLM Paradigm
Turing Award winner Yann LeCun's Paris-based startup raises the largest seed round in European history to develop world models, an alternative AI architecture that learns from physical reality rather than text prediction.
Alibaba's Qwen3.5 Arrives as the First Native Multimodal Open-Weight Model to Challenge Frontier Proprietary AI
Alibaba releases Qwen3.5, a 397-billion-parameter open-weight model that activates only 17B parameters per pass, beats rivals on instruction-following benchmarks, and ships at 1/18th the cost of Gemini 3 Pro.
Cancel ChatGPT Trends as Claude Hits Number One on App Store, Then Crashes Under Unprecedented Demand
OpenAI's Pentagon deal sparked a consumer backlash that drove Anthropic's Claude to the top of Apple's App Store, before a surge in demand caused a worldwide outage.
DeepSeek V4 Set to Launch This Week as Anthropic Accuses It of Industrial-Scale Model Theft
DeepSeek's first major AI release in over a year arrives amid Anthropic allegations that the Chinese lab used 24,000 fake accounts and 16 million queries to extract Claude's capabilities.
Google DeepMind Releases Gemini 3.1 Pro, Claims Top Spot on 12 of 18 Major AI Benchmarks
Google DeepMind's Gemini 3.1 Pro launches with a 2.5x reasoning improvement over its predecessor, topping the ARC-AGI-2 benchmark at 77.1% and ranking first across most tracked evaluations.
Zhipu AI's GLM-5 Becomes the First Frontier Model Trained Entirely on Chinese Chips, Rivaling Western Labs Under an MIT License
China's Zhipu AI releases a 744-billion-parameter open-source model trained on Huawei Ascend chips that matches Claude and GPT on key benchmarks, sending its stock up 34 percent.
Microsoft's BitNet Proves 1-Bit AI Models Can Match Full-Precision Rivals at a Fraction of the Cost
Microsoft Research's BitNet b1.58 framework uses ternary weights to run large language models on ordinary CPUs with up to 92% less energy, challenging the assumption that AI demands expensive GPU hardware.
GPT-5.3 Codex vs. Claude Opus 4.6: A Head-to-Head Comparison of February's Dueling Flagships
Both models launched on the same day but target different developer needs — Codex prioritizes speed and agentic reliability, while Opus leads on reasoning depth and multi-agent coordination.
OpenAI Launches GPT-5.3-Codex with Major Agentic Gains and First 'High' Cybersecurity Risk Rating
OpenAI releases GPT-5.3-Codex, its fastest agentic coding model yet, but delays API access after classifying it as 'High' cybersecurity capability under its Preparedness Framework.