Infrastructure
10 articles RSS
Google's TurboQuant Compresses AI Memory by 6x With No Accuracy Loss, Triggering a Selloff in Memory Chip Stocks
Google Research unveils TurboQuant, a training-free compression algorithm that reduces LLM key-value cache memory by 6x and boosts inference throughput up to 8x, sending SK Hynix and Samsung shares down 5-6 percent.
SoftBank to Build 10 GW Data Center at Former Ohio Uranium Enrichment Site With $33.3 Billion in Japanese Funding
The U.S. Department of Energy announced a partnership with SoftBank subsidiary SB Energy to develop a 10 GW data center and matching power generation complex at the decommissioned Portsmouth uranium enrichment plant in Pike County, Ohio, backed by $33.3 billion in Japanese investment.
IBM, Red Hat, and Google Donate Kubernetes LLM Inference Framework llm-d to the CNCF
IBM Research, Red Hat, and Google Cloud donated llm-d, a Kubernetes-native LLM inference framework, to the CNCF as a sandbox project at KubeCon Europe 2026, backed by NVIDIA, CoreWeave, AMD, and other partners.
KubeCon Europe 2026 Returns to Amsterdam With 12,000 Expected Attendees and a New Focus on Agentic AI Infrastructure
The largest cloud native conference opens March 23 in Amsterdam, introducing Agentics Day alongside 224 sessions spanning AI workloads, platform engineering, and security.
NVIDIA Invests 2 Billion Dollars in Nebius as the Neocloud Race for AI Infrastructure Intensifies
NVIDIA announced a 2 billion dollar strategic investment in Amsterdam-based Nebius Group, granting the AI cloud provider early access to next-generation Rubin GPUs and Vera CPUs as both companies target over 5 gigawatts of deployed capacity by 2030.
Apple Plans to Replace Core ML with Core AI Framework at WWDC 2026, Signaling a Developer-Facing AI Platform Shift
Apple is reportedly planning to retire Core ML at WWDC 2026 and introduce Core AI, a modernized framework intended to help developers integrate outside AI models into iOS apps, according to Bloomberg's Mark Gurman.
CoreWeave's $67 Billion Backlog Cannot Mask a $4.2 Billion GPU Debt Wall as Neocloud Economics Face Their First Real Test
CoreWeave reports record Q4 revenue but a $452 million loss, while approximately $21 billion in total debt and securities class actions test the viability of GPU-backed financing.
Deutsche Telekom Opens Europe's Largest Sovereign AI Factory in Munich, Deploying 10,000 NVIDIA Blackwell GPUs Under German Law
Germany's first industrial AI cloud delivers 0.5 ExaFLOPS from a converted bank building cooled by the Eisbach river, marking Europe's boldest bid for AI independence from US hyperscalers.
India Unveils $200 Billion AI Infrastructure Push at Landmark Summit, Drawing Pledges from Adani, Microsoft, Amazon, and Google
India targets $200 billion in AI investments over two years across its full AI stack, anchored by Adani's $100 billion data center commitment and $67 billion from Western tech giants.
OpenAI Launches GPT-5.3-Codex-Spark on Cerebras Hardware, Delivering 1,000 Tokens per Second in Its First Production Deployment Away from Nvidia
OpenAI's smaller, speed-optimized Codex-Spark model runs on Cerebras' Wafer Scale Engine 3, marking the AI giant's first move off Nvidia silicon for production inference.