Zyphra Releases ZAYA1-8B, an 8.4B-Parameter MoE Reasoning Model Trained End-to-End on 1,024 AMD MI300X GPUs
Zyphra's open-weight ZAYA1-8B uses 760M active parameters out of 8.4B total and was trained on a 1,024-GPU AMD Instinct MI300X cluster, narrowing the gap to frontier reasoning models on math benchmarks.