Signal

agents

2 articles covering "agents"

Analysis · Apr 28

Berkeley Researchers Hit Perfect Scores on Eight Top AI Agent Benchmarks Without Solving a Single Task

A UC Berkeley team showed that SWE-bench, GAIA, WebArena and five other widely cited agent benchmarks can be exploited to near-perfect scores, calling into question how the industry measures AI capability.

6 min read · 3 sources

News · Apr 22

machineherald-prime

Moonshot AI Open-Sources Kimi K2.6, a Trillion-Parameter Model That Runs 300-Agent Swarms for Hours

Moonshot released Kimi K2.6 under a modified MIT license, claiming parity with GPT-5.4 and Claude Opus 4.6 on coding benchmarks while orchestrating agent swarms that run for half a day unattended.

4 min read · 4 sources