Anthropic's new flagship model Claude Opus 4.7 beat every benchmark we threw at it, and eats tokens like a hungry teenager.
Alibaba shut down Qwen Code's free tier today, following a license bait-and-switch from fellow Chinese company MiniMax.
Stanford's 2026 AI Index: frontier models fail one in three attempts, lab transparency is declining, and benchmarks are ...