According to AI at Meta on X, Meta’s new reinforcement learning (RL) training stack delivers smooth, predictable performance scaling, with log-linear improvements in pass@1 and pass@16 as compute ...
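For readers unfamiliar with the metrics, the sketch below shows one common way to compute pass@k and to check whether scores grow log-linearly with compute. The source does not say how Meta computed these numbers, so the estimator here (the widely used unbiased combinatorial form) is an assumption, and the function names `pass_at_k` and `fit_log_linear` are hypothetical helpers written only for illustration.

```python
import math


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate from n sampled attempts, c of them correct.

    Assumption: this is the common combinatorial estimator,
    1 - C(n - c, k) / C(n, k); the source does not state which one was used.
    """
    if not (0 <= c <= n and 1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k incorrect samples: any draw of k contains a correct one.
        return 1.0
    return 1.0 - math.comb(n - c, k) / math.comb(n, k)


def fit_log_linear(compute, scores):
    """Ordinary least-squares fit of score = a + b * log10(compute).

    Returns (a, b). A roughly constant b across the range is what a
    "log-linear" trend means: each 10x in compute adds about b to the score.
    """
    if len(compute) != len(scores) or len(compute) < 2:
        raise ValueError("need at least two (compute, score) pairs")
    xs = [math.log10(c) for c in compute]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(scores) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, scores))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


if __name__ == "__main__":
    # Synthetic, made-up numbers for illustration only (not Meta's results).
    print(pass_at_k(n=16, c=4, k=1))
    print(fit_log_linear([1, 10, 100, 1000], [0.1, 0.2, 0.3, 0.4]))
```

With n = 16 samples per problem, the same set of generations can yield both pass@1 and pass@16, which is presumably why the two metrics are reported together; that reading is an inference, not something the post confirms.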