Large Language Models
o3's 10x RL Compute Gambit: The Real State of LLM Reasoning Reinforcement
OpenAI's o3 didn't just scale — it poured 10x compute into reinforcement learning for reasoning, smashing benchmarks. Meanwhile, GPT-4.5's yawn proves scaling alone is tapped out.