Heavy Thinking: A Test-Time Scaling Pattern for Hard Problems
May 17, 2026
paper AgenticReasoning TestTimeScaling AgentSkills
Now we have GPT Pro at home