Emerging reasoning with reinforcement learning | Hacker News