Emerging reasoning with reinforcement learning | Hacker News
news.ycombinator.com
Emerging reasoning with reinforcement learning | Hacker News
A key advantage of these RL advancements is their universal applicability across any open-source model. This flexibility allows organizations to future-proof their AI investments by using the best current models, and reusing the data and workflow to retrain when a better model comes up. For instance, a customer support AI could adopt newer foundati
... See moreOver the past 2 and half years we’ve seen the rise of the LLM’s but one of the great contributers to LLM’s Yann LeCun believes that LLM’s are actually old news and that we’re now just making them marginally better and he’s much more focused on other things.
Firstly he thinks these models need to understand the physical world. Right now, LLMs are gr