
Reinforcement Learning, Explained With a Minimum of Math and Jargon
Timothy B. Leeunderstandingai.org
Scaling up RL is all the rage right now, I had a chat with a friend about it yesterday. I'm fairly certain RL will continue to yield more intermediate gains, but I also don't expect it to be the full story. RL is basically "hey this happened to go well (/poorly), let me slightly increase (/decrease) the probability of every action I took for the fu... See more
Andrej Karpathyx.com