Towards Reinforcement Learning with AI Feedback (RLAIF). What open-sourced foundation models, instruction tuning, and other recent events mean for the future of AI
LeCun points to four essential characteristics of human intelligence that current AI systems, including LLMs, can’t replicate: reasoning, planning, persistent memory, and understanding the physical world. He stresses that LLMs’ reliance on textual data severely limits their understanding of reality: “We’re easily fooled into thinking they are intel... See more
Azeem Azhar • 🧠 AI’s $100bn question: The scaling ceiling
MargaretC added
sari and added
Nicolay Gerold and added
Evidence that LLMs are reaching a point of diminishing returns - and what that might mean
Gary Marcusgarymarcus.substack.comJohn Borthwick added
Relates to the broad question re; scaling, and the Bitter Lesson
A rough analogy to the current LLM process is that making a new model is like baking a cake. You figure out your data and algorithms—like mixing the batter—and then you pretrain the model, that is, run it on a large number of computers for several months—like putting it in the oven—and then at the end you do some “post training”—like frosting and d... See more
Avital Balwit • My Last Five Years of Work
Max Beauroyre added