Towards Reinforcement Learning with AI Feedback (RLAIF). Wha...

Towards Reinforcement Learning with AI Feedback (RLAIF). What open-sourced foundation models, instruction tuning, and other recent events mean for the future of AI

amatriain.net

RelatedHighlights

What We Learned From a Year of Building With LLMs

Bryan Bischof oreilly.com

and added

LeCun points to four essential characteristics of human intelligence that current AI systems, including LLMs, can’t replicate: reasoning, planning, persistent memory, and understanding the physical world. He stresses that LLMs’ reliance on textual data severely limits their understanding of reality: “We’re easily fooled into thinking they are intel... See more

Azeem Azhar • 🧠 AI’s $100bn question: The scaling ceiling

MargaretC added

Scaling: The State of Play in AI

Ethan Mollick oneusefulthing.org

and added

AI Revolution - Transformers and Large Language Models (LLMs)

Elad Gil blog.eladgil.com

sari and added

Shortwave — rajhesh.panchanadhan@gmail.com [Gmail alternative]

app.shortwave.com app.shortwave.com

Nicolay Gerold and added

Evidence that LLMs are reaching a point of diminishing returns - and what that might mean

Gary Marcus garymarcus.substack.com

John Borthwick added

Relates to the broad question re; scaling, and the Bitter Lesson

A rough analogy to the current LLM process is that making a new model is like baking a cake. You figure out your data and algorithms—like mixing the batter—and then you pretrain the model, that is, run it on a large number of computers for several months—like putting it in the oven—and then at the end you do some “post training”—like frosting and d... See more

Avital Balwit • My Last Five Years of Work

Max Beauroyre added

Co-Intelligence

Ethan Mollick

amazon.com