Video of my talk at MIT on LLMs. I go through the development of how they work step by step from first principles and explain a lot of empirical observations - do not talk about attention/transformers at all and is a fresh perspective on how/why LLMs work https://t.co/opb8YA2qbu
Vishal Misrax.com