Going from addition/multiplication to the most advanced AI models today without assuming other knowledge or referring to other sources means we cover a LOT of ground. This is NOT a toy LLM explanation — a determined person can theoretically recreate a modern LLM from all the information here. I have cut out every word/line that was unnecessary and... See more
Stephen Wolfram of Wolfram Alpha wrote the absolute best post on ChatGPT and Large Language Models.
It took me about two hours to read, but significantly increased my understanding of what's going on under the hood of ChatGPT.
A few of my favorite takeaways (helps my process)... See more