
🕳️ Attention Sinks in LLMs for endless fluency

GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks
github.com
Deep Dive into LLMs like ChatGPT
youtube.com