GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks

Things we learned about LLMs in 2024

Simon Willison (simonwillison.net)