GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks

microsoft GitHub - microsoft/LLMLingua: To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Mustafa Suleyman The Coming Wave: Technology, Power, and the Twenty-first Century's Greatest Dilemma

GitHub - katanaml/sparrow: Data processing with ML, LLM and Vision LLM

Andrej Baranovskijgithub.com
Thumbnail of GitHub - katanaml/sparrow: Data processing with ML, LLM and Vision LLM

Ben Auffarth Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT, and other LLMs

LiteLLM

litellm.ai
Thumbnail of LiteLLM

Ben Auffarth Generative AI with LangChain: Build large language model (LLM) apps with Python, ChatGPT, and other LLMs