GitHub - Mozilla-Ocho/llamafile: Distribute and run LLMs with a single file.

GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks

GitHub - jmorganca/ollama: Get up and running with Llama 2 and other large language models locally

GitHub - run-llama/llama-hub: A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain

Mozilla/whisperfile · Hugging Face