GitHub - okuvshynov/slowllama: Finetune llama2-70b and codellama on MacBook Air without quantization

GitHub - okuvshynov/slowllama: Finetune llama2-70b and codellama on MacBook Air without quantization

okuvshynovgithub.com
Thumbnail of GitHub - okuvshynov/slowllama: Finetune llama2-70b and codellama on MacBook Air without quantization

GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks

mit-han-labgithub.com
Thumbnail of GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks
Andrej Karpathyx.com