GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks

by mit-han-lab

Thumbnail of GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks

added by Darren LI and · updated 1y ago

  • Community Paper Reading

    6 cards · by Darren LI

    Darren LI added 1y ago

  • Generative AI

    125 cards · by sari and

    Darren LI added 1y ago