Sublime
An inspiration engine for ideas
zack
wenquai.comGitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks
mit-han-labgithub.comJesse Zhou - Business, Technology and Design
jesse-zhou.com
AgentBench: Evaluating LLMs as Agents
Xiao Liu • AgentBench: Evaluating LLMs as Agents
Lena Kul
linkedin.com
In streaming settings, StreamingLLM outperforms the sliding window recomputation baseline by up to 22.2x speedup.
mit-han-lab • GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks
Levi Lian
@levi
Neo Zhang
@neozhang