GitHub - arthur-ai/bench: A tool for evaluating LLMs

github.com

GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks

mit-han-lab, github.com

Darren LI and added

GitHub - MadcowD/ell: A language model programming library.

github.com

DeepBench Knowledge as an addictive drug—an investor’s guide to the expert network industry

sari added

GitHub - FlowiseAI/Flowise: Drag & drop UI to build your customized LLM flow

github.com

Andrés added

AgentBench: Evaluating LLMs as Agents (Xiao Liu)

Darren LI added

GitHub - romkatv/zsh-bench: Benchmark for interactive Zsh

romkatv, github.com

Testing framework for LLM Part