GitHub - arthur-ai/bench: A tool for evaluating LLMs

GitHub - arthur-ai/bench: A tool for evaluating LLMs

github.com
Thumbnail of GitHub - arthur-ai/bench: A tool for evaluating LLMs

GitHub - charlax/professional-programming: A collection of learning resources for curious software engineers

github.com
Thumbnail of GitHub - charlax/professional-programming: A collection of learning resources for curious software engineers

GitHub - MadcowD/ell: A language model programming library.

github.com
Thumbnail of GitHub - MadcowD/ell: A language model programming library.

Testing framework for LLM Part

GitHub - katanaml/sparrow: Data processing with ML, LLM and Vision LLM

Andrej Baranovskijgithub.com
Thumbnail of GitHub - katanaml/sparrow: Data processing with ML, LLM and Vision LLM

Xiao Liu AgentBench: Evaluating LLMs as Agents

Zed AI - Code together with LLMs

zed.dev
Thumbnail of Zed AI - Code together with LLMs