GitHub - THUDM/AgentTuning: AgentTuning: Enabling Generalized Agent Abilities for LLMs

THUDM github.com

Xiao Liu AgentBench: Evaluating LLMs as Agents

Darren LI added

r/singularity - Reddit

Nicolay Gerold added

Yi Dong, Zhilin Wang NVIDIA Technical Blog | News and tutorials for developers, data ...

r/MachineLearning - Reddit

Nicolay Gerold added

AgentBench: Evaluating LLMs as Agents

Evaluating Large Language Models (LLMs) as agents in interactive environments, highlighting the performance gap between API-based and open-source models, and introducing the AgentBench benchmark.

arxiv.org

Darren LI added

Fine-Tuning Large Language Models with Sequential Instructions

ar5iv.labs.arxiv.org

LLM Powered Autonomous Agents

Lilian Weng lilianweng.github.io

Darren LI added