Autonomous agents

AgentBench: Evaluating LLMs as Agents

Evaluating Large Language Models (LLMs) as agents in interactive environments, highlighting the performance gap between API-based and open-source models, and introducing the AgentBench benchmark.

arxiv.org

DDarren LI

Zach Tratartwitter.com

Autonomous Agents & Agent Simulations

LangChainblog.langchain.dev
Thumbnail of Autonomous Agents & Agent Simulations

Shishir Patil: Teaching AI to Use APIs with Gorilla LLM | Humans of AI Podcast #7

youtube.com

AI and the Automation of Work

Benedict Evansben-evans.com
Thumbnail of AI and the Automation of Work

Autonomous AI agents could change the world, but what do they actually do well?

Sandhya Hegdeunusual.vc
Thumbnail of Autonomous AI agents could change the world, but what do they actually do well?

Pay Your AI a Competitive Salary

mattprd.com
Thumbnail of Pay Your AI a Competitive Salary

LLM Powered Autonomous Agents

Lilian Wenglilianweng.github.io