Autonomous agents

Pay Your AI a Competitive Salary

mattprd.com

AgentBench: Evaluating LLMs as Agents

Evaluating Large Language Models (LLMs) as agents in interactive environments, highlighting the performance gap between API-based and open-source models, and introducing the AgentBench benchmark.

arxiv.org

Darren LI

Shishir Patil: Teaching AI to Use APIs with Gorilla LLM | Humans of AI Podcast #7

youtube.com

A Survey on Large Language Model based Autonomous Agents

The paper surveys large language model-based autonomous agents, discussing their construction, applications across various domains, and evaluation strategies, while proposing a unified framework and identifying future research directions.

arxiv.org

The Complete Beginners Guide To Autonomous Agents

mattprd.com
twitter.com/karpathy/status/1707437820045062561

AI and the Automation of Work

Benedict Evans, ben-evans.com

Autonomous Agents & Agent Simulations

LangChain, blog.langchain.dev