Darren LI
@darrenli
Creator Economy, AI & retail-tech investor | J.D. & J.M. | PNG collector | Musical lover | Happy to chat, using Cal.com link below to book calls (https://cal.com/darrenli)
twitter.comDarren LI
@darrenli
Evaluating Large Language Models (LLMs) as agents in interactive environments, highlighting the performance gap between API-based and open-source models, and introducing the AgentBench benchmark.
arxiv.orgGenerative AI and