I spent the weekend playing with ChatGPT, MidJourney, and other AI tools… and by combining all of them, published a children’s book co-written and illustrated by AI!
Here’s how! 🧵 https://t.co/0UjG2dxH7Q
Evaluating Large Language Models (LLMs) as agents in interactive environments, highlighting the performance gap between API-based and open-source models, and introducing the AgentBench benchmark.