A Survey on Large Language Model based Autonomous Agents
The paper surveys large language model-based autonomous agents, discussing their construction, applications across various domains, and evaluation strategies, while proposing a unified framework and identifying future research directions.
The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
An analysis of GPT-4V, a large multimodal model with visual understanding, discussing its capabilities, input modes, working modes, prompting techniques, and potential applications in various domains.
The central lie behind these programs is that they are meant for artists. They’re not. We don’t need them and using them only hurts us. What our clients really need from us is what the A.I. button cannot and never will be able to give: a human expression in all its flawed, beautiful glory.
I can see a time once the hype has settled down, and we all see whether AI is truly capable at a practical, day-to-day level, where we realize that the control still remains with the humans. Just because everyone has a camera on their phone, doesn’t mean everyone takes great photos. That still requires skill, experience and above all, taste.
I spent the weekend playing with ChatGPT, MidJourney, and other AI tools… and by combining all of them, published a children’s book co-written and illustrated by AI!
Here’s how! 🧵 https://t.co/0UjG2dxH7Q