The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
An analysis of GPT-4V, a large multimodal model with visual understanding, discussing its capabilities, input modes, working modes, prompting techniques, and potential applications in various domains.
the challenge. What is good? What is interesting? That part of the work is taste. “Taste is what enables designers to navigate the vast sea of possibilities that technology and global connectivity afford, and to then select and combine these elements in ways that, ideally, result in interesting, unique work
Because we are each an individual, infinitely complex being, our different physiological, environmental, and cultural variations bring us to infinite different endpoints. Like it or not, we all see the world slightly differently and our creative expressions reflect this.
I can see a time once the hype has settled down, and we all see whether AI is truly capable at a practical, day-to-day level, where we realize that the control still remains with the humans. Just because everyone has a camera on their phone, doesn’t mean everyone takes great photos. That still requires skill, experience and above all, taste.