
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language
Finds that the LLMs with LENS perform highly competitively with much bigger and much more sophisticated systems, without any multimodal training whatsoever.
https://t.co/GxxW9iOW8z... See more
