Sublime
An inspiration engine for ideas


OCR-2.0 is coming, and Generative AI and multimodal LLMs will power it! 🔍 GOT (General OCR Theory) is a 580M-parameter end-to-end OCR-2.0 model that outperforms all existing methods.
GOT consists of a vision encoder that converts images into tokens and a decoder that generates OCR outputs in various formats...
inventions and progress; it'll be awfully primitive."
Charlotte Gilman • Herland
Goodreads
goodreads.com
Paul Graham
paulgraham.com
advanced image models