Sublime
An inspiration engine for ideas
Augen Pro
augen.pro
DeepSeek has just unveiled an OCR monster 🤯
DeepSeek-OCR is a 3B-parameter model that redefines document intelligence. It reaches 97% character-level accuracy with 10× input compression, preserving every detail.
Most OCR systems require over 6,000 tokens per...

i mean seriously. the amount of work previously required to get this data from an image was bonkers.
OCR calls, GPT to try to get the OCR data into something usable, huge computer vision models to identify the object... all replaced by a single call to an openai endpoint. https://t.co/bOQ8tPBp4y

Did you know that besides @Microsoft's OmniParser, @Apple just released weights for Ferret-UI?
"A new MLLM tailored for enhanced understanding of mobile UI screens, equipped with referring, grounding and reasoning capabilities"
Paper (with models, demo): https://t.co/mtNQtaVR4a...

Floorplans are all you need.
Here's a video of our AI that reads floorplans. It reads room labels, finds dimensions, finds dimension lines, and locates doors and windows, all in under 2 minutes.
There are companies with thousands of people whose entire job is to perform that same work...
Barrett Ames
x.com

Introducing Gaze-LLE, a new model for gaze target estimation built on top of a frozen visual foundation model!
Gaze-LLE achieves SOTA results on multiple benchmarks while learning minimal parameters, and shows strong generalization
paper: https://t.co/Is2NgrrurO https://t.co/eQS9hRPyuL
Fiona Ryan
x.com