Sublime
An inspiration engine for ideas
Vision RAG with a vector database is all you need.
It uses a vision language model to embed PDF pages directly as vectors, without the tedious chunking process.
100% open-source code. https://t.co/Td7Jjm2GQ6
Shubham Saboo (x.com)
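A minimal sketch of the retrieval idea in the post above, assuming each PDF page has already been turned into a single vector by some vision language model. The `PageIndex` class, the page ids, and the three-dimensional vectors are all hypothetical stand-ins for illustration. They are not taken from the linked code, and a real setup would use a proper embedding model and vector database.

```python
import math
from dataclasses import dataclass, field


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


@dataclass
class PageIndex:
    """Toy in-memory vector store: one vector per PDF page, no text chunks."""
    entries: list = field(default_factory=list)  # (page_id, vector) pairs

    def add(self, page_id, vector):
        self.entries.append((page_id, list(vector)))

    def search(self, query_vector, k=1):
        """Return the k page ids most similar to the query vector."""
        ranked = sorted(
            self.entries,
            key=lambda entry: cosine(query_vector, entry[1]),
            reverse=True,
        )
        return [page_id for page_id, _ in ranked[:k]]


# Hand-written toy vectors standing in for vision-language-model embeddings.
index = PageIndex()
index.add("report.pdf#page=1", [0.9, 0.1, 0.0])
index.add("report.pdf#page=2", [0.1, 0.8, 0.3])
index.add("report.pdf#page=3", [0.0, 0.2, 0.9])

# The query vector would come from embedding the user's question.
top_pages = index.search([0.0, 0.1, 1.0], k=2)
print(top_pages)
```

The retrieved page ids would then be passed, as page images, to the model that writes the answer. Because the unit of retrieval is a whole page, no text extraction or chunking step is needed.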
We have a small gift for the open-source community: RolmOCR, a new OCR model for complex document processing!
We at @reductoai trained a Qwen2.5-VL-7B model (by @Alibaba_Qwen) using the amazing olmOCR dataset by @allen_ai earlier this year. As a result, we have RolmOCR - a slightly faster, maybe slightly better model under Apache 2.0...
Our computer vision textbook is now available for free online here:
https://t.co/ERy2Spc7c2
We are working on adding some interactive components like search and (beta) integration with LLMs.
Hope this is useful, and feel free to submit GitHub issues to help us improve the text!
Phillip Isola (x.com)

