jina-ai/reader: Convert any URL to an LLM-friendly input ... - GitHub
Introducing llms.txt Generator ✨
You can now concatenate any website into a single text file that can be fed into any LLM.
We crawl the whole website with @firecrawl_dev and extract data with gpt-4o-mini.
Create your own llms.txt at https://t.co/wNrQE0DMJu!
Eric Ciarla (hiring), x.com
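The concatenation step described in the post above can be sketched roughly as follows. This is a minimal illustration only: the crawling and gpt-4o-mini extraction steps are not shown, the function name `build_llms_text` is hypothetical, and the "heading per page" layout is an assumption rather than the format the generator actually emits.

```python
from typing import Iterable, Tuple


def build_llms_text(pages: Iterable[Tuple[str, str]]) -> str:
    """Join already-extracted (url, text) pairs into one text blob.

    Hypothetical helper: the section layout here is an assumption,
    not the real llms.txt Generator output.
    """
    sections = []
    for url, text in pages:
        body = text.strip()
        if not body:
            # Skip pages that yielded no usable text.
            continue
        # One section per page, headed by its source URL.
        sections.append(f"# {url}\n\n{body}")
    return "\n\n".join(sections) + "\n"


if __name__ == "__main__":
    demo = [
        ("https://a.example/", "Home page text."),
        ("https://a.example/docs", "Docs page text."),
    ]
    print(build_llms_text(demo))
```

The resulting string can be written to a single file and pasted into an LLM prompt.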
Built a little tool for turning a web page into Markdown and easily copying it to my clipboard - it's a very thin wrapper around the Jina Reader API https://t.co/kqhun3VUi9
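A thin wrapper of the kind mentioned above might look like the sketch below. It relies on the Jina Reader convention of prefixing the target URL with `https://r.jina.ai/`; the function names, the User-Agent string, and the timeout are illustrative assumptions, and the clipboard step from the post is left out.

```python
from urllib.parse import urlsplit
from urllib.request import Request, urlopen

READER_PREFIX = "https://r.jina.ai/"


def reader_url(target: str) -> str:
    """Build the Reader URL for an absolute http(s) page URL."""
    parts = urlsplit(target)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"expected an absolute http(s) URL, got {target!r}")
    return READER_PREFIX + target


def fetch_markdown(target: str, timeout: float = 30.0) -> str:
    """Fetch the page through the Reader endpoint and return the response text.

    Needs network access; the User-Agent value is an arbitrary placeholder.
    """
    request = Request(reader_url(target), headers={"User-Agent": "reader-sketch/0.1"})
    with urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    print(reader_url("https://example.com/post"))
```

Calling `fetch_markdown("https://example.com/post")` would return the converted page text, assuming the service is reachable.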
PDF parsing is still painful because LLMs reorder text in complex layouts, break tables across pages, and fail on graphs or images.
💡 I'm testing the new open-source OCRFlux model, and the results are really good for a change.
OCRFlux is a multimodal, LLM-based toolkit for converting PDFs...
Rohan Paul, x.com
