Sublime
An inspiration engine for ideas
PDFs are satan’s file format.
Almost everyone who builds RAG needs to deal with them, and it sucks.
Solutions on the market are either too slow, too expensive or not OSS.
It should be easier. Which is why we’re open sourcing https://t.co/0gCZxzbkWu
Ishaan Kapoor
x.com
LLMs have made exciting progress on hard tasks! But they still struggle to analyze complex, unstructured documents (including today's Gemini 1.5 Pro 002).
We (UC Berkeley) built 📜DocETL, an open-source, low-code system for LLM-powered data processing: https://t.co/VmJ1zyre6m

Document splitting is common for vector storage / retrieval, but useful context can be lost. @LangChainAI has 3 new "context-aware" text splitters that keep metadata about where each split came from. Works for code (py, js) c/o @cristobal_dev, PDFs c/o @CorranMac, and Markdown .. https://t.co/PaWzq33IoM
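To make the "context-aware" idea concrete, here is a minimal Python sketch of a Markdown splitter that records which headers each chunk sat under. It is not LangChain's implementation or API: the names `Chunk` and `split_markdown_by_headers` and the `max_level` parameter are hypothetical, chosen only for illustration. It also assumes ATX-style headers (`#`, `##`, ...) and does not special-case fenced code blocks.

```python
from dataclasses import dataclass, field


@dataclass
class Chunk:
    """A piece of text plus metadata describing where it came from."""
    text: str
    metadata: dict = field(default_factory=dict)


def split_markdown_by_headers(markdown: str, max_level: int = 3) -> list[Chunk]:
    """Split Markdown at headers, tagging each chunk with its header path.

    Hypothetical helper for illustration; assumes ATX headers only.
    """
    chunks: list[Chunk] = []
    path: dict[int, str] = {}  # header level -> title currently in effect
    buffer: list[str] = []     # body lines collected since the last header

    def flush() -> None:
        # Emit the buffered body (if any) with a snapshot of the header path.
        text = "\n".join(buffer).strip()
        if text:
            meta = {f"h{level}": title for level, title in sorted(path.items())}
            chunks.append(Chunk(text, meta))
        buffer.clear()

    for line in markdown.splitlines():
        stripped = line.lstrip()
        hashes = len(stripped) - len(stripped.lstrip("#"))
        # A header is 1..max_level '#' characters followed by a space.
        is_header = 1 <= hashes <= max_level and stripped[hashes:hashes + 1] == " "
        if is_header:
            flush()
            # A new header replaces any header at the same or a deeper level.
            for level in [lvl for lvl in path if lvl >= hashes]:
                del path[level]
            path[hashes] = stripped[hashes:].strip()
        else:
            buffer.append(line)
    flush()
    return chunks


doc = "# Guide\nIntro text.\n## Setup\nInstall it.\n## Usage\nRun it.\n# FAQ\nNone yet."
for chunk in split_markdown_by_headers(doc):
    print(chunk.metadata, "->", chunk.text)
```

Because each chunk carries its header path, a retriever can show or filter on that path (for example, only chunks under "Setup") instead of treating every split as context-free text.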
Trends – Artificial Intelligence (AI) – May 2025 – BOND
The document analyzes rapid growth and transformative trends in artificial intelligence, highlighting unprecedented user adoption, technological advances, global competition, enterprise AI integration, and associated benefits and risks shaping the future.
bondcap.com


