
GitHub - marsupialtail/rottnest: Data lake indices

Indexify - Extraction and Retrieval from Videos, PDF and Audio for Interactive AI Applications
Indexify is an open-source engine for buidling fast data pipelines for unstructured data(video, audio, images and documents) using re-usable extractors for embedding, transformatio... See more
LLM applications backed by Indexify will never answer outdated information.
Indexify is an open-source engine for buidling fast data pipelines for unstructured data(video, audio, images and documents) using re-usable extractors for embedding, transformatio... See more
tensorlakeai • GitHub - tensorlakeai/indexify: A scalable realtime and continuous indexing engine for Unstructured Data to build Generative AI Applications

For a collection of advanced Retrieval-Augmented Generation (RAG) techniques this is a very resourceful repo.
Many topics are covered like
- Metadata Filtering: Apply filters based on attributes like date, source, author, or document type.
- Similarity Threshold... See more
Building always-on, business-critical AI applications or agents on a constantly updating and growing volume of unstructured data requires resilient and fast data infrastructure.
I am super excited to finally announce @tensorlake's open-source, real-time data framework, Indexify.
Real-time processing: Opt... See more
Diptanu Choudhuryx.com