GitHub - marsupialtail/rottnest: Data lake indices

Related Highlights

Welcome to RAGatouille

Easily use and train state of the art retrieval methods in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

The main motivation of RAGatouille is simple: bridging the gap between state-of-the-art research and alchemical RAG pipeline practices. RAG is complex, and there are many moving parts. To g...

GitHub - bclavie/RAGatouille: Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

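A web tool for visualizing Microsoft GraphRAG data: graphrag-visualizer. Upload the parquet files generated by the GraphRAG indexing pipeline to view and analyze the data directly. It supports 2D and 3D graph display, the data shown from the parquet files can be searched by node and relationship, and all data is processed locally. GitHub link in the next post. #graphrag数据可视化 #数据可视化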

AIGCLINK x.com

GitHub - tidewave-ai/tidewave_phoenix: Tidewave for Phoenix

Dashbit github.com

The language is always only as good as its community. Let’s look at some of the existing open-source tools and frameworks built in and around Rust:

DataFusion (based on Apache Arrow): the Apache Arrow DataFusion SQL query engine, similar to Spark

Polars: It’s a faster Pandas. Probably going to compete with DuckDB (?)

Delta Lake Rust: A native Rust library fo

Data Engineering • Rust for Data Engineering

GitHub - joschan21/resumable-llm-streams

joschan21 github.com