Sublime
An inspiration engine for ideas
PDFs are satanโs file format.
Almost everyone that builds RAG needs to deal with them - and it sucks.
Solutions on the market are either too slow, too expensive or not OSS.
It should be easier. Which is why weโre open sourcing https://t.co/0gCZxzbkWu
Ishaan Kapoorx.comGhanshyam call
Basic Editor
Logline class dhruv
Lights
Dhruv
Stree
Idm
Fsm
DataTrove
DataTrove is a library to process, filter and deduplicate text data at a very large scale. It provides a set of prebuilt commonly used processing blocks with a framework to easily add custom functionality.
DataTrove processing pipelines are platform-agnostic, running out of the box locally or on a slurm cluster. Its (relatively) low memory... See more
DataTrove is a library to process, filter and deduplicate text data at a very large scale. It provides a set of prebuilt commonly used processing blocks with a framework to easily add custom functionality.
DataTrove processing pipelines are platform-agnostic, running out of the box locally or on a slurm cluster. Its (relatively) low memory... See more
huggingface โข GitHub - huggingface/datatrove: Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Writing my note
Fbjjbbmn
Vhjhhbbjjh
Fiberesima
some name
# some
#color
greate
โ๐ฑ๐ฌ ๐๐ฒ๐ป๐โ ๐ถ๐ ๐ป๐ผ๐ ๐๐ผ๐ฟ๐๐ต ๐ญ๐ญ๐ฌ ๐ฐ๐ฒ๐ป๐๐. A genius little website by Brian Moore adjusts the rapperโs name for inflation. Simple. Clever. Perfect internet. ๐ง๐ต๐ฎ๐โ๐ ๐๐ต๐ ๐โฆ | Javier Bidezabal
linkedin.comRAG Time
m.youtube.comtask-chunking
Kimsia โข 2 cards