GitHub - Unstructured-IO/unstructured: Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

GitHub - Unstructured-IO/unstructured: Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

github.com
Thumbnail of GitHub - Unstructured-IO/unstructured: Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

GitHub - jxnl/instructor-classify

github.com
Thumbnail of GitHub - jxnl/instructor-classify

LLM data - Anna’s Archive

annas-archive.org
Thumbnail of LLM data - Anna’s Archive

Gitingest

gitingest.com
Thumbnail of Gitingest