GitHub - NVIDIA/NeMo-Curator: Scalable toolkit for data curation

github.com

Tiago Forte: The Heart Is the Bottleneck

GitHub - google/magika: Detect file content types with deep learning

Models All The Way Down

knowingmachines.org