LLM Training

GitHub - NVIDIA/NeMo-Curator: Scalable toolkit for data curation

GitHub - mistralai/mistral-finetune

ghimiresunil GitHub - ghimiresunil/LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing: LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.

IBM GitHub - IBM/unitxt: 🦄 Unitxt: a python library for getting data fired up and set for training and evaluation

unslothai GitHub - unslothai/unsloth: 5X faster 50% less memory LLM finetuning

Ideas related to this collection