Sublime
An inspiration engine for ideas
Writing
Jesse Burkunk • 4 cards
Writing
Michele Gregoire Gill • 1 card
Writing
Jessica • 8 cards

2-5x faster 50% less memory local LLM finetuning
- Manual autograd engine - hand derived backprop steps.
- 2x to 5x faster than QLoRA. 50% less memory usage.
- All kernels written in OpenAI's Triton language.
- 0% loss in accuracy - no approximation methods - all exact.
- No change of hardware necessary. Supports NVIDIA GPUs since 2018+. Minimum CUDA Compute Cap
unslothai • GitHub - unslothai/unsloth: 5X faster 50% less memory LLM finetuning
fun
Yelyzaveta Hordii • 1 card
Writing
Cyrus Chen • 15 cards
blogging
Estebantxo • 1 card
deprecating GPT‐4.5 Preview in the API, as GPT‐4.1 offers improved or similar performance on many key capabilities at much lower cost and latency