Sublime
An inspiration engine for ideas
linus.systems
linus.systems
LoRAX (LoRA eXchange) is a framework that allows users to serve thousands of fine-tuned models on a single GPU, dramatically reducing the cost of serving without compromising on throughput or latency.
π Table of contents
π Table of contents
- π Table of contents
- π³ Features
- π Models
- πββοΈ Getting started with Docker
- Launch LoRAX Server
- Prompt via REST API
- Prompt via Python
predibase β’ GitHub - predibase/lorax: Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
the linen service @linenservice
instagram.com
Lina
@lolaluz
Lina Yang
@linahhhh
Lina Lang
@littlegoose
lina
@firesignemoji
Lina Guerrero
@linaguerrero