GitHub - predibase/lorax: Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

GitHub - predibase/lorax: Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

predibasegithub.com
Thumbnail of GitHub - predibase/lorax: Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

traceloop GitHub - traceloop/openllmetry: Open-source observability for your LLM application, based on OpenTelemetry

Mozilla-Ocho GitHub - Mozilla-Ocho/llamafile: Distribute and run LLMs with a single file.

jafioti GitHub - jafioti/luminal: Deep learning at the speed of light.