GitHub - predibase/lorax: Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

GitHub - predibase/lorax: Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

predibasegithub.com
Thumbnail of GitHub - predibase/lorax: Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

GitHub - ai-hero/llm-research-fine-tuning

GitHub - circlemind-ai/fast-graphrag: RAG that intelligently adapts to your use case, data, and queries

Charles Dickensgithub.com
Thumbnail of GitHub - circlemind-ai/fast-graphrag: RAG that intelligently adapts to your use case, data, and queries

ghimiresunil GitHub - ghimiresunil/LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing: LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.

Jason Risch Self-Serve Apps for ML Teams | Greylock

GitHub - SeldonIO/MLServer: An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more