GitHub - michaelfeil/infinity: Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of sentence-transformer models and frameworks.

GitHub - michaelfeil/infinity: Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of sentence-transformer models and frameworks.

michaelfeilgithub.com
Thumbnail of GitHub - michaelfeil/infinity: Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of sentence-transformer models and frameworks.

tensorlakeai GitHub - tensorlakeai/indexify: A scalable realtime and continuous indexing engine for Unstructured Data to build Generative AI Applications

fal.ai

fal.ai
Thumbnail of fal.ai

dgarnitz GitHub - dgarnitz/vectorflow: VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.

Self-Host DeepSeek with Ollama and Open WebUI

Jeremynoted.lol
Thumbnail of Self-Host DeepSeek with Ollama and Open WebUI

GitHub - huggingface/text-embeddings-inference: A blazing fast inference solution for text embeddings models

huggingfacegithub.com
Thumbnail of GitHub - huggingface/text-embeddings-inference: A blazing fast inference solution for text embeddings models