GitHub - michaelfeil/infinity: Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of sentence-transformer models and frameworks.

github.com

Together AI – The AI Acceleration Cloud - Fast Inference, Fine-Tuning & Training

together.ai

Phil Wittig Workers AI: serverless GPU-powered inference on Cloudflare’s global network

Introduction

GitHub - mem0ai/mem0: The memory layer for Personalized AI

GitHub - google/maxtext: A simple, performant and scalable Jax LLM!