jinaai/jina-embeddings-v2-base-en · Hugging Face

Long-Context Retrieval Models with Monarch Mixer

FOD#27: "Now And Then"

michaelfeil GitHub - michaelfeil/infinity: Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of sentence-transformer models and frameworks.

DeepSeek Coder

young-geng GitHub - young-geng/EasyLM: Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

NVIDIA Technical Blog | News and tutorials for developers, data ...

google GitHub - google/maxtext: A simple, performant and scalable Jax LLM!

Rostlab/prot_bert · Hugging Face