jinaai/jina-embeddings-v2-base-en · Hugging Face
Text embeddings are a critical piece of many pipelines, from search to RAG to vector databases and more. Most embedding models are BERT/Transformer-based and typically have short context lengths (e.g., 512 tokens). That's only about two pages of text, but documents can be very long: books, legal cases, TV screenplays, code repositories, and more can run to tens of thousands of tokens.
Long-Context Retrieval Models with Monarch Mixer
Nicolay Gerold added
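As a concrete counterpoint to the 512-token limit above, jina-embeddings-v2-base-en accepts sequences up to 8,192 tokens. A minimal sketch following the pattern on its Hugging Face model card; the encode() helper ships with the model's remote code, so treat the exact call as an assumption:

```python
from transformers import AutoModel

# Supports up to 8192 tokens per input, vs. the usual 512-token BERT limit.
model = AutoModel.from_pretrained(
    "jinaai/jina-embeddings-v2-base-en", trust_remote_code=True
)

long_document = "…a legal case or screenplay far longer than 512 tokens…"
embeddings = model.encode([long_document])  # encode() comes with the model code
print(embeddings.shape)  # (1, 768) -- one 768-dim vector per input
```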
- Cohere introduced Embed v3, an advanced model for generating document embeddings, boasting top performance on several benchmarks. It excels at matching queries to document topics and at assessing content quality, improving search applications and retrieval-augmented generation (RAG) systems. The new version offers models with 1024 or 384 dimensions and supports over 100 languages in its multilingual variants.
FOD#27: "Now And Then"
Nicolay Gerold added
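A minimal sketch of calling Embed v3 with the Cohere Python SDK. The input_type field is what lets v3 specialize embeddings for stored documents vs. incoming queries; the API key is a placeholder and the texts are illustrative:

```python
import cohere

co = cohere.Client("YOUR_API_KEY")  # placeholder key

# input_type tells v3 whether it is embedding corpus documents or a query,
# which is what improves retrieval and RAG matching.
docs = co.embed(
    texts=["Embed v3 announcement text.", "Another document."],
    model="embed-english-v3.0",  # 1024-dim; a 384-dim "light" variant also exists
    input_type="search_document",
)
query = co.embed(
    texts=["what did cohere announce?"],
    model="embed-english-v3.0",
    input_type="search_query",
)
print(len(docs.embeddings[0]))  # 1024
```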
Why Infinity:
Infinity provides the following features:
- Deploy virtually any SentenceTransformer: serve the models you already know from SentenceTransformers.
- Fast inference backends: the inference server is built on top of torch, fastembed (ONNX on CPU), and CTranslate2, getting the most out of your CUDA or CPU hardware.
- Dynamic batching: new embedding requests are queued while the device is busy and grouped into batches as soon as it frees up, maximizing throughput (see the request sketch after this entry).
michaelfeil • GitHub - michaelfeil/infinity: Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of sentence-transformer models and frameworks.
Nicolay Gerold added
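A sketch of querying a locally running Infinity server, which exposes an OpenAI-compatible /embeddings route. The port, CLI flags, and model id below are assumptions and vary across Infinity versions:

```python
import requests

# Assumes a server was started with something like (flags differ by version):
#   infinity_emb v2 --model-id sentence-transformers/all-MiniLM-L6-v2
URL = "http://localhost:7997/embeddings"  # 7997 assumed as the default port

resp = requests.post(
    URL,
    json={
        "model": "sentence-transformers/all-MiniLM-L6-v2",
        "input": ["Dynamic batching groups concurrent requests."],
    },
    timeout=30,
)
resp.raise_for_status()
print(resp.json()["data"][0]["embedding"][:5])  # OpenAI-style response body
```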
DeepSeek Coder comprises a series of code language models trained from scratch on a mix of 87% code and 13% natural language in English and Chinese, with each model pre-trained on 2T tokens. We provide various sizes of the code model, ranging from 1B to 33B versions. Each model is pre-trained on a repo-level code corpus with a window size of 16K and an extra fill-in-the-blank task, to support project-level code completion and infilling.
DeepSeek Coder
Nicolay Gerold added
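A hedged sketch of plain code completion with the smallest family member via transformers; the model id matches the DeepSeek AI hub, and the prompt and generation settings are illustrative:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "deepseek-ai/deepseek-coder-1.3b-base"  # 1.3B base variant
tok = AutoTokenizer.from_pretrained(name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(name, trust_remote_code=True)

prompt = "# Python: compute a rolling average over a list\ndef rolling_avg("
inputs = tok(prompt, return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=64)
print(tok.decode(out[0], skip_special_tokens=True))
```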
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, finetuning, evaluating, and serving LLMs in JAX/Flax. EasyLM can scale LLM training up to hundreds of TPU/GPU accelerators by leveraging JAX's pjit functionality.
Building on top of Hugging Face's transformers and datasets, this repo provides an easy-to-use and easy-to-customize codebase for training large language models.
young-geng • GitHub - young-geng/EasyLM: Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Nicolay Gerold added
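EasyLM's scaling rests on JAX's pjit. The sketch below is not EasyLM's own API, just a minimal illustration of the underlying primitive: shard a batch across a 1-D device mesh and let XLA partition the computation. It assumes the batch size divides the device count; the sharding kwargs were named in_axis_resources/out_axis_resources in older JAX releases:

```python
from functools import partial

import numpy as np
import jax
import jax.numpy as jnp
from jax.experimental.pjit import pjit
from jax.sharding import Mesh, PartitionSpec as P

# 1-D mesh over whatever accelerators (or CPU devices) are visible.
mesh = Mesh(np.array(jax.devices()), axis_names=("data",))

# Shard the batch dimension of x across the "data" mesh axis; replicate w.
@partial(pjit,
         in_shardings=(P("data", None), P(None, None)),
         out_shardings=P("data", None))
def forward(x, w):
    return jnp.tanh(x @ w)

x = jnp.ones((8, 512))   # batch (8) must divide evenly across the devices
w = jnp.ones((512, 256))
with mesh:               # PartitionSpecs are resolved against this mesh
    y = forward(x, w)
print(y.shape)           # (8, 256)
```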
The Nemotron-3 8B family is available in the Azure AI Model Catalog, Hugging Face, and the NVIDIA AI Foundation Model hub on the NVIDIA NGC Catalog. It includes base, chat, and question-and-answer (Q&A) models designed to solve a variety of downstream tasks. Table 1 shows the full family of foundation models.
[Table 1: Nemotron-3 8B model family; columns: Model, Variant, Key Benefit]
NVIDIA Technical Blog
Nicolay Gerold added
Overview
MaxText is a high-performance, highly scalable, open-source LLM written in pure Python/Jax and targeting Google Cloud TPUs and GPUs for training and inference. MaxText achieves high MFUs and scales from a single host to very large clusters while staying simple and "optimization-free" thanks to the power of Jax and the XLA compiler.
google • GitHub - google/maxtext: A simple, performant and scalable Jax LLM!
Nicolay Gerold added
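MFU (model FLOPs utilization) is the headline metric here: achieved training FLOPs/s divided by the hardware's peak. A back-of-the-envelope sketch using the common ~6 FLOPs per parameter per token approximation for training; the constant and the exclusion of attention FLOPs are simplifying assumptions, not MaxText's own accounting:

```python
def mfu(params: float, tokens_per_sec: float, peak_flops_per_sec: float) -> float:
    """Rough model-FLOPs-utilization estimate.

    Uses the common ~6 * N FLOPs/token rule of thumb for training a model
    with N parameters (forward + backward), ignoring attention FLOPs.
    """
    achieved_flops_per_sec = 6.0 * params * tokens_per_sec
    return achieved_flops_per_sec / peak_flops_per_sec

# Example: a 7B-parameter model at 10,000 tokens/s on hardware with a
# summed peak of 1e15 FLOPs/s.
print(f"MFU ~ {mfu(7e9, 1e4, 1e15):.1%}")  # -> MFU ~ 42.0%
```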
Pretrained model on protein sequences using a masked language modeling (MLM) objective. It was introduced in this paper and first released in this repository. The model is trained on uppercase amino acids: it only works with capital-letter amino acids.
Model description
ProtBert is based on the BERT model, pretrained on a large corpus of protein sequences in a self-supervised fashion.
Rostlab/prot_bert · Hugging Face
Nicolay Gerold added
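A minimal fill-mask sketch in the style of the model card: ProtBert's tokenizer expects uppercase amino acids separated by spaces, and rare residues (U, Z, O, B) are conventionally mapped to X. The sequence below is illustrative, not from the source:

```python
import re
from transformers import pipeline

unmasker = pipeline("fill-mask", model="Rostlab/prot_bert")

seq = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"      # illustrative protein sequence
seq = " ".join(re.sub(r"[UZOB]", "X", seq))    # space-separate, map rare residues

masked = seq.replace("Q", "[MASK]", 1)         # hide one residue for the MLM head
for pred in unmasker(masked)[:3]:
    print(pred["token_str"], round(pred["score"], 3))
```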