HuggingFaceH4/zephyr-7b-alpha

HuggingFaceH4/zephyr-7b-alpha · Hugging Face

RelatedHighlights

Model Card for Zephyr 7B β

Zephyr is a series of language models that are trained to act as helpful assistants. Zephyr-7B-β is the second model in the series, and is a fine-tuned version of mistralai/Mistral-7B-v0.1 that was trained on on a mix of publicly available, synthetic datasets using Direct Preference Optimization (DPO). We found that removi... See more

HuggingFaceH4/zephyr-7b-beta · Hugging Face

Nicolay Gerold added

One of the focus areas at Together Research is new architectures for long context, improved training, and inference performance over the Transformer architecture. Spinning out of a research program from our team and academic collaborators, with roots in signal processing-inspired sequence models, we are excited to introduce the StripedHyena models.... See more

Paving the way to efficient architectures: StripedHyena-7B, open source models offering a glimpse into a world beyond Transformers

Nicolay Gerold added

Stable Beluga 2

Use Stable Chat (Research Preview) to test Stability AI's best language models for free

Model Description

Stable Beluga 2 is a Llama2 70B model finetuned on an Orca style Dataset

stabilityai/StableBeluga2 · Hugging Face

Nicolay Gerold added

koboldcpp

🤗 Transformers

Huggingface is an open source platform and community for deep learning models for language, vision, audio and multimodal. They develop and maintain the transformers library, which simplifies the process of downloading and training state of the art deep learning models.

This is the best library if you have a background in m... See more

Moyi • 10 Ways To Run LLMs Locally And Which One Works Best For You

Nicolay Gerold added

Text embeddings are a critical piece of many pipelines, from search, to RAG, to vector databases and more. Most embedding models are BERT/Transformer-based and typically have short context lengths (e.g., 512). That’s only about two pages of text, but documents can be very long – books, legal cases, TV screenplays, code repositories, etc can be tens... See more

Long-Context Retrieval Models with Monarch Mixer

Nicolay Gerold added

Hugging Face – The AI community building the future.

huggingface.co

sari and added

The Nemotron-3 8B family is available in the Azure AI Model Catalog, HuggingFace, and the NVIDIA AI Foundation Model hub on the NVIDIA NGC Catalog. It includes base, chat, and question-and-answer (Q&A) models that are designed to solve a variety of downstream tasks. Table 1 shows the full family of foundation models.

Model

Variant

Key Benefit

Ba... See more

NVIDIA Technical Blog | News and tutorials for developers, data ...

Nicolay Gerold added

The text embedding set trained by Jina AI, Finetuner team.

Intended Usage & Model Info

jina-embeddings-v2-base-en is an English, monolingual embedding model supporting 8192 sequence length.

It is based on a Bert architecture (JinaBert) that supports the symmetric bidirectional variant of ALiBi to allow longer sequence length.

The backbone jina-bert-v2-... See more

jinaai/jina-embeddings-v2-base-en · Hugging Face

Nicolay Gerold added