glaiveai/glaive-coder-7b · Hugging Face

>>> Qwen3-Coder is here! ✅
We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves top-tier performance across multiple agentic...

Earlier this year, we trained a 70B model optimized for reasoning and coding. This model roughly matches Llama 3 70B despite being trained on 7x less data.
Today, we’re releasing a toolkit to help others do the same, including:
• 11 sanitized and extended NLP reasoning benchmarks including...
