Release Notes - December 6, 2024

jimmy.lin · December 6, 2024, 5:00pm

December 5, 2024

We’re thrilled to introduce some of the most exciting Qwen models, along with the leading content moderation model, Llama Guard 3, now available on the SambaNova Cloud.

Qwen2.5 72B
The Qwen2.5-72B model is a 72B-parameter language model that excels in coding, mathematics, and multilingual understanding. Trained on an extensive dataset of 18 trillion tokens, it supports context lengths up to 128,000 tokens and can generate outputs exceeding 8,000 tokens. The model offers robust instruction-following capabilities and supports over 29 languages, including English, Chinese, French, Spanish, and more.
Qwen2.5 Coder 32B
The Qwen2.5-Coder-32B model is a 32B-parameter language model tailored for code-related tasks. It was trained on 5.5 trillion tokens, including source code and synthetic data. The model excels in code generation, reasoning, and debugging across 92 programming languages. Notably, it achieves a HumanEval score of 92.7%, matching the coding capability of GPT-4o, which makes it one of the best open-source coding models for coding-assistant applications.
Llama Guard 3 8B
Llama Guard 3-8B is a fine-tuned version of Meta’s Llama 3.1 model, specifically designed for content safety classification. It can be used to evaluate both the inputs (prompts) and outputs (responses) of LLMs for content safety moderation. It functions as an LLM that generates text indicating whether a given prompt or response is safe or unsafe. If the content is deemed unsafe, it also identifies the specific content categories that are violated, aligned with the 14 categories of the MLCommons standardized hazards taxonomy.
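
Because Llama Guard 3 returns its verdict as plain text, client code usually needs to turn that text into something structured. The following is a minimal sketch of such a parser, not an official client. It assumes the reply's first non-empty line is "safe" or "unsafe" and that, when unsafe, the next line is a comma-separated list of category codes such as S1 or S10. The names GuardVerdict and parse_guard_output are hypothetical helpers introduced for illustration only.

```python
from dataclasses import dataclass, field


@dataclass
class GuardVerdict:
    """Structured form of a Llama Guard style text verdict (hypothetical helper)."""
    safe: bool
    categories: list[str] = field(default_factory=list)


def parse_guard_output(text: str) -> GuardVerdict:
    """Parse a moderation reply.

    Assumed format: first non-empty line is "safe" or "unsafe"; if unsafe,
    the following line lists violated category codes separated by commas.
    """
    # Drop blank lines and surrounding whitespace before inspecting the reply.
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines:
        raise ValueError("empty moderation output")

    label = lines[0].lower()
    if label == "safe":
        return GuardVerdict(safe=True)
    if label == "unsafe":
        categories = []
        if len(lines) > 1:
            categories = [c.strip() for c in lines[1].split(",") if c.strip()]
        return GuardVerdict(safe=False, categories=categories)

    # Anything else is treated as an error rather than silently passed through.
    raise ValueError(f"unrecognized moderation label: {lines[0]!r}")


if __name__ == "__main__":
    print(parse_guard_output("unsafe\nS1,S10"))
```

Raising on an unrecognized label, instead of defaulting to "safe", is a deliberate choice here: a malformed reply should not let content through unchecked.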

And there’s more! This release also includes upgrades to the max context length for the following models.

Llama 3.2 1B model: max sequence length increased from 4k to 16k
Llama 3.1 70B model: max sequence length increased from 64k to 128k
Llama 3.1 405B model: max sequence length increased from 8k to 16k
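
For anyone who wants to check a request against the new limits before sending it, here is a small illustrative sketch. It rests on two assumptions not stated in these notes: that "k" means 1,024 tokens, and that the max sequence length covers the prompt plus the generated tokens. The dictionary keys are the display names used above, not API model identifiers, and fits_context is a hypothetical helper.

```python
# New max sequence lengths from this release, in tokens.
# Assumption: "16k" means 16 * 1024 and "128k" means 128 * 1024.
MAX_SEQ_LEN = {
    "Llama 3.2 1B": 16 * 1024,
    "Llama 3.1 70B": 128 * 1024,
    "Llama 3.1 405B": 16 * 1024,
}


def fits_context(model: str, prompt_tokens: int, max_new_tokens: int) -> bool:
    """Return True if prompt plus requested output fits the model's limit.

    Assumption: the limit applies to prompt and generated tokens combined.
    Raises KeyError for models that are not in the table above.
    """
    return prompt_tokens + max_new_tokens <= MAX_SEQ_LEN[model]


if __name__ == "__main__":
    # A 12,000-token prompt with 2,000 output tokens against the 405B limit.
    print(fits_context("Llama 3.1 405B", 12_000, 2_000))
```

Token counts depend on each model's tokenizer, so the caller has to supply them; this helper only does the comparison.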

Please refer to the Supported Models page for more details on the supported configurations and their model cards.