Supported Models

karan.srivastava · December 14, 2024, 8:00am

Access DeepSeek R1, Llama 3.3, 3.2, and 3.1 family of models, along with the Qwen2.5 family of models at full precision via the SambaNova Cloud API!

All models are available to all tiers.

Model details:

DeepSeek-R1 family

DeepSeek Distill R1 Llama 70B
- Model ID: DeepSeek-R1-Distill-Llama-70B
- Developer: DeepSeek AI
- Context length: 16k tokens
- Model card: View on Hugging Face

Tulu 3 family

Llama 3.1 Tulu 3 405B
- Model ID: Llama-3.1-Tulu-3-405B
- Developer: Allen Institute for AI
- Context length: 16k tokens
- Model card: View on Hugging Face

Qwen 2.5 family

Qwen2.5 Coder 32B
- Model ID: Qwen2.5-Coder-32B-Instruct
- Developer: Qwen
- Context length: 16k tokens
- Model card: View on Hugging Face
QwQ 32B Preview
- Model ID: QwQ-32B-Preview
- Developer: Qwen
- Context length: 16k tokens
- Model card: View on Hugging Face
Qwen2.5 72B
- Model ID: Qwen2.5-72B-Instruct
- Developer: Qwen
- Context length: 16k tokens
- Model card: View on Hugging Face
Qwen2 Audio ^[1]
- Model ID: Qwen2-Audio-7B-Instruct
- Developer: Qwen
- Context length: 16k tokens
- Model card: View on Hugging Face

Llama 3.3 family

Llama 3.3 70B
- Model ID: Meta-Llama-3.3-70B-Instruct
- Developer: Meta
- Context length: 16k tokens (Support up to 128k in Beta. Please contract our support to get early access)
- Model card: View on Hugging Face

Llama 3.2 family

Llama 3.2 1B
- Model ID: Meta-Llama-3.2-1B-Instruct
- Developer: Meta
- Context length: 16k tokens
- Model card: View on Hugging Face
Llama 3.2 3B
- Model ID: Meta-Llama-3.2-3B-Instruct
- Developer: Meta
- Context length: 8k tokens
- Model card: View on Hugging Face
Llama 3.2 11B Vision
- Model ID: Llama-3.2-11B-Vision-Instruct
- Developer: Meta
- Context length: 4k tokens
- Model card: View on Hugging Face
Llama 3.2 90B Vision
- Model ID: Llama-3.2-90B-Vision-Instruct
- Developer: Meta
- Context length: 4k tokens
- Model card: View on Hugging Face

Llama 3.1 family

Llama 3.1 8B
- Model ID: Meta-Llama-3.1-8B-Instruct
- Developer: Meta
- Context length: 16k tokens
- Model card: View on GitHub
Llama 3.1 70B
- Model ID: Meta-Llama-3.1-70B-Instruct
- Developer: Meta
- Context length: 128k tokens
- Model card: View on GitHub
Llama 3.1 405B
- Model ID: Meta-Llama-3.1-405B-Instruct
- Developer: Meta
- Context length: 16k tokens
- Model card: View on GitHub
Llama Guard 3 8B
- Model ID: Meta-Llama-Guard-3-8B
- Developer: Meta
- Context length: 8k tokens
- Model card: View on Meta’s model card

Explanation of Qwen2 Audio The Qwen2 Audio refers to a specific artificial intelligence model developed by Qwen, designed to process and generate audio-related content. With a model ID of Qwen2-Audio-7B-Instruct, it is part of the Qwen 2.5 family of models and has a context length of 16k tokens. This model is available via the SambaNova Cloud API, suggesting its application in tasks that require audio understanding or generation, such as voice assistants, audio transcription, or music synthesis. The inclusion of “Audio” in its name distinguishes it from other models in the Qwen2.5 family that might be more focused on text-based tasks. (Explanation by AI) ↩︎

Topic		Replies	Views
Samba-1 model on playground SambaNova Devs dev	1	72	November 3, 2024
Release Notes - October 29, 2024 Release Notes doc	0	658	October 1, 2024
Context Length for the Meta-Llama-3.1-405B-Instruct is too small Discussion	17	797	November 18, 2024
Llama3.2 on sambanova SambaNova Devs dev	4	224	September 26, 2024
About the SambaNova Documentation category SambaNova Documentation	0	247	August 6, 2024

Supported Models

Model details:

DeepSeek-R1 family

Tulu 3 family

Qwen 2.5 family

Llama 3.3 family

Llama 3.2 family

Llama 3.1 family

Related topics