Access DeepSeek R1, Llama 3.3, 3.2, and 3.1 family of models, along with the Qwen2.5 family of models at full precision via the SambaNova Cloud API!
All models are available to all tiers.
Model details:
DeepSeek-R1 family
- DeepSeek Distill R1 Llama 70B
- Model ID:
DeepSeek-R1-Distill-Llama-70B
- Developer: DeepSeek AI
- Context length: 16k tokens
- Model card: View on Hugging Face
- Model ID:
Tulu 3 family
- Llama 3.1 Tulu 3 405B
- Model ID:
Llama-3.1-Tulu-3-405B
- Developer: Allen Institute for AI
- Context length: 16k tokens
- Model card: View on Hugging Face
- Model ID:
Qwen 2.5 family
- Qwen2.5 Coder 32B
- Model ID:
Qwen2.5-Coder-32B-Instruct
- Developer: Qwen
- Context length: 16k tokens
- Model card: View on Hugging Face
- Model ID:
- QwQ 32B Preview
- Model ID:
QwQ-32B-Preview
- Developer: Qwen
- Context length: 16k tokens
- Model card: View on Hugging Face
- Model ID:
- Qwen2.5 72B
- Model ID:
Qwen2.5-72B-Instruct
- Developer: Qwen
- Context length: 16k tokens
- Model card: View on Hugging Face
- Model ID:
- Qwen2 Audio [1]
- Model ID:
Qwen2-Audio-7B-Instruct
- Developer: Qwen
- Context length: 16k tokens
- Model card: View on Hugging Face
- Model ID:
Llama 3.3 family
- Llama 3.3 70B
- Model ID:
Meta-Llama-3.3-70B-Instruct
- Developer: Meta
- Context length: 16k tokens (Support up to 128k in Beta. Please contract our support to get early access)
- Model card: View on Hugging Face
- Model ID:
Llama 3.2 family
- Llama 3.2 1B
- Model ID:
Meta-Llama-3.2-1B-Instruct
- Developer: Meta
- Context length: 16k tokens
- Model card: View on Hugging Face
- Model ID:
- Llama 3.2 3B
- Model ID:
Meta-Llama-3.2-3B-Instruct
- Developer: Meta
- Context length: 8k tokens
- Model card: View on Hugging Face
- Model ID:
- Llama 3.2 11B Vision
- Model ID:
Llama-3.2-11B-Vision-Instruct
- Developer: Meta
- Context length: 4k tokens
- Model card: View on Hugging Face
- Model ID:
- Llama 3.2 90B Vision
- Model ID:
Llama-3.2-90B-Vision-Instruct
- Developer: Meta
- Context length: 4k tokens
- Model card: View on Hugging Face
- Model ID:
Llama 3.1 family
-
Llama 3.1 8B
- Model ID:
Meta-Llama-3.1-8B-Instruct
- Developer: Meta
- Context length: 16k tokens
- Model card: View on GitHub
- Model ID:
-
Llama 3.1 70B
- Model ID:
Meta-Llama-3.1-70B-Instruct
- Developer: Meta
- Context length: 128k tokens
- Model card: View on GitHub
- Model ID:
-
Llama 3.1 405B
- Model ID:
Meta-Llama-3.1-405B-Instruct
- Developer: Meta
- Context length: 16k tokens
- Model card: View on GitHub
- Model ID:
-
Llama Guard 3 8B
- Model ID:
Meta-Llama-Guard-3-8B
- Developer: Meta
- Context length: 8k tokens
- Model card: View on Meta’s model card
- Model ID:
Explanation of Qwen2 Audio The Qwen2 Audio refers to a specific artificial intelligence model developed by Qwen, designed to process and generate audio-related content. With a model ID of
Qwen2-Audio-7B-Instruct
, it is part of the Qwen 2.5 family of models and has a context length of 16k tokens. This model is available via the SambaNova Cloud API, suggesting its application in tasks that require audio understanding or generation, such as voice assistants, audio transcription, or music synthesis. The inclusion of “Audio” in its name distinguishes it from other models in the Qwen2.5 family that might be more focused on text-based tasks. (Explanation by AI) ↩︎