This document outlines standardized calling templates for all model offerings in SambaCloud, categorized by modality and capability. It includes key differences between each API usage pattern to ensure clarity and consistency for developers and users.
Authentication
All requests must include an API key:
-H "Authorization: Bearer <your-api-key>"
Model Categories & Usage Templates
Reasoning Models
These models are designed for complex reasoning, instruction following, and task planning.
| Model Name | Size |
|---|---|
| DeepSeek-R1-0528 | 671B |
| DeepSeek-R1-Distill-Llama-70B | 70B |
| Qwen3-32B | 32B |
Template
curl -H "Authorization: Bearer <your-api-key>" \
-H "Content-Type: application/json" \
-d '{
"stream": true,
"model": "<model-name>",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant"
},
{
"role": "user",
"content": "Hello"
}
]
}' \
-X POST https://api.sambanova.ai/v1/chat/completions
Same template used for all Reasoning and Text models
Text Generation Models
Optimized for fast, fluent natural language generation and summarization.
| Model Name | Size |
|---|---|
| DeepSeek-V3-0324 | 671B |
| Llama-3.3-Swallow-70B-Instruct-v0.4 | 70B |
| Meta-Llama-3.1-8B-Instruct | 8B |
| Meta-Llama-3.3-70B-Instruct | 70B |
Template
Identical to Reasoning models:
curl -H "Authorization: Bearer <your-api-key>" \
-H "Content-Type: application/json" \
-d '{
"stream": true,
"model": "<model-name>",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant"
},
{
"role": "user",
"content": "Hello"
}
]
}' \
-X POST https://api.sambanova.ai/v1/chat/completions
Image + Text (Multimodal)
Capable of interpreting visual input and combining it with language understanding.
Model Name
Llama-4-Maverick-17B-128E-Instruct
๐ Difference in Template
Instead of plain string content, the โmessagesโ block uses structured content with type: โtextโ and type: โimage_urlโ entries.
Template
curl -H "Authorization: Bearer <your-api-key>" \
-H "Content-Type: application/json" \
-d '{
"stream": true,
"model": "Llama-4-Maverick-17B-128E-Instruct",
"messages": [
{
"role": "user",
"content": [
{
"type": "text",
"text": "What do you see in this image?"
},
{
"type": "image_url",
"image_url": {
"url": "<image_in_base_64>"
}
}
]
}
]
}' \
-X POST https://api.sambanova.ai/v1/chat/completions
Audio / Text (Transcription)
Used to transcribe audio files using Whisper-like models.
| Model Name | Note |
|---|---|
| Whisper-Large-v3 | Beta |
๐ Difference in Template
This is a multipart/form-data POST request to a different endpoint, not JSON.
Template
curl --location 'https://api.sambanova.ai/v1/audio/transcriptions' \
--header 'Authorization: Bearer <your-api-key>' \
--form 'model="Whisper-Large-v3"' \
--form 'language="spanish"' \
--form 'response_format="json"' \
--form 'file=@"/path/to/audio/file.mp3"' \
--form 'stream="true"'
Summary of Differences
| Model Type | Endpoint | Body Type | Notes |
|---|---|---|---|
| Text / Reasoning | /v1/chat/completions | JSON | Standard messages array |
| Image + Text | /v1/chat/completions | JSON | Requires structured image content format |
| Audio / Text | /v1/audio/transcriptions | multipart/form-data | Separate endpoint, file upload required |
Best Practices
Always validate the model name from the official model list
Set โstreamโ: true for real-time output
Include a system role for consistent personality/context
Use proper formatting for multimodal models
Do not use chat JSON format for audio transcription