Release Notes - December 11, 2024

December 11, 2024

The latest Llama 3.3 70B model from Meta and the new leading open-source reasoning model QwQ from Alibaba’s Qwen team are now available on the SambaNova Cloud.

  • Llama 3.3 70B
    The latest Llama 3.3 70B model release from Meta showcases impressive capabilities across multiple domains, including reasoning, mathematical problem-solving, and general knowledge assessment. It delivers comparable performance to the Llama 3.1 405B. Benchmark comparisons suggest that it competes closely with leading proprietary models like OpenAI’s GPT-4o and Google’s Gemini Pro 1.5. This make it yet another leading example of how open-source models are rapidly catching up to, and even surpassing—proprietary models.

  • QwQ 32B Preview
    The QwQ-32B-Preview model is an experimental AI model designed to enhance reasoning capabilities developed by Alibaba’s Qwen team. With 32.5 billion parameters, it excels in complex tasks such as mathematics and programming. Notably, it achieves scores of 65.2% on the Graduate-Level Google-Proof Q&A (GPQA), 50.0% on the American Invitational Mathematics Examination (AIME), 90.6% on MATH-500, and 50.0% on LiveCodeBench, indicating strong analytical proficiency. Given this is a preview release, it has limitations, including potential language mixing, recursive reasoning loops, and areas needing improvement like common sense reasoning and nuanced language understanding.

Please refer to the Supported Models page for more details of the supported configurations and their model cards.

2 Likes