Are Llama Vision Models 'Blind,' or Do I Just Not Know How to Use Them?

Hi @malawad , Can you please re-confirm whether you are using this request format for vision models.
[{ "type": "text", "text": "What's in this image?" }, { "type": "image_url", "image_url": { "url": "base64 encoded string of image" } }]

Sample request body is mentioned here Sambanova_cloud_api_reference

Thanks & Regards

4 Likes