Issues running a model

Hello there,

I am trying to deploy the Qwen-2-72B-4bit model. The main issue I’m encountering is that after successfully deploying with Docker, my requests are failing with error code 524.

Additionally, since this is a vision model, I need guidance on how to properly include images in the requests. I want to ensure the image handling isn’t causing the error code.

Thank you in advance.

Hi @Andrea_Mikula,

We have example apps for VL models deployed with vLLM, such as Deploy Qwen 2.5 VL 72B Instruct One-Click App - Koyeb.

In it, we suggest a way to generate completions with the model as follows:

import os

from openai import OpenAI

# Point the client at your Koyeb deployment's OpenAI-compatible endpoint.
client = OpenAI(
    api_key=os.environ.get("OPENAI_API_KEY", "fake"),
    base_url="https://<YOUR_DOMAIN_PREFIX>.koyeb.app/v1",
)

chat_completion = client.chat.completions.create(
    messages=[
        {
            "role": "user",
            # For vision models, "content" is a list mixing image and text parts.
            "content": [
                {
                    "type": "image_url",
                    "image_url": {
                        "url": "https://images.unsplash.com/photo-1506744038136-46273834b3fb"
                    },
                },
                {"type": "text", "text": "Describe the image."},
            ],
        },
    ],
    model="Qwen/Qwen2.5-VL-72B-Instruct",
    max_tokens=50,
)

print(chat_completion.to_json(indent=4))

Is this what you are looking for?
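If you need to send a local image rather than a public URL, the OpenAI-compatible API also accepts base64-encoded `data:` URLs in the `image_url` field. A minimal sketch (the helper name `to_data_url` and the file path are our own, not part of any SDK):

```python
import base64


def to_data_url(path: str, mime: str = "image/jpeg") -> str:
    """Read a local image and encode it as a data: URL suitable for
    the image_url field of a vision chat message."""
    with open(path, "rb") as f:
        encoded = base64.b64encode(f.read()).decode("ascii")
    return f"data:{mime};base64,{encoded}"


# The resulting string drops into the same message structure as above:
#   {"type": "image_url", "image_url": {"url": to_data_url("photo.jpg")}}
```

This keeps the request self-contained, at the cost of a larger payload for big images.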
