Changelog #83 - QwQ 32B and R1 1776 Distill Llama 70B One-Click Models, 8x H100 GPUs, and more

Hello, and welcome to this week’s changelog update! Let’s dive into what’s new:

  1. New One-Click AI Models: QwQ 32B, R1 1776 Distill Llama 70B, and more

    You can now deploy the latest QwQ 32B from Qwen, R1 1776 Distill Llama 70B from Perplexity, and Coder and Math-specific models based on Qwen 2.5 with one click from the catalog.

  2. 2x, 4x, and 8x H100 GPU Instances available on request

    We now offer 2x, 4x, and 8x H100 GPU instances. These new instances allow you to right-size GPU acceleration for your workloads on H100.

  3. Control panel: New instance and regions selector

    We’ve simplified instance and region selection for services under a unified section. This change eliminates the need to adjust these settings separately.

1 Like