Changelog #55 - Serverless GPUs in Private Preview, performance improvements on API endpoints, and new 1-click apps

Hello, and welcome to this week’s changelog update! Let’s jump into what’s new:

  1. Serverless GPUs in Private Preview

    We’re excited to share that Serverless GPUs are available for all your AI inference needs directly through the Koyeb platform! We’re starting with four Instances: RTX 4000 SFF ADA, L4, V100, and L40S. These GPUs provide up to 48GB of vRAM, 733 TFLOPS, and 900GB/s of memory bandwidth to support large models, including LLMs and text-to-image models. To access these GPU Instances, join the preview.

  2. Performance improvements on API endpoints

    We’ve improved the performance of several list API endpoints. Listing Koyeb resources like Apps, Services, and Secrets is now faster, so you can quickly check the current state of your infrastructure.
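    As an illustration, here is a minimal Python sketch of listing Apps over the REST API. The endpoint path, auth header, and response key are assumptions based on common Koyeb API conventions; consult the official API reference before relying on them:

    ```python
    import json
    import urllib.request

    # Assumed base URL for the Koyeb REST API (verify against the API reference).
    API_BASE = "https://app.koyeb.com/v1"

    def list_apps(token: str) -> list:
        """Return the Apps visible to the given API token.

        The "apps" key in the JSON response is an assumption about
        the payload shape, not a documented guarantee here.
        """
        req = urllib.request.Request(
            f"{API_BASE}/apps",
            headers={"Authorization": f"Bearer {token}"},
        )
        with urllib.request.urlopen(req, timeout=10) as resp:
            return json.load(resp).get("apps", [])
    ```

    The same pattern applies to the other list endpoints (e.g. Services and Secrets) by swapping the path segment.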

  3. New one-click apps and tutorials: Directus, Flowise, and Wasp

    We published three new tutorials to deploy Directus, Flowise, and Wasp on Koyeb. Directus is an open data platform that serves as a headless CMS and powerful Backend-as-a-Service (BaaS). Flowise is an open-source AI tool primarily oriented towards building custom LLM workflows and creating AI agents. Wasp is a full-stack web framework built to provide a modern take on a Rails-like development experience.