Hello there,
We are attempting to serve Gemma 3 (google/gemma-3-12b-it) via vLLM and want to enable native OpenAI-compatible tool calling.
When sending a chat completion request that includes:
"tool_choice": "auto"
the API consistently returns the following error:
Error: status_code: 400 “auto” tool choice requires --enable-auto-tool-choice and --tool-call-parser to be set
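For reproducibility, here is a minimal example of the kind of request body that triggers the error. The tool schema below is an illustrative placeholder, not our real tool definition:

```python
import json

# Minimal chat-completion payload of the kind that produces the 400 error.
# The get_weather tool is a placeholder for illustration only.
payload = {
    "model": "google/gemma-3-12b-it",
    "messages": [
        {"role": "user", "content": "What is the weather in Paris?"}
    ],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Look up the current weather for a city.",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
    "tool_choice": "auto",  # the field the server rejects with the 400
}

print(json.dumps(payload, indent=2))
```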
We have configured the following environment variables in the Koyeb service:

- VLLM_ENABLE_AUTO_TOOL_CHOICE=true
- VLLM_TOOL_CALL_PARSER=pythonic
After redeploying the service, the same error is still returned. Are these environment variables the correct way to set these options on Koyeb, or do the flags need to be passed directly on the vLLM launch command?
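For reference, our understanding is that these settings are meant to correspond to the CLI flags named in the error message, roughly like the following launch command (a sketch; we have not confirmed how Koyeb maps the environment variables to the command line):

```shell
# The two flags named by the 400 error, with the parser we are targeting.
vllm serve google/gemma-3-12b-it \
  --enable-auto-tool-choice \
  --tool-call-parser pythonic
```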