Sampling parameters & vLLM settings for tau2-bench?

#4
by lewtun - opened

Hello Qwen team,

I was wondering if you could share the sampling parameters you used for tau2-bench? By default, the benchmark uses temperature=0 and I wonder if you changed this to the values adopted in the Qwen3 tech report (namely, temperature=0.6, top_p=0.95 and top_k=20)?

Also, if you used vllm to generate the trajectories, can you share the command that was used to create the server? Currently I create the server as follows:

vllm serve Qwen/Qwen3-4B-Thinking-2507 \
    --tensor-parallel-size 1 \
    --trust-remote-code \
    --enable-auto-tool-choice \
    --tool-call-parser hermes \
    --host 0.0.0.0 --port 8000

I am unsure if one also needs to provide a reasoning parser as well.

Thank you!

Sign up or log in to comment