Sampling parameters & vLLM settings for tau2-bench?
#4
by
lewtun
- opened
Hello Qwen team,
I was wondering if you could share the sampling parameters you used for tau2-bench
? By default, the benchmark uses temperature=0
and I wonder if you changed this to the values adopted in the Qwen3 tech report (namely, temperature=0.6
, top_p=0.95
and top_k=20
)?
Also, if you used vllm
to generate the trajectories, can you share the command that was used to create the server? Currently I create the server as follows:
vllm serve Qwen/Qwen3-4B-Thinking-2507 \
--tensor-parallel-size 1 \
--trust-remote-code \
--enable-auto-tool-choice \
--tool-call-parser hermes \
--host 0.0.0.0 --port 8000
I am unsure if one also needs to provide a reasoning parser as well.
Thank you!