dian1414
/

Qwen3-1.7B-GRPO-zero3

Generated from Trainer

Model card Files Files and versions

Qwen3-1.7B-GRPO-zero3 / merges.txt

dian1414's picture

End of training

9a9abe1 verified 7 months ago

history contribute delete

1.67 MB

File too large to display, you can check the raw version instead.