Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Chaew00n
/
test-policy-optimization-query-rewrite-llama3B-prompt1
like
0
Transformers
Safetensors
Generated from Trainer
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
test-policy-optimization-query-rewrite-llama3B-prompt1
/
.gitattributes
Commit History
Training in progress, step 5000, checkpoint
a7c29a9
verified
Chaew00n
commited on
Jun 11
Training in progress, step 4000, checkpoint
f388005
verified
Chaew00n
commited on
Jun 11
Training in progress, step 3000, checkpoint
85806fa
verified
Chaew00n
commited on
Jun 11
Training in progress, step 2000, checkpoint
489022b
verified
Chaew00n
commited on
Jun 11
Training in progress, step 1000, checkpoint
afdf379
verified
Chaew00n
commited on
Jun 11
Training in progress, step 1000
6c6a9b2
verified
Chaew00n
commited on
Jun 11
initial commit
0b75de0
verified
Chaew00n
commited on
Jun 11