Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Neelectric
/
Llama-3.1-8B-Instruct_SFT_Math-220kv00.08

Text Generation
Transformers
Safetensors
llama
Generated from Trainer
sft
open-r1
trl
conversational
text-generation-inference
Model card Files Files and versions
xet
Community
1
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Query re: Llama-3.1-8B SFT/GRPO Model Versions (Scalpel vs. Hammer)

👍 1
3
#1 opened 21 days ago by
chenth
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs