Sean13/llama-8b-instruct-simpo-full-label_smoothing-0.1 Text Generation • 266k • Updated 11 days ago • 14
shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-no-packing-new Text Generation • 266k • Updated 11 days ago • 24
shuoxing/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-no-packing-new Text Generation • 266k • Updated 11 days ago • 28
Sean13/llama-8b-instruct-v0.2-cpo-full-label_smoothing-0.1 Text Generation • 266k • Updated 11 days ago • 11
shuoxing/llama3-8b-full-pretrain-mix-high-tweet-1m-en-no-packing-new Text Generation • 266k • Updated 11 days ago • 31
shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-no-packing-new Text Generation • 266k • Updated 11 days ago • 27