lapp0/distily_bench_obj_cross_v2

TensorBoard
Safetensors
Distily
gpt_neo
Generated from Trainer
8-bit precision
bitsandbytes
distily_bench_obj_cross_v2 / logs
32.2 MB
  • 1 contributor
History: 8 commits

This model has 1 file scanned as unsafe.

lapp0
End of training
6b1d380 verified over 1 year ago
  • attn_loss_fn=None, attn_weight=0, gradient_accumulation_steps=1, hs_loss_fn=mse, hs_weight=2.0, learning_rate=0.0004, lr_scheduler_kwargs=__num_cycles___4_, lr_scheduler_type=cosine_with_restarts, max
    Training in progress, step 12375 over 1 year ago
  • attn_loss_fn=None, attn_weight=0, gradient_accumulation_steps=1, hs_loss_fn=mse, hs_weight=2.0, learning_rate=0.0004, lr_scheduler_kwargs=__num_cycles___8_, lr_scheduler_type=cosine_with_restarts, max
    End of training over 1 year ago
  • attn_loss_fn=None, attn_weight=0, gradient_accumulation_steps=1, hs_loss_fn=mse, hs_weight=2.0, learning_rate=0.0004, lr_scheduler_type=cosine_with_restarts, max_grad_norm=None, num_cycles=4, optim=pa
    Training in progress, step 12375 over 1 year ago
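
The log directory names above appear to encode each run's hyperparameters as comma-separated `key=value` pairs, and the listing cuts the names off partway through. A minimal sketch for splitting such a name into a dictionary follows. The helper name `parse_run_name` is hypothetical and is not part of Distily or the Hub. Values are kept as strings, and a token with no `=` (such as the cut-off `max`) is mapped to `None` rather than guessed.

```python
def parse_run_name(name: str) -> dict:
    """Split a run directory name into a {key: value} dict of strings.

    Hypothetical helper for illustration; assumes comma-separated
    key=value tokens, as seen in the directory names in this listing.
    """
    params = {}
    for token in name.split(","):
        token = token.strip()
        if not token:
            continue
        key, sep, value = token.partition("=")
        # A token without "=" (for example, a name cut off by the listing)
        # is recorded with a value of None instead of being completed.
        params[key] = value if sep else None
    return params


# Example input copied from a prefix of one directory name in the listing.
run = (
    "attn_loss_fn=None, attn_weight=0, gradient_accumulation_steps=1, "
    "hs_loss_fn=mse, hs_weight=2.0, learning_rate=0.0004, max"
)
params = parse_run_name(run)
print(params["hs_loss_fn"], params["learning_rate"])
```

Keeping values as strings avoids guessing at types: `None` here is the literal text from the directory name, not a Python `None`.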