arxiv:2410.04612
Jonathan Chang
jdchang
AI & ML interests
None yet
Organizations
models
95
jdchang/test_rm_8b
Feature Extraction
•
8B
•
Updated
•
4
jdchang/patch_14b
Text Generation
•
15B
•
Updated
•
1
jdchang/norm_test_400
Text Generation
•
15B
•
Updated
jdchang/norm_test_200
Text Generation
•
15B
•
Updated
jdchang/norm_test
Text Generation
•
15B
•
Updated
jdchang/bt-model-lr-7e-06-step-955
2B
•
Updated
•
1
jdchang/bt-model-lr-7e-06-step-954
2B
•
Updated
jdchang/bt-model-lr-3e-05-step-955
2B
•
Updated
jdchang/bt-model-lr-1e-05-step-955
2B
•
Updated
jdchang/bt-model-lr-3e-05-step-954
2B
•
Updated
datasets
60
jdchang/distill-llama70-n16-rollin-llama-t2s
Viewer
•
Updated
•
302k
•
3
jdchang/distill-qwen32-n16-rollin-llama-t2s
Viewer
•
Updated
•
302k
•
5
jdchang/distill-qwen14-n16-rollin-llama-t2s
Viewer
•
Updated
•
302k
•
4
jdchang/distill-qwen7-n16-rollin-llama-t2s
Viewer
•
Updated
•
302k
•
2
jdchang/distill-llama70-n16-rollin-t2s
Viewer
•
Updated
•
302k
•
5
jdchang/distill-qwen32-n16-rollin-t2s
Viewer
•
Updated
•
302k
•
3
jdchang/distill-qwen14-n16-rollin-t2s
Viewer
•
Updated
•
302k
•
10
jdchang/distill-qwen7-n16-rollin-t2s
Viewer
•
Updated
•
302k
•
4
jdchang/qsharp-bt-mixture
Viewer
•
Updated
•
27.2k
•
2
jdchang/qsharp-bt-32b
Viewer
•
Updated
•
31.9k
•
2