Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_16384_epoch_1 Text Generation • 4B • Updated 15 days ago • 29
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_8192_epoch_1 Text Generation • 4B • Updated 15 days ago • 19
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_50000_seq_4096_epoch_1 Text Generation • 4B • Updated 16 days ago • 29
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_16384_epoch_1 Text Generation • 4B • Updated 16 days ago • 23
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_4096_epoch_1 Text Generation • 4B • Updated 16 days ago • 25
Ujan/Qwen3-4B-Base_DeepMath-103K_samples_10000_seq_8192_epoch_1 Text Generation • 4B • Updated 16 days ago • 20
Ujan/lts_DeepMath-103K_samples_10000_seq_16384_Qwen3-30B-A3B-Thinking-2507_22_23_24_0.8 Viewer • Updated 10 days ago • 11k • 10
Ujan/lts_pruned_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_17_18_19_0.5 Viewer • Updated 10 days ago • 11k • 12
Ujan/lts_pruned_processed_DeepMath-103K_samples_50000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.5 Viewer • Updated 11 days ago • 51k • 15
Ujan/lts_pruned_processed_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.8 Viewer • Updated 11 days ago • 11k • 11
Ujan/lts_pruned_processed_DeepMath-103K_samples_10000_seq_16384_Qwen3-4B-Thinking-2507_sparsity_0.5 Viewer • Updated 12 days ago • 11k • 30