---
library_name: transformers
license: apache-2.0
datasets:
- meta-math/MetaMathQA
language:
- en
base_model:
- Qwen/Qwen3-4B
pipeline_tag: text-generation
---

# Model Card for a Qwen3-4B LoRA Fine-Tune on MetaMathQA


### Training Data

The model was fine-tuned on [meta-math/MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA).

#### Training Hyperparameters

- `batch_size`: 8
- `epochs`: 1
- `learning_rate`: 1e-4

LoRA:

- `r`: 16
- `lora_alpha`: 32
- `lora_dropout`: 0.05
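
As a rough illustration of what these LoRA settings imply, the sketch below computes the adapter scaling factor `lora_alpha / r` and the number of trainable parameters a rank-16 adapter adds to a single linear layer. The layer shape (`d_in`, `d_out`) is a hypothetical value chosen for illustration and is not taken from the Qwen3-4B configuration; this card also does not state which modules were adapted.

```python
# LoRA hyperparameters as listed in this card.
r = 16
lora_alpha = 32
lora_dropout = 0.05


def lora_scaling(alpha: int, rank: int) -> float:
    """Scaling applied to the low-rank update B @ A in standard LoRA."""
    return alpha / rank


def lora_param_count(d_in: int, d_out: int, rank: int) -> int:
    """Trainable parameters for one adapted linear layer.

    A has shape (rank, d_in) and B has shape (d_out, rank).
    """
    return rank * d_in + d_out * rank


# Hypothetical layer shape, for illustration only (an assumption).
d_in, d_out = 1024, 1024

scaling = lora_scaling(lora_alpha, r)
adapter_params = lora_param_count(d_in, d_out, r)
full_params = d_in * d_out

print(f"scaling = {scaling}")
print(f"adapter params = {adapter_params} of {full_params} in the full weight")
```

The names `r`, `lora_alpha`, and `lora_dropout` match the arguments of `peft.LoraConfig`, so the same values could be passed there directly if the PEFT library was used for training.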

#### Metrics

| Metric | Value |
| --- | --- |
| `train_runtime` (s) | 729.5559 |
| `train_samples_per_second` | 9.746 |
| `train_steps_per_second` | 0.306 |
| `total_flos` | 7.949170591137792e+16 |
| `train_loss` | 2.817356810976037 |
| `epoch` | 1.0 |
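
The throughput figures can be cross-checked against the listed batch size. The following sketch derives the approximate number of samples and optimizer steps from the reported metrics. How to interpret the resulting factor (gradient accumulation, multiple devices, or both) is an assumption, since this card does not state it.

```python
# Values copied from the reported training metrics.
metrics = {
    "train_runtime": 729.5559,            # seconds
    "train_samples_per_second": 9.746,
    "train_steps_per_second": 0.306,
}
batch_size = 8  # as listed under the training hyperparameters

# Approximate totals over the whole run.
total_samples = metrics["train_runtime"] * metrics["train_samples_per_second"]
total_steps = metrics["train_runtime"] * metrics["train_steps_per_second"]

# Samples consumed per optimizer step (the effective batch size).
samples_per_step = (
    metrics["train_samples_per_second"] / metrics["train_steps_per_second"]
)

# Ratio between the effective batch size and the listed batch size.
# Assumption: this reflects gradient accumulation and/or several devices.
implied_factor = samples_per_step / batch_size

print(f"samples processed ~ {total_samples:.0f}")
print(f"optimizer steps   ~ {total_steps:.0f}")
print(f"samples per step  ~ {samples_per_step:.1f}")
print(f"implied factor    ~ {implied_factor:.1f}")
```

With these numbers, each optimizer step covers about 32 samples, roughly four times the listed batch size of 8.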