Categorizer Model and Training Dataset
Collection
1 item
β’
Updated
Project page: https://huggingface.co/BenchHub. Code: https://github.com/rladmstn1714/BenchHub
BenchHub-Cat-7b is a category classification model based on Qwen2.5-7B, fine-tuned to assign natural language queries to structured category triplets: (subject, skill, target).
| Hyperparameter | Value |
|---|---|
| Sequence Length | 8192 |
| Learning Rate | 2 Γ 10β»β΅ |
| Batch Size (Effective) | 256 |
| Epochs | 3 |
| Scheduler | Cosine Decay |
| Warmup Ratio | 0.05 |
| Optimizer | Method from [19] |
| Trainer | DeepSpeed ZeRO-3 |
| Hardware | 4Γ A6000 48GB GPUs |
| Training Time | ~5 hours per run |
Input: Natural language question or instruction
Output: Triplet (subject, skill, target), such as:
{ "subject_type": "history",
"task_type": "reasoning",
"target_type": "korea"}
### Instruction:
Classify the following query into subject, skill, and target.
### Query:
How did Confucianism shape education in East Asia?
### Output:
{ "subject_type": "history",
"task_type": "reasoning",
"target_type": "korea"}
Apache 2.0