A newer version of this model is available: sthenno-com/miscii-14b-0218

miscii-14b-1028

Role-based Instructions

Just parse the following as your system prompt. Note there is NO special-tokens here.

An example system prompt:

system_prompt: str = (
    """<|context_start|>personas<|context_sep|>
<|persona_start|>user<|persona_sep|>
{user_persona}<|persona_end|>
<|persona_start|>assistant<|persona_sep|>
{assistant_persona}<|persona_end|><|context_end|>""".format(
        user_persona="""I am Miscii.
I am the designer of Sthenno.
[Optional: Additional statements]""",
        assistant_persona="""I am Sthenno.
I speak in Chinese.
[Optional: Additional statements]""",
    )
)

Training

See Report for miscii-1020 for more details.


Open LLM Leaderboard Evaluation Results

Metric Value
Avg. 35.05
IFEval (0-Shot) 82.37
BBH (3-Shot) 49.26
MATH Lvl 5 (4-Shot) 6.34
GPQA (0-shot) 14.21
MuSR (0-shot) 12.00
MMLU-PRO (5-shot) 46.14

Open LLM Leaderboard Evaluation Results

Refined:

Metric Value
Avg. 42.38
IFEval (0-Shot) 82.37
BBH (3-Shot) 49.26
MATH Lvl 5 (4-Shot) 50.30
GPQA (0-shot) 14.21
MuSR (0-shot) 12.00
MMLU-PRO (5-shot) 46.14

There’s nothing more to Show\large{\text{There's nothing more to Show}}

Downloads last month
5
Safetensors
Model size
15B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for sthenno-com/miscii-14b-1028

Base model

Qwen/Qwen2.5-14B
Finetuned
(210)
this model
Finetunes
2 models
Merges
5 models
Quantizations
6 models

Datasets used to train sthenno-com/miscii-14b-1028

Space using sthenno-com/miscii-14b-1028 1

Collection including sthenno-com/miscii-14b-1028

Evaluation results