UNAversal-2x7B-v1

This is merely Phase 1 of UNA, applied only to the MLPs, and it's kind of a beta. The goal was to produce a small but powerful MoE.

This is a 2-expert MoE model with 7B parameters per expert, based on the intel-neural series v3.
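A rough sketch of why two 7B experts land near the roughly 13B total reported for this model, rather than 14B. It assumes a Mistral-7B-style base (hidden size 4096, MLP size 14336, 32 layers, 8 KV heads, 32000-token vocabulary) and a Mixtral-style layout where only the MLP blocks are duplicated per expert while attention, embeddings and norms are shared. None of these dimensions are stated in this card, so treat them as assumptions.

```python
# Assumed Mistral-7B-style dimensions (not stated in this card).
HIDDEN = 4096
INTERMEDIATE = 14336
LAYERS = 32
KV_HEADS = 8
HEAD_DIM = 128
VOCAB = 32000


def param_count(num_experts: int) -> int:
    """Approximate parameter count for a model with `num_experts` MLP experts per layer."""
    kv_dim = KV_HEADS * HEAD_DIM
    # Attention: q and o projections are HIDDEN x HIDDEN, k and v are HIDDEN x kv_dim.
    attention = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * kv_dim
    # Gated MLP: gate, up and down projections.
    mlp = 3 * HIDDEN * INTERMEDIATE
    # A router is only needed when there is more than one expert.
    router = HIDDEN * num_experts if num_experts > 1 else 0
    # Two RMSNorm weight vectors per layer.
    norms = 2 * HIDDEN
    per_layer = attention + num_experts * mlp + router + norms
    # Input embeddings, output head, and the final norm.
    shared = 2 * VOCAB * HIDDEN + HIDDEN
    return LAYERS * per_layer + shared


dense = param_count(1)
moe = param_count(2)
print(f"dense: {dense / 1e9:.2f}B, 2-expert MoE: {moe / 1e9:.2f}B")
```

Under these assumptions the shared attention and embedding weights are counted once, which is why the total stays below twice the dense size.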

Tasks	Version	Filter	n-shot	Metric	Value		Stderr
arc_challenge	Yaml	none	25	acc	0.7133	±	0.0132
		none	25	acc_norm	0.7235	±	0.0131
arc_easy	Yaml	none	0	acc	0.8674	±	0.0070
		none	0	acc_norm	0.8291	±	0.0077
boolq	Yaml	none	0	acc	0.8768	±	0.0057
lambada_openai	Yaml	none	0	perplexity	3.6656	±	0.0841
		none	0	acc	0.7017	±	0.0064
mathqa	Yaml	none	0	acc	0.3474	±	0.0087
		none	0	acc_norm	0.3585	±	0.0088
piqa	Yaml	none	0	acc	0.8411	±	0.0085
		none	0	acc_norm	0.8526	±	0.0083
sciq	Yaml	none	0	acc	0.9600	±	0.0062
		none	0	acc_norm	0.9370	±	0.0077
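The Stderr column can be turned into approximate 95% confidence intervals, which helps when comparing these scores against another model. The snippet below is a minimal sketch that copies a few rows from the table above and uses the normal approximation (value ± 1.96 × stderr); the helper name `confidence_interval` is made up for illustration.

```python
# (task, metric, value, stderr) copied from the results table above.
RESULTS = [
    ("arc_challenge", "acc_norm", 0.7235, 0.0131),
    ("arc_easy", "acc", 0.8674, 0.0070),
    ("boolq", "acc", 0.8768, 0.0057),
    ("lambada_openai", "acc", 0.7017, 0.0064),
    ("mathqa", "acc_norm", 0.3585, 0.0088),
    ("piqa", "acc_norm", 0.8526, 0.0083),
    ("sciq", "acc", 0.9600, 0.0062),
]

Z_95 = 1.96  # two-sided 95% quantile of the standard normal distribution


def confidence_interval(value: float, stderr: float, z: float = Z_95) -> tuple[float, float]:
    """Return the (low, high) bounds of a normal-approximation interval."""
    half_width = z * stderr
    return value - half_width, value + half_width


for task, metric, value, stderr in RESULTS:
    low, high = confidence_interval(value, stderr)
    print(f"{task:15s} {metric:9s} {value:.4f}  [{low:.4f}, {high:.4f}]")
```

Two models whose intervals overlap heavily on a task should not be ranked on that task alone.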

Model size: 13B params
Tensor type: BF16
Format: Safetensors
