Text Generation
Safetensors
English
llama
shining-valiant
shining-valiant-2
valiant
valiant-labs
llama-3.1
llama-3.1-instruct
llama-3.1-instruct-8b
llama-3
llama-3-instruct
llama-3-instruct-8b
8b
science
physics
biology
chemistry
compsci
computer-science
engineering
technical
conversational
chat
instruct
Eval Results
Upload folder using huggingface_hub
#3
by
sequelbox
- opened
- README.md +18 -5
- config.json +1 -1
- generation_config.json +1 -1
- model-00001-of-00007.safetensors +1 -1
- model-00002-of-00007.safetensors +1 -1
- model-00003-of-00007.safetensors +1 -1
- model-00004-of-00007.safetensors +1 -1
- model-00005-of-00007.safetensors +1 -1
- model-00006-of-00007.safetensors +1 -1
- tokenizer.json +1 -6
README.md
CHANGED
|
@@ -15,10 +15,21 @@ tags:
|
|
| 15 |
- llama-3-instruct
|
| 16 |
- llama-3-instruct-8b
|
| 17 |
- 8b
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
- conversational
|
| 19 |
- chat
|
| 20 |
- instruct
|
| 21 |
base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
|
|
|
|
|
|
|
|
|
|
| 22 |
model_type: llama
|
| 23 |
license: llama3.1
|
| 24 |
---
|
|
@@ -29,14 +40,16 @@ license: llama3.1
|
|
| 29 |
|
| 30 |
Shining Valiant 2 is a chat model built on Llama 3.1 8b, finetuned on our data for friendship, insight, knowledge and enthusiasm.
|
| 31 |
- Finetuned on [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) for best available general performance
|
| 32 |
-
- Trained on
|
| 33 |
|
| 34 |
|
| 35 |
## Version
|
| 36 |
|
| 37 |
-
This is the **2024-
|
| 38 |
|
| 39 |
-
|
|
|
|
|
|
|
| 40 |
|
| 41 |
Help us and recommend Shining Valiant 2 to your friends!
|
| 42 |
|
|
@@ -73,9 +86,9 @@ print(outputs[0]["generated_text"][-1])
|
|
| 73 |
## The Model
|
| 74 |
Shining Valiant 2 is built on top of Llama 3.1 8b Instruct.
|
| 75 |
|
| 76 |
-
The current version of Shining Valiant 2 is trained
|
| 77 |
|
| 78 |
-
Our private data adds specialist knowledge and Shining Valiant's personality: she's friendly, enthusiastic, insightful, knowledgeable, and loves to learn! Magical.
|
| 79 |
|
| 80 |
|
| 81 |

|
|
|
|
| 15 |
- llama-3-instruct
|
| 16 |
- llama-3-instruct-8b
|
| 17 |
- 8b
|
| 18 |
+
- science
|
| 19 |
+
- physics
|
| 20 |
+
- biology
|
| 21 |
+
- chemistry
|
| 22 |
+
- compsci
|
| 23 |
+
- computer-science
|
| 24 |
+
- engineering
|
| 25 |
+
- technical
|
| 26 |
- conversational
|
| 27 |
- chat
|
| 28 |
- instruct
|
| 29 |
base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
|
| 30 |
+
datasets:
|
| 31 |
+
- sequelbox/Celestia
|
| 32 |
+
- sequelbox/Supernova
|
| 33 |
model_type: llama
|
| 34 |
license: llama3.1
|
| 35 |
---
|
|
|
|
| 40 |
|
| 41 |
Shining Valiant 2 is a chat model built on Llama 3.1 8b, finetuned on our data for friendship, insight, knowledge and enthusiasm.
|
| 42 |
- Finetuned on [meta-llama/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct) for best available general performance
|
| 43 |
+
- Trained on a variety of high quality data; focused on science, engineering, technical knowledge, and structured reasoning
|
| 44 |
|
| 45 |
|
| 46 |
## Version
|
| 47 |
|
| 48 |
+
This is the **2024-09-16** release of Shining Valiant 2 for Llama 3.1 8b.
|
| 49 |
|
| 50 |
+
We've improved and open-sourced our new baseline [science-instruct dataset](https://huggingface.co/datasets/sequelbox/Celestia). This release features improvements in physics, chemistry, biology, and computer science.
|
| 51 |
+
|
| 52 |
+
Future upgrades will continue to expand Shining Valiant's technical knowledge base.
|
| 53 |
|
| 54 |
Help us and recommend Shining Valiant 2 to your friends!
|
| 55 |
|
|
|
|
| 86 |
## The Model
|
| 87 |
Shining Valiant 2 is built on top of Llama 3.1 8b Instruct.
|
| 88 |
|
| 89 |
+
The current version of Shining Valiant 2 is trained on technical knowledge using [sequelbox/Celestia](https://huggingface.co/datasets/sequelbox/Celestia) and general chat capability using [sequelbox/Supernova.](https://huggingface.co/datasets/sequelbox/Supernova)
|
| 90 |
|
| 91 |
+
Our private data adds specialist knowledge and Shining Valiant's personality: she's friendly, enthusiastic, insightful, knowledgeable, and loves to learn! Magical. (As a general note: we're hoping to replace and open-source this part of Shining Valiant's dataset with synthetic data soon!)
|
| 92 |
|
| 93 |
|
| 94 |

|
config.json
CHANGED
|
@@ -33,7 +33,7 @@
|
|
| 33 |
"rope_theta": 500000.0,
|
| 34 |
"tie_word_embeddings": false,
|
| 35 |
"torch_dtype": "float32",
|
| 36 |
-
"transformers_version": "4.
|
| 37 |
"use_cache": true,
|
| 38 |
"vocab_size": 128256
|
| 39 |
}
|
|
|
|
| 33 |
"rope_theta": 500000.0,
|
| 34 |
"tie_word_embeddings": false,
|
| 35 |
"torch_dtype": "float32",
|
| 36 |
+
"transformers_version": "4.44.2",
|
| 37 |
"use_cache": true,
|
| 38 |
"vocab_size": 128256
|
| 39 |
}
|
generation_config.json
CHANGED
|
@@ -8,5 +8,5 @@
|
|
| 8 |
],
|
| 9 |
"temperature": 0.6,
|
| 10 |
"top_p": 0.9,
|
| 11 |
-
"transformers_version": "4.
|
| 12 |
}
|
|
|
|
| 8 |
],
|
| 9 |
"temperature": 0.6,
|
| 10 |
"top_p": 0.9,
|
| 11 |
+
"transformers_version": "4.44.2"
|
| 12 |
}
|
model-00001-of-00007.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 4886466168
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dcebe7b4eacb57cbc4e03e60f0d4e1eec8a1471455a3fdbc953edfaca5c8763e
|
| 3 |
size 4886466168
|
model-00002-of-00007.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 4832007448
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:756b38e9412a00dc12d14823d48c9a71732a1c0318fd9bb48661e9589ddb9ac1
|
| 3 |
size 4832007448
|
model-00003-of-00007.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 4999813112
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4d3ff8801d13032241f11b23af8bf458181a87b41b3e6497cf7cc503a0469ce6
|
| 3 |
size 4999813112
|
model-00004-of-00007.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 4999813128
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:35ee4a044f0e1c92ba26c63b584ac344740d70fff1f3d86d073810bc8e610d66
|
| 3 |
size 4999813128
|
model-00005-of-00007.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 4832007496
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7b6123fecf735935528930e989780254f5bd5eb78b872cda5677f04479d09c25
|
| 3 |
size 4832007496
|
model-00006-of-00007.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 4999813120
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:895b3445cc9cb423b5c8b67c289eecd411f860ea3d7255857beb8fcb8e990621
|
| 3 |
size 4999813120
|
tokenizer.json
CHANGED
|
@@ -1,11 +1,6 @@
|
|
| 1 |
{
|
| 2 |
"version": "1.0",
|
| 3 |
-
"truncation":
|
| 4 |
-
"direction": "Right",
|
| 5 |
-
"max_length": 6900,
|
| 6 |
-
"strategy": "LongestFirst",
|
| 7 |
-
"stride": 0
|
| 8 |
-
},
|
| 9 |
"padding": null,
|
| 10 |
"added_tokens": [
|
| 11 |
{
|
|
|
|
| 1 |
{
|
| 2 |
"version": "1.0",
|
| 3 |
+
"truncation": null,
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
"padding": null,
|
| 5 |
"added_tokens": [
|
| 6 |
{
|