christian-bick committed · Commit 21cbbc1 (verified) · 1 Parent(s): 245eeb9

Upload folder using huggingface_hub

README.md ADDED
@@ -0,0 +1,74 @@
1
+ ---
2
+ license: agpl-3.0
3
+ library_name: pytorch
4
+ base_model:
5
+ - Qwen/Qwen3-VL-4B-Instruct
6
+ tags:
7
+ - rgcn
8
+ - embedding
9
+ - onnx
10
+ ---
11
+
12
+ # EduGraph Embed
13
+
14
+ This model generates embeddings for labels from the
15
+ [EduGraph Ontology](https://github.com/christian-bick/edugraph-ontology).
16
+
17
+ When combined with an [EduGraph Classification Model](https://github.com/christian-bick/edugraph-classify-qwen3vl),
18
+ we can determine similarity between any type of learning content covered by the EduGraph ontology.
19
+ For example, in tandem, the two models can determine whether some content of a math learning app
20
+ trains exactly the same set of skills that a paper quiz tests, given nothing more than a screenshot
21
+ and a photo.
22
+
23
+ ## How it works
24
+
25
+ The model determines similarity based on the *structure* of the EduGraph Ontology. It respects
26
+ several types of entity relationships, most importantly parent-child and sibling
27
+ relationships within the graph, in addition to the semantic similarity of the entity definitions.
28
+
29
+ For example, the model will reliably place labels like `IntegerAddition` and `FractionAddition`
30
+ closer to each other than to, say, `ShapeIdentification`.
31
+
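As a rough illustration of what "closer together" means in an embedding space, the sketch below compares three labels by cosine similarity. The vectors are made up for illustration and are not output of this model, and cosine similarity is an assumption here: this README does not state which distance measure is intended. The helper names `cosine_similarity` and `most_similar` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Made-up 4-dimensional vectors, for illustration only. Real embeddings
# come from the model and will have a different dimensionality.
toy_embeddings = {
    "IntegerAddition": [0.9, 0.8, 0.1, 0.0],
    "FractionAddition": [0.8, 0.9, 0.2, 0.1],
    "ShapeIdentification": [0.1, 0.0, 0.9, 0.8],
}


def most_similar(label, embeddings):
    """Return the other label whose vector is closest to `label`."""
    query = embeddings[label]
    others = (name for name in embeddings if name != label)
    return max(others, key=lambda name: cosine_similarity(query, embeddings[name]))


print(most_similar("IntegerAddition", toy_embeddings))
```

With these toy vectors, `FractionAddition` is the nearest neighbour of `IntegerAddition`, which mirrors the behaviour described above.
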
32
+ To accomplish this, the model generates knowledge graph embeddings that
33
+ map the ontology structure into a high-dimensional vector space using a
34
+ [Relational Graph Convolutional Network (R-GCN)](https://arxiv.org/abs/1703.06103).
35
+
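For orientation, the layer-wise propagation rule from the linked R-GCN paper is reproduced below. Whether this model uses exactly this form (for example with or without basis decomposition, and with which normalization) is not stated in this README, so treat it as background rather than a description of the implementation.

$$
h_i^{(l+1)} = \sigma\left( W_0^{(l)} h_i^{(l)} + \sum_{r \in \mathcal{R}} \sum_{j \in \mathcal{N}_i^r} \frac{1}{c_{i,r}} W_r^{(l)} h_j^{(l)} \right)
$$

Here $h_i^{(l)}$ is the representation of entity $i$ at layer $l$, $\mathcal{R}$ is the set of relation types, $\mathcal{N}_i^r$ is the set of neighbours of $i$ under relation $r$, $c_{i,r}$ is a normalization constant such as $|\mathcal{N}_i^r|$, and $W_r^{(l)}$ and $W_0^{(l)}$ are the relation-specific and self-connection weights. Because each relation type has its own weight matrix, relationships such as parent-child and sibling can contribute differently to an entity's embedding.
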
36
+ ## Limitations
37
+
38
+ This model is centered around the EduGraph ontology. The embedding model was trained
39
+ on the entities and relationships in this ontology. Consequently, it can only embed
40
+ labels that are defined as entities within this ontology.
41
+
42
+ ## Risks
43
+
44
+ **Important:** This model is currently at a research stage and has not been evaluated under real-world conditions.
45
+
46
+ * **ONLY use this model for research, experimentation and evaluation**
47
+ * **Do NOT use in a classroom environment**
48
+ * **Do NOT use for automations that might impact children**
49
+
50
+ ## Using the Model
51
+
52
+ ### Preparation
53
+
54
+ 1) Download the following files:
55
+
56
+ - `embed_entities_biased.onnx`
57
+ - `embed_entities.pt`
58
+
59
+ 2) Install the following dependencies:
60
+
61
+ - `torch`
62
+ - `numpy`
63
+ - `onnxruntime`
64
+
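After downloading, it may be worth checking the files against the Git LFS pointers recorded in this commit. The sketch below is one way to do that with the standard library only. The SHA-256 digests and sizes are copied from the pointers for `embed_entities.pt` and `embed_entities_biased.onnx`; the function names and the assumption that the files sit in the current directory are hypothetical.

```python
import hashlib
from pathlib import Path

# (sha256, size in bytes) taken from the Git LFS pointers in this commit.
EXPECTED = {
    "embed_entities.pt": (
        "55b10aa169320b084cfeb83439071a456b1d923681a9a1a5bf3f1ea5ea200fec",
        516195,
    ),
    "embed_entities_biased.onnx": (
        "29a346b61acc03373bf9680f7882ab70bac2440db633cf8577534413e3f930e0",
        2471425,
    ),
}


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_sha256, expected_size):
    """Return True if the file exists and matches the expected size and digest."""
    path = Path(path)
    if not path.is_file():
        return False
    if path.stat().st_size != expected_size:
        return False
    return sha256_of(path) == expected_sha256


def verify_all(directory="."):
    """Check every expected file in `directory`; maps file name to True/False."""
    return {
        name: verify_download(Path(directory) / name, sha, size)
        for name, (sha, size) in EXPECTED.items()
    }


if __name__ == "__main__":
    for name, ok in verify_all().items():
        print(f"{name}: {'ok' if ok else 'missing or mismatched'}")
```

A `False` result means the file is missing, truncated, or different from the version uploaded in this commit.
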
65
+ ### Reference Example
66
+
67
+ See [entity_embeddings_infer.py](https://github.com/christian-bick/edugraph-embed/blob/master/src/edugraph/embed/entity_embeddings_infer.py)
68
+ for reference usage.
69
+
70
+ ## License
71
+
72
+ This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0). See the [LICENSE](LICENSE) file for details.
73
+
74
+ If these license terms do not work for you, contact us to discuss alternative options.
embed_entities.pt ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:55b10aa169320b084cfeb83439071a456b1d923681a9a1a5bf3f1ea5ea200fec
3
+ size 516195
embed_entities_biased.onnx ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:29a346b61acc03373bf9680f7882ab70bac2440db633cf8577534413e3f930e0
3
+ size 2471425
embed_entities_neutral.onnx ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8cb74338767dacf2f11f46bddbf6cd22106e83312140f2bc5262bc7c5aa1a668
3
+ size 2470723
embed_entities_text.pt ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:293d209283ac9e0faf4a5f846c566071bd4ff12af1162d7c8bd52120b3bb1848
3
+ size 516373