Commit 8d8f794 by Clover-Hill · verified · 1 Parent(s): c587aea

Update README.md

  1. README.md +61 -3
README.md CHANGED
@@ -1,3 +1,61 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ language:
+ - en
+ base_model:
+ - Qwen/Qwen2.5-0.5B
+ ---
+
+ ## Model Description
+
+ This Memory Decoder model is trained on the Law domain and can be adapted to enhance any model in the Qwen2 and Qwen2.5 families.
+
+ **Paper:** [Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models](https://www.arxiv.org/abs/2508.09874)
+
+ **GitHub:** [https://github.com/LUMIA-Group/MemoryDecoder](https://github.com/LUMIA-Group/MemoryDecoder/tree/main)
+
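Reviewer note: the added description says the Memory Decoder can enhance any Qwen2 or Qwen2.5 model but does not show how the two are combined. The sketch below illustrates one plausible scheme, assuming both models score the same vocabulary and their next-token distributions are mixed linearly. The mixing weight `alpha`, the function names, and the toy logits are all hypothetical and are not taken from this README or the linked repository.

```python
import math


def softmax(logits):
    """Convert raw logits to a probability distribution (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def interpolate(base_logits, memdec_logits, alpha=0.5):
    """Mix two next-token distributions.

    ASSUMPTION: linear interpolation with weight `alpha` on the memory
    decoder; this is an illustrative choice, not a confirmed API.
    """
    if len(base_logits) != len(memdec_logits):
        raise ValueError("both models must score the same vocabulary")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    p_base = softmax(base_logits)
    p_mem = softmax(memdec_logits)
    return [alpha * pm + (1.0 - alpha) * pb for pb, pm in zip(p_base, p_mem)]


# Toy 4-token vocabulary with made-up logits for a single decoding step.
base_logits = [2.0, 1.0, 0.5, 0.1]
memdec_logits = [0.1, 3.0, 0.2, 0.1]
mixed = interpolate(base_logits, memdec_logits, alpha=0.6)
```

Because each input is a valid distribution and the weights sum to one, `mixed` is also a valid distribution. In practice the logits would come from the base model and the Memory Decoder evaluated on the same token prefix.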
+ ## Training & Evaluation Data
+
+ **Law Domain Dataset:** [AsyLex](https://huggingface.co/datasets/clairebarale/AsyLex)
+
+ **Test Split:** [MemoryDecoder-domain-data](https://huggingface.co/datasets/Clover-Hill/MemoryDecoder-domain-data)
+
+ ## Performance Results
+
+ ### Qwen2 Family
+
+ | Model | Base Model | Base + MemDec |
+ |-------|------------|---------------|
+ | Qwen2-0.5B | 10.23 | 4.57 |
+ | Qwen2-1.5B | 7.69 | 4.32 |
+ | Qwen2-7B | 5.92 | 4.00 |
+ | Qwen2-72B | 4.84 | 3.69 |
+
+ ### Qwen2.5 Family
+
+ | Model | Base Model | Base + MemDec |
+ |-------|------------|---------------|
+ | Qwen2.5-0.5B | 9.86 | 4.57 |
+ | Qwen2.5-1.5B | 7.42 | 4.29 |
+ | Qwen2.5-3B | 6.68 | 4.16 |
+ | Qwen2.5-7B | 5.94 | 4.01 |
+ | Qwen2.5-14B | 5.35 | 3.86 |
+ | Qwen2.5-32B | 5.18 | 3.81 |
+ | Qwen2.5-72B | 4.84 | 3.70 |
+
+ *Perplexity scores on Law domain test set. Lower is better.*
+
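Reviewer note: the tables report absolute perplexities only, so the size of the gain per model is easy to miss. A small sketch that computes the relative reduction from the numbers in the tables above; the helper name `relative_reduction` is illustrative and not part of the README.

```python
# (base perplexity, base + MemDec perplexity), copied from the two tables.
results = {
    "Qwen2-0.5B": (10.23, 4.57),
    "Qwen2-1.5B": (7.69, 4.32),
    "Qwen2-7B": (5.92, 4.00),
    "Qwen2-72B": (4.84, 3.69),
    "Qwen2.5-0.5B": (9.86, 4.57),
    "Qwen2.5-1.5B": (7.42, 4.29),
    "Qwen2.5-3B": (6.68, 4.16),
    "Qwen2.5-7B": (5.94, 4.01),
    "Qwen2.5-14B": (5.35, 3.86),
    "Qwen2.5-32B": (5.18, 3.81),
    "Qwen2.5-72B": (4.84, 3.70),
}


def relative_reduction(base, with_memdec):
    """Fraction by which perplexity drops when the Memory Decoder is added."""
    return (base - with_memdec) / base


for name, (base, memdec) in results.items():
    drop = relative_reduction(base, memdec)
    print(f"{name:>13}: {base:5.2f} -> {memdec:4.2f} ({drop:.1%} lower)")
```

The reduction shrinks as the base model grows: roughly 55% for Qwen2-0.5B versus roughly 24% for the 72B models.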
+ ## Citation
+
+ ```bibtex
+ @article{cao2025memory,
+ title={Memory decoder: A pretrained, plug-and-play memory for large language models},
+ author={Cao, Jiaqi and Wang, Jiarui and Wei, Rubin and Guo, Qipeng and Chen, Kai and Zhou, Bowen and Lin, Zhouhan},
+ journal={arXiv preprint arXiv:2508.09874},
+ year={2025}
+ }
+ ```
+
+ ## Contact
+
+ For questions and support: [email protected]