Update README.md
README.md
CHANGED
@@ -8,6 +8,8 @@ base_model:
 - ielabgroup/bert-base-uncased-fineweb100bt-smae
 ---
 
+Model used in [Starbucks: Improved Training for 2D Matryoshka Embeddings](https://arxiv.org/pdf/2410.13230)
+
 This model is a bert-base-uncased size model initialized with [ielabgroup/bert-base-uncased-fineweb100bt-smae](https://huggingface.co/ielabgroup/bert-base-uncased-fineweb100bt-smae) and fine-tuned on the MS MARCO dataset with the Starbucks Representation Learning (SRL) method. SRL enables elastic layer-dimension embedding generation for search.
 
 The following layer-dimension pairs are involved during fine-tuning: [(2, 32), (4, 64), (6, 128), (8, 256), (10, 512), (12, 768)]
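As a rough illustration of what a layer-dimension pair means in practice, the sketch below picks one of the listed pairs and cuts an embedding down to it. It is not taken from the model card: the helper name `elastic_embedding`, the choice of the [CLS] vector, and the "keep the first `dim` values" truncation are assumptions made for this example. The input is assumed to follow the Hugging Face `output_hidden_states=True` convention, where index 0 is the embedding-layer output and index n is the output of Transformer layer n.

```python
from typing import List, Sequence

# Layer-dimension pairs listed in the README.
SRL_PAIRS = [(2, 32), (4, 64), (6, 128), (8, 256), (10, 512), (12, 768)]


def elastic_embedding(
    cls_per_layer: Sequence[Sequence[float]], layer: int, dim: int
) -> List[float]:
    """Return the first `dim` values of the [CLS] vector after `layer` layers.

    Hypothetical helper: `cls_per_layer[0]` is assumed to be the embedding-layer
    output and `cls_per_layer[n]` the output of Transformer layer n.
    """
    if (layer, dim) not in SRL_PAIRS:
        raise ValueError(f"({layer}, {dim}) is not one of the fine-tuned pairs")
    # Assumption: a smaller embedding is the prefix of the full vector.
    return list(cls_per_layer[layer][:dim])


# Toy stand-in for real model output: 13 vectors (embeddings + 12 layers), 768 values each.
toy_cls = [[float(n)] * 768 for n in range(13)]
small = elastic_embedding(toy_cls, layer=2, dim=32)
print(len(small))  # prints 32
```

Restricting the helper to the listed pairs is a deliberate choice for the sketch: only those combinations are stated to be involved during fine-tuning, so other layer and dimension combinations are rejected rather than silently truncated.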