Commit
·
1a21b22
1
Parent(s):
6d68b9d
add GGUF quants
Browse files- .gitattributes +1 -0
- README.md +72 -0
- nomic-embed-text-v1.5.Q2_K.gguf +3 -0
- nomic-embed-text-v1.5.Q3_K_L.gguf +3 -0
- nomic-embed-text-v1.5.Q3_K_M.gguf +3 -0
- nomic-embed-text-v1.5.Q3_K_S.gguf +3 -0
- nomic-embed-text-v1.5.Q4_0.gguf +3 -0
- nomic-embed-text-v1.5.Q4_K_M.gguf +3 -0
- nomic-embed-text-v1.5.Q4_K_S.gguf +3 -0
- nomic-embed-text-v1.5.Q5_0.gguf +3 -0
- nomic-embed-text-v1.5.Q5_K_M.gguf +3 -0
- nomic-embed-text-v1.5.Q5_K_S.gguf +3 -0
- nomic-embed-text-v1.5.Q6_K.gguf +3 -0
- nomic-embed-text-v1.5.Q8_0.gguf +3 -0
- nomic-embed-text-v1.5.f16.gguf +3 -0
- nomic-embed-text-v1.5.f32.gguf +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
*.gguf filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
|
@@ -1,3 +1,75 @@
|
|
| 1 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
license: apache-2.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
+
base_model: nomic-ai/nomic-embed-text-v1.5
|
| 3 |
+
inference: false
|
| 4 |
+
language:
|
| 5 |
+
- en
|
| 6 |
license: apache-2.0
|
| 7 |
+
model_creator: Nomic
|
| 8 |
+
model_name: nomic-embed-text-v1.5
|
| 9 |
+
model_type: bert
|
| 10 |
+
pipeline_tag: sentence-similarity
|
| 11 |
+
quantized_by: Nomic
|
| 12 |
+
tags:
|
| 13 |
+
- feature-extraction
|
| 14 |
+
- sentence-similarity
|
| 15 |
---
|
| 16 |
+
|
| 17 |
+
# nomic-embed-text-v1.5 - GGUF
|
| 18 |
+
|
| 19 |
+
Original model: [nomic-embed-text-v1.5](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5)
|
| 20 |
+
|
| 21 |
+
|
| 22 |
+
## Description
|
| 23 |
+
|
| 24 |
+
This repo contains llama.cpp-compatible files for [nomic-embed-text-v1.5](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5) in GGUF format.
|
| 25 |
+
|
| 26 |
+
llama.cpp will default to 2048 tokens of context with these files. To use the full 8192 tokens that Nomic Embed is benchmarked on, you will have to choose a context extension method. The original model uses Dynamic NTK-Aware RoPE scaling, but that is not currently available in llama.cpp. A combination of YaRN and linear scaling is an acceptable substitute.
|
| 27 |
+
|
| 28 |
+
These files were converted and quantized with llama.cpp commit [594fca3fe](https://github.com/ggerganov/llama.cpp/commit/594fca3fefe27b8e95cfb1656eb0e160ad15a793).
|
| 29 |
+
|
| 30 |
+
## Example `llama.cpp` Command
|
| 31 |
+
|
| 32 |
+
Compute a single embedding:
|
| 33 |
+
```shell
|
| 34 |
+
./embedding -ngl 99 -m nomic-embed-text-v1.5.f16.gguf -c 8192 -b 8192 --rope-scaling yarn --rope-freq-scale .75 -p 'search_query: What is TSNE?'
|
| 35 |
+
```
|
| 36 |
+
|
| 37 |
+
You can also submit a batch of texts to embed, as long as the total number of tokens does not exceed the context length. Only the first three embeddings are shown by the `embedding` example.
|
| 38 |
+
|
| 39 |
+
texts.txt:
|
| 40 |
+
```
|
| 41 |
+
search_query: What is TSNE?
|
| 42 |
+
search_query: Who is Laurens Van der Maaten?
|
| 43 |
+
```
|
| 44 |
+
|
| 45 |
+
Compute multiple embeddings:
|
| 46 |
+
```shell
|
| 47 |
+
./embedding -ngl 99 -m nomic-embed-text-v1.5.f16.gguf -c 8192 -b 8192 --rope-scaling yarn --rope-freq-scale .75 -f texts.txt
|
| 48 |
+
```
|
| 49 |
+
|
| 50 |
+
|
| 51 |
+
## Compatibility
|
| 52 |
+
|
| 53 |
+
These files are compatible with llama.cpp as commit [ea9c8e114](https://github.com/ggerganov/llama.cpp/commit/ea9c8e11436ad50719987fa23a289c74b7b40d40) from 2/13/2024.
|
| 54 |
+
|
| 55 |
+
|
| 56 |
+
## Provided Files
|
| 57 |
+
|
| 58 |
+
The below table shows the mean squared error of the embeddings produced by these quantizations of Nomic Embed relative to the Sentence Transformers implementation.
|
| 59 |
+
|
| 60 |
+
Name | Quant | Size | MSE
|
| 61 |
+
-----|-------|------|-----
|
| 62 |
+
[nomic-embed-text-v1.5.Q2\_K.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q2_K.gguf) | Q2\_K | 48 MiB | 2.33e-03
|
| 63 |
+
[nomic-embed-text-v1.5.Q3\_K\_S.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q3_K_S.gguf) | Q3\_K\_S | 57 MiB | 1.19e-03
|
| 64 |
+
[nomic-embed-text-v1.5.Q3\_K\_M.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q3_K_M.gguf) | Q3\_K\_M | 65 MiB | 8.26e-04
|
| 65 |
+
[nomic-embed-text-v1.5.Q3\_K\_L.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q3_K_L.gguf) | Q3\_K\_L | 69 MiB | 7.93e-04
|
| 66 |
+
[nomic-embed-text-v1.5.Q4\_0.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q4_0.gguf) | Q4\_0 | 75 MiB | 6.32e-04
|
| 67 |
+
[nomic-embed-text-v1.5.Q4\_K\_S.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q4_K_S.gguf) | Q4\_K\_S | 75 MiB | 6.71e-04
|
| 68 |
+
[nomic-embed-text-v1.5.Q4\_K\_M.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q4_K_M.gguf) | Q4\_K\_M | 81 MiB | 2.42e-04
|
| 69 |
+
[nomic-embed-text-v1.5.Q5\_0.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q5_0.gguf) | Q5\_0 | 91 MiB | 2.35e-04
|
| 70 |
+
[nomic-embed-text-v1.5.Q5\_K\_S.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q5_K_S.gguf) | Q5\_K\_S | 91 MiB | 2.00e-04
|
| 71 |
+
[nomic-embed-text-v1.5.Q5\_K\_M.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q5_K_M.gguf) | Q5\_K\_M | 95 MiB | 6.55e-05
|
| 72 |
+
[nomic-embed-text-v1.5.Q6\_K.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q6_K.gguf) | Q6\_K | 108 MiB | 5.58e-05
|
| 73 |
+
[nomic-embed-text-v1.5.Q8\_0.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.Q8_0.gguf) | Q8\_0 | 140 MiB | 5.79e-06
|
| 74 |
+
[nomic-embed-text-v1.5.f16.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.f16.gguf) | F16 | 262 MiB | 4.21e-10
|
| 75 |
+
[nomic-embed-text-v1.5.f32.gguf](https://huggingface.co/nomic-ai/nomic-embed-text-v1.5-GGUF/blob/main/nomic-embed-text-v1.5.f32.gguf) | F32 | 262 MiB | 6.08e-11
|
nomic-embed-text-v1.5.Q2_K.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:758227679b93e653eb712c442a16fcdf7d65b15a6eefd6ce8053edd7f42d82a4
|
| 3 |
+
size 49361088
|
nomic-embed-text-v1.5.Q3_K_L.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:139edcebb55846c0c7685b57e25c9758c67f9749a1276a0e886031b9a5bc63ff
|
| 3 |
+
size 71593088
|
nomic-embed-text-v1.5.Q3_K_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7140b9170efe9912ec4be3f19d4492050f7be8dc9492ac99a1892bcadca30e9c
|
| 3 |
+
size 67169408
|
nomic-embed-text-v1.5.Q3_K_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5b13eb7c4ed0ec144435f2a47f49acc04ce79c3081b5bc72f5a717da0905c7a9
|
| 3 |
+
size 59649152
|
nomic-embed-text-v1.5.Q4_0.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:01e3f351b0c697201d6901d38b29c9e0affa06c65b040359546c822198aa8f64
|
| 3 |
+
size 77802880
|
nomic-embed-text-v1.5.Q4_K_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0e3b6f681b81772c5d29b3b4f6b01c0e71ee64febba159ae182d3a238ccb63fb
|
| 3 |
+
size 84106624
|
nomic-embed-text-v1.5.Q4_K_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1acc092631aeb6fd52009c3d2b494fbee5c474ae6275e22bebb7c7d032a71b15
|
| 3 |
+
size 78097792
|
nomic-embed-text-v1.5.Q5_0.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ca07f2eba2354436bb0be187c6df15a1e754975f1f10f299c6513914d95a0b71
|
| 3 |
+
size 94888768
|
nomic-embed-text-v1.5.Q5_K_M.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5b4795c9f018653bea1626d8f95cf09d0ee0fabcbc14dfdca3f7f9320b352bd3
|
| 3 |
+
size 99588928
|
nomic-embed-text-v1.5.Q5_K_S.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c2c341f67341b9093a409710fb578b3f972f3c18ad2c8a0303cacf99529394cd
|
| 3 |
+
size 94888768
|
nomic-embed-text-v1.5.Q6_K.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e6a20b714d35f2e2f9a1e418f08826ac9303ac998961c26c8634196658a13867
|
| 3 |
+
size 113042528
|
nomic-embed-text-v1.5.Q8_0.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:880522894afda99389f2c979aed97c085d0f905eba61ef196307f95245754114
|
| 3 |
+
size 146146432
|
nomic-embed-text-v1.5.f16.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:e4060a62a157f92458ac71dc610a8909a9ad7ba1f1b4aaa52f121e207c1f9c63
|
| 3 |
+
size 274290560
|
nomic-embed-text-v1.5.f32.gguf
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ab33b0bd5d1d3e993c37f9f580c8a17bbb17309997096c5d88308da2c6a4fa63
|
| 3 |
+
size 547664768
|