Update README.md
README.md
If you want to use flash attention or increase the sequence length, please check the following code:

```python
from gliner import GLiNER
import torch

model = GLiNER.from_pretrained("knowledgator/gliner-llama-1B-v1.0",
                               _attn_implementation='flash_attention_2',
                               max_len=2048).to('cuda:0', dtype=torch.float16)
```
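The `dtype=torch.float16` argument matters here: the FlashAttention-2 kernels only run in half precision (fp16/bf16) on a CUDA GPU, so loading the model in full precision would make `_attn_implementation='flash_attention_2'` fail. A minimal sketch of choosing a compatible dtype before loading (the `pick_dtype` helper is an illustration, not part of GLiNER):

```python
import torch

def pick_dtype() -> torch.dtype:
    # Hypothetical helper: pick a dtype the current device supports.
    if not torch.cuda.is_available():
        # No GPU: flash attention is unavailable, fall back to full precision.
        return torch.float32
    major, _minor = torch.cuda.get_device_capability()
    # bfloat16 needs Ampere (compute capability 8.x) or newer.
    return torch.bfloat16 if major >= 8 else torch.float16
```

The result can then be passed as the `dtype` in the `.to(...)` call above instead of hard-coding `torch.float16`.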
### Benchmarks
Below is a table with benchmarking results on various named entity recognition datasets: