Update README.md
SEC-BERT is a family of BERT models for the financial domain, intended to assist financial NLP research and FinTech applications.
SEC-BERT consists of the following models:
* **SEC-BERT-BASE** (this model): Same architecture as BERT-BASE, trained on financial documents.
* [**SEC-BERT-NUM**](https://huggingface.co/nlpaueb/sec-bert-num): Same as SEC-BERT-BASE, but we replace every number token with a [NUM] pseudo-token, handling all numeric expressions in a uniform manner and preventing their fragmentation.
* [**SEC-BERT-SHAPE**](https://huggingface.co/nlpaueb/sec-bert-shape): Same as SEC-BERT-BASE, but we replace numbers with pseudo-tokens that represent the number's shape, so numeric expressions (of known shapes) are no longer fragmented, e.g., '53.2' becomes '[XX.X]' and '40,200.5' becomes '[XX,XXX.X]'.
</div>
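The [NUM] and shape replacements described above can be sketched as a simple text preprocessing step. This is a minimal illustration only, not the authors' actual pipeline: the function names and the regular expression used to detect numeric expressions are assumptions.

```python
import re

# Assumed pattern for a numeric expression: digits, optionally with
# ',' or '.' separators, starting and ending with a digit
# (e.g. '7', '53.2', '40,200.5').
NUM_RE = re.compile(r"\d[\d,.]*\d|\d")

def num_preprocess(text: str) -> str:
    """SEC-BERT-NUM style: replace every numeric expression with '[NUM]'."""
    return NUM_RE.sub("[NUM]", text)

def shape_preprocess(text: str) -> str:
    """SEC-BERT-SHAPE style: replace every numeric expression with its
    shape pseudo-token, mapping each digit to 'X' and keeping the
    separators, e.g. '53.2' -> '[XX.X]', '40,200.5' -> '[XX,XXX.X]'."""
    return NUM_RE.sub(lambda m: "[" + re.sub(r"\d", "X", m.group()) + "]", text)
```

For example, `shape_preprocess("revenue rose 53.2 percent")` yields `"revenue rose [XX.X] percent"`, so the whole number maps to a single pseudo-token instead of being fragmented by the subword tokenizer.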
## Pre-training corpus