ltgoslo committed on
Commit fcf84ad · verified · 1 Parent(s): 9b48adf

Update README.md

Files changed (1)
  1. README.md +7 -2
README.md CHANGED
@@ -8,9 +8,14 @@ pinned: false
 ---
 https://hplt-project.org/
 
-Our project name, HPLT, is an acronym for High Performance Language Technologies. We are aiming high at combining large quantities of data, a number of languages and high-performance computing to build powerful and efficient language and translation models. Another goal of HPLT is to publish the results of this project in a shared space with open licenses.
+Our project name, HPLT, is an acronym for High Performance Language Technologies.
+We combine large quantities of data, a number of languages and high-performance computing to build powerful and efficient datasets for language and translation models.
+Another goal of HPLT is to publish the results of this project in a shared space with open licenses.
 
-- [HPLT datasets paper](https://arxiv.org/abs/2503.10267)
+- Version 3 of the HPLT datasets (198 languages):
+- https://hplt-project.org/datasets/v3.0
+- https://hf.co/datasets/HPLT/HPLT3.0
+- [HPLT datasets ACL'2025 paper](https://aclanthology.org/2025.acl-long.854/)
 - Version 2 of the HPLT datasets (193 languages):
 - https://hplt-project.org/datasets/v2.0
 - https://hf.co/datasets/HPLT/HPLT2.0_cleaned