Update README.md
README.md
@@ -8,9 +8,14 @@ pinned: false
 ---
 https://hplt-project.org/
 
-Our project name, HPLT, is an acronym for High Performance Language Technologies.
+Our project name, HPLT, is an acronym for High Performance Language Technologies.
+We combine large quantities of data, a number of languages and high-performance computing to build powerful and efficient datasets for language and translation models.
+Another goal of HPLT is to publish the results of this project in a shared space with open licenses.
 
--
+- Version 3 of the HPLT datasets (198 languages):
+- https://hplt-project.org/datasets/v3.0
+- https://hf.co/datasets/HPLT/HPLT3.0
+- [HPLT datasets ACL'2025 paper](https://aclanthology.org/2025.acl-long.854/)
 - Version 2 of the HPLT datasets (193 languages):
 - https://hplt-project.org/datasets/v2.0
 - https://hf.co/datasets/HPLT/HPLT2.0_cleaned