Synthetic data derived from finepdfs
MultiSynt
community
Organization Card
MultiSynt is a collaborative initiative between OpenEuroLLM and EuroLLM focused on developing high-quality multilingual synthetic datasets for language model pretraining. By combining expertise from both organizations, MultiSynt aims to advance synthetic training data that covers diverse European languages, enabling more inclusive AI development.
models (21)
MultiSynt/nemotron-cc-danish-tower9b • 53
MultiSynt/nemotron-cc-danish-opus • 113
MultiSynt/nemotron-cc-portuguese-opus • 1.12k
MultiSynt/nemotron-cc-dutch-opus • 635
MultiSynt/nemotron-cc-basque-opus • 790
MultiSynt/nemotron-cc-dutch-tower9b • 1.02k
MultiSynt/nemotron-cc-spanish-tower9b • 52
MultiSynt/nemotron-cc-italian-opus • 1.1k
MultiSynt/nemotron-cc-italian-tower72b • 1.15k
MultiSynt/nemotron-cc-finnish-tower9b • 152
datasets (24)
MultiSynt/nemotron-cc-portuguese-tower9b • 136M • 30
MultiSynt/nemotron-cc-italian-tower9b • 136M • 154
MultiSynt/nemotron-cc-polish-tower9b • 136M • 97
MultiSynt/nemotron-cc-french-tower9b • 135M • 72
MultiSynt/nemotron-cc-spanish-opus-qe • 3.29B • 56
MultiSynt/nemotron-cc-dutch-tower9b • 135M • 195
MultiSynt/nemotron-cc-icelandic-tower9b • 136M • 119
MultiSynt/finepdfs-summaries • 1.57B • 1.74k • 1
MultiSynt/nemotron-cc-danish-tower9b • 138M • 989
MultiSynt/nemotron-cc-finnish-opus-qe • 3.29B • 333