The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models
Paper
•
2510.13996
•
Published
•
6
🤗 Hugging Face x 🌸 BigScience initiative to create open source community resources for LAMs.