sky-2002/deepseek-tinystories-60M
Tags: Text Generation, PyTorch, roneneldan/TinyStories, English, deepseek, mixture-of-experts, tinystories, language-model, multi-head-latent-attention
deepseek-tinystories-60M at 55878ce: 235 MB, 1 contributor, History: 11 commits
Latest commit: sky-2002, "Upload deepseek_tinystories/processor.py" (55878ce, verified), 3 months ago
Name                   Size          Last commit                               Updated
deepseek_tinystories/  -             Upload deepseek_tinystories/processor.py  3 months ago
.gitattributes         1.52 kB       initial commit                            3 months ago
README.md              1.66 kB       Upload README.md                          3 months ago
config.json            337 Bytes     Upload config.json                        3 months ago
modeling_deepseek.py   25.6 kB       Upload modeling_deepseek.py               3 months ago
processor.py           6.64 kB       Upload processor.py                       3 months ago
pytorch_model.bin      235 MB (xet)  Upload pytorch_model.bin                  3 months ago
utils.py               1.46 kB       Upload utils.py                           3 months ago