FastFlowLM/GPT-OSS-20B-NPU2
Tags: Text Generation, Transformers, gpt_oss, conversational, mxfp4
arxiv: 2508.10925
License: apache-2.0
Branch: main
Repository size: 14.5 GB
2 contributors
History: 11 commits
Latest commit: feat: update layer and version (#3), by FastFlowLM, 8c4a40b (verified), 15 days ago
Files (all marked Safe):

File                    Size      Storage  Last commit                          Updated
.gitattributes          1.97 kB   -        feat: Faster prefill                 about 1 month ago
README.md               7.12 kB   -        Create README.md                     about 2 months ago
attn.xclbin             592 kB    xet      feat: upload all xclbins             about 2 months ago
config.json             1.95 kB   -        feat: update layer and version (#3)  15 days ago
dequant_mxfp4.xclbin    279 kB    xet      feat: Faster prefill                 about 1 month ago
dequant_q4_1.xclbin     114 kB    xet      feat: Faster prefill                 about 1 month ago
expert.xclbin           146 kB    xet      feat: upload all xclbins             about 2 months ago
layer.xclbin            453 kB    xet      feat: update layer and version (#3)  15 days ago
lm_head.xclbin          153 kB    xet      feat: upload verified xclbin         about 2 months ago
mm.xclbin               544 kB    xet      feat: Faster prefill                 about 1 month ago
model.q4nx              14.5 GB   xet      feat: add weights                    about 2 months ago
tokenizer.json          27.9 MB   xet      feat: upload verified xclbin         about 2 months ago
tokenizer_config.json   21.8 kB   -        feat: upload verified xclbin         about 2 months ago