FastFlowLM/GPT-OSS-20B-NPU2

Tags: Text Generation, Transformers, gpt_oss, conversational, mxfp4
GPT-OSS-20B-NPU2
14.5 GB
  • 2 contributors
History: 11 commits
Latest commit: 8c4a40b (verified), "feat: update layer and version (#3)", by FastFlowLM, 15 days ago
File                    Size      Storage  Last commit                          Updated
.gitattributes          1.97 kB   -        feat: Faster prefill                 about 1 month ago
README.md               7.12 kB   -        Create README.md                     about 2 months ago
attn.xclbin             592 kB    xet      feat: upload all xclbins             about 2 months ago
config.json             1.95 kB   -        feat: update layer and version (#3)  15 days ago
dequant_mxfp4.xclbin    279 kB    xet      feat: Faster prefill                 about 1 month ago
dequant_q4_1.xclbin     114 kB    xet      feat: Faster prefill                 about 1 month ago
expert.xclbin           146 kB    xet      feat: upload all xclbins             about 2 months ago
layer.xclbin            453 kB    xet      feat: update layer and version (#3)  15 days ago
lm_head.xclbin          153 kB    xet      feat: upload verified xclbin         about 2 months ago
mm.xclbin               544 kB    xet      feat: Faster prefill                 about 1 month ago
model.q4nx              14.5 GB   xet      feat: add weights                    about 2 months ago
tokenizer.json          27.9 MB   xet      feat: upload verified xclbin         about 2 months ago
tokenizer_config.json   21.8 kB   -        feat: upload verified xclbin         about 2 months ago
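
As a quick sanity check, the per-file sizes in the listing above can be tallied and compared with the 14.5 GB shown for the repository. The sketch below is illustrative only: the names `LISTED_SIZES`, `UNITS`, and `to_bytes` are hypothetical, the values are the rounded sizes displayed on the page, and it assumes decimal units (1 kB = 1000 bytes), which the page does not state.

```python
# Rounded sizes copied from the file listing; exact byte counts are not shown there.
LISTED_SIZES = {
    ".gitattributes": "1.97 kB",
    "README.md": "7.12 kB",
    "attn.xclbin": "592 kB",
    "config.json": "1.95 kB",
    "dequant_mxfp4.xclbin": "279 kB",
    "dequant_q4_1.xclbin": "114 kB",
    "expert.xclbin": "146 kB",
    "layer.xclbin": "453 kB",
    "lm_head.xclbin": "153 kB",
    "mm.xclbin": "544 kB",
    "model.q4nx": "14.5 GB",
    "tokenizer.json": "27.9 MB",
    "tokenizer_config.json": "21.8 kB",
}

# Assumption: decimal (SI) units, i.e. 1 kB = 1000 bytes.
UNITS = {"kB": 1e3, "MB": 1e6, "GB": 1e9}


def to_bytes(size: str) -> float:
    """Convert a display string such as '592 kB' to an approximate byte count."""
    value, unit = size.split()
    return float(value) * UNITS[unit]


total_bytes = sum(to_bytes(s) for s in LISTED_SIZES.values())
weights_share = to_bytes(LISTED_SIZES["model.q4nx"]) / total_bytes

print(f"total: {total_bytes / 1e9:.2f} GB")
print(f"model.q4nx share: {weights_share:.1%}")
```

Under these assumptions the total comes to roughly 14.53 GB, consistent with the 14.5 GB shown for the repository, and `model.q4nx` accounts for nearly all of it; the `.xclbin` kernels, tokenizer, and config files together add only about 30 MB.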