SmolLM2-1.7B-Executorch-Q8DA4W
This repository contains the smollm2_1_7b_q8da4w.pte model, exported for use with ExecuTorch.
Details
- Base Model: HuggingFaceTB/SmolLM2-1.7B-Instruct
- Format: .pte (ExecuTorch)
- Quantization: Q8DA4W (4-bit linear weights, 8-bit dynamic activations)
- Architecture: llama (compatible with the Llama export pipeline)
- File Size: ~1.7 GB
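Q8DA4W abbreviates "8-bit dynamic activations, 4-bit weights": weights are quantized once at export time, while activation scales are computed on the fly at inference. A minimal pure-Python sketch of the idea, with illustrative values and a single quantization group (the actual export typically uses torchao's 8da4w quantizer with per-group weight scales):

```python
# Sketch of the Q8DA4W idea: 4-bit symmetric weights (static, per group)
# plus 8-bit symmetric activations (dynamic, scaled per inference).
# Values below are illustrative, not from the real model.

def quantize_symmetric(values, n_bits):
    """Map floats to signed n-bit ints with a single shared scale."""
    qmax = 2 ** (n_bits - 1) - 1  # 7 for 4-bit, 127 for 8-bit
    scale = max(abs(v) for v in values) / qmax or 1.0
    q = [max(-qmax - 1, min(qmax, round(v / scale))) for v in values]
    return q, scale

# Weights: quantized to 4 bits once, at export time.
weights = [0.12, -0.40, 0.33, 0.05]
w_q, w_scale = quantize_symmetric(weights, 4)

# Activations: quantized to 8 bits dynamically, from the live tensor.
activations = [1.5, -2.2, 0.7, 3.1]
a_q, a_scale = quantize_symmetric(activations, 8)

# Integer dot product, then rescale the accumulator back to float.
approx = sum(w * a for w, a in zip(w_q, a_q)) * w_scale * a_scale
exact = sum(w * a for w, a in zip(weights, activations))
print(round(approx, 3), round(exact, 3))
```

The integer result stays close to the float reference while the weight storage drops to roughly a quarter of fp16, which is why the 1.7B model fits in about 1.7 GB.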
Features
- 🚀 Optimized for mobile/edge devices
- 📱 Compatible with react-native-executorch
- 💡 SmolLM2 is efficient and fast in resource-constrained environments
- 🗣️ Instruct-tuned for conversational AI
Usage
This model is ready to be used in mobile applications (iOS/Android) via the ExecuTorch runtime or react-native-executorch.
- Download smollm2_1_7b_q8da4w.pte and the tokenizer files (tokenizer.json, vocab.json, merges.txt).
- Place them in your app's asset folder.
- Load them with the ExecuTorch runtime.
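Before bundling the file into a mobile app, you can sanity-check it on desktop with ExecuTorch's Python runtime. A sketch, assuming the executorch pip package is installed and the .pte file sits in the working directory (the "forward" method name is the usual default for Llama-style exports, but verify it against your export):

```python
# Hypothetical desktop smoke test for the exported program.
# On device you would use the C++/Swift/Kotlin runtime or
# react-native-executorch instead of this Python API.
from executorch.runtime import Runtime

runtime = Runtime.get()
program = runtime.load_program("smollm2_1_7b_q8da4w.pte")
method = program.load_method("forward")
# outputs = method.execute([input_token_tensor, ...])
```

If loading succeeds here, a failure on device is more likely an asset-path or runtime-version issue than a broken export.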
Notes
- SmolLM2 uses a byte-level BPE tokenizer (similar to GPT-2), not SentencePiece as Llama does.
- Tokenizer files: tokenizer.json, vocab.json, merges.txt
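The byte-level detail matters when you inspect vocab.json: GPT-2-style tokenizers first map every raw byte to a printable Unicode character, so a leading space shows up as "Ġ" in the vocabulary. A small sketch of that byte-to-character table, reimplemented here for illustration (the real table ships inside tokenizer.json):

```python
# GPT-2-style byte-to-unicode table, reimplemented for illustration.
# Byte-level BPE rewrites raw bytes as printable characters and then
# runs BPE merges on those characters -- no SentencePiece involved.
def bytes_to_unicode():
    # Printable bytes keep their own character...
    keep = (list(range(ord("!"), ord("~") + 1))
            + list(range(ord("¡"), ord("¬") + 1))
            + list(range(ord("®"), ord("ÿ") + 1)))
    table = {b: chr(b) for b in keep}
    # ...all others (space, control bytes, etc.) are shifted above 255.
    n = 0
    for b in range(256):
        if b not in table:
            table[b] = chr(256 + n)
            n += 1
    return table

table = bytes_to_unicode()
visible = "".join(table[b] for b in " hello".encode("utf-8"))
print(visible)  # Ġhello -- the Ġ marks the leading space
```

This is why vocab entries like "Ġhello" are normal and expected; they decode back to " hello".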