beezu/zerofata_GLM-4.5-Iceblink-v2-106B-A12B-MLX-MXFP4
This model beezu/zerofata_GLM-4.5-Iceblink-v2-106B-A12B-MLX-MXFP4 was converted to MLX format from zerofata/GLM-4.5-Iceblink-v2-106B-A12B using mlx-lm version 0.28.3.
Use with mlx
pip install mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("beezu/zerofata_GLM-4.5-Iceblink-106B-A12B-MLX-MXFP4")
prompt = "hello"
if tokenizer.chat_template is not None:
messages = [{"role": "user", "content": prompt}]
prompt = tokenizer.apply_chat_template(
messages, add_generation_prompt=True
)
response = generate(model, tokenizer, prompt=prompt, verbose=True)
- Downloads last month
- 135
Model tree for beezu/zerofata_GLM-4.5-Iceblink-v2-106B-A12B-MLX-MXFP4
Base model
zai-org/GLM-4.5-Air
Finetuned
zerofata/GLM-4.5-Iceblink-v2-106B-A12B