LLM - MLX Collection
Text generation models in MLX format, hand picked by Nexa Team. 6 items · Updated
With nexa-sdk installed, run them directly from the nexa-sdk CLI:
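A minimal sketch of the invocation, assuming the nexa-sdk CLI exposes a `nexa run <model>` subcommand that pulls the repo from the Hugging Face Hub (the exact subcommand name may differ by SDK version; check `nexa --help`):

```shell
# Hypothetical usage: download and chat with the 4-bit MLX model.
# Requires nexa-sdk installed and an Apple Silicon Mac for MLX models.
nexa run NexaAI/gpt-oss-20b-MLX-4bit
```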
NexaAI/gpt-oss-20b-MLX-4bit
This is a 4-bit quantized version of the OpenAI GPT OSS 20B model, optimized for Apple Silicon using the MLX framework. The model was successfully converted from the original gpt_oss architecture to MLX format using the development version of mlx-lm.
Original model card: InferenceIllusionist/gpt-oss-20b-MLX-4bit
Base model: openai/gpt-oss-20b
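Since the model was converted with mlx-lm, it can also be loaded directly through that library's Python API. A minimal sketch, assuming mlx-lm is installed (`pip install mlx-lm`) and you are on an Apple Silicon Mac; the first call downloads several GB of weights from the Hub:

```python
from mlx_lm import load, generate

# Download (on first use) and load the 4-bit MLX weights plus tokenizer.
model, tokenizer = load("NexaAI/gpt-oss-20b-MLX-4bit")

# Run a short greedy generation from a plain-text prompt.
text = generate(
    model,
    tokenizer,
    prompt="Explain what 4-bit quantization does to a model:",
    max_tokens=100,
)
print(text)
```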