# Llama-MedX v3.2 (GGUF)
A quantized build of the Llama-medx_v3.2 medical assistant model, packaged for Ollama / llama.cpp runtimes. This export includes the Modelfile generated from the original Ollama registry entry and a GGUF binary derived from the upstream Hugging Face release.
- Base model: skumar9/Llama-medx_v3.2
- Architecture: Meta Llama 3.1 8B fine-tuned for medical QA
## Variants
| Variant | Size | Blob |
|---|---|---|
| latest | 1.88 GB | sha256-dde5aa3fc5ffc17176b5e8bdc82f587b24b2678c6c66101bf7da77af9f7ccdff |
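To confirm a downloaded blob matches the digest above, a plain `sha256sum` check is enough. The local filename below is an assumption; substitute whatever path your GGUF export actually lives at.

```sh
# Compute the file's SHA-256 and compare it to the blob digest in the table.
# The filename is hypothetical -- use the actual path of your GGUF export.
sha256sum llama-medx_v32--latest.gguf
# Expected digest:
# dde5aa3fc5ffc17176b5e8bdc82f587b24b2678c6c66101bf7da77af9f7ccdff
```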
## Usage with Ollama
```sh
ollama create llama-medx-v32 -f modelfiles/llama-medx_v32--latest.Modelfile
ollama run llama-medx-v32
```
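Once created, the model can also be queried through Ollama's local REST API (served on port 11434 by default). A minimal sketch; the prompt is only an illustration:

```sh
# Send a single non-streaming generation request to the local Ollama server.
# "stream": false returns one JSON response instead of a token stream.
curl http://localhost:11434/api/generate -d '{
  "model": "llama-medx-v32",
  "prompt": "What are common contraindications for ibuprofen?",
  "stream": false
}'
```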
## Source
Originally published on my Ollama profile: https://ollama.com/richardyoung/llama-medx_v32