metadata
			license: apache-2.0
datasets:
  - HuggingFaceM4/OBELICS
  - HuggingFaceM4/the_cauldron
  - HuggingFaceM4/Docmatix
  - HuggingFaceM4/WebSight
language:
  - en
tags:
  - multimodal
  - vision
  - image-text-to-text
  - mlx
library_name: transformers
mlx-community/Idefics3-8B-Llama3-4bit
This model was converted to MLX format from HuggingFaceM4/Idefics3-8B-Llama3 using mlx-vlm version 0.1.12.
Refer to the original model card for more details on the model.
Use with mlx
pip install -U mlx-vlm
python -m mlx_vlm.generate --model mlx-community/Idefics3-8B-Llama3-4bit --max-tokens 100 --temp 0.0 --prompt "Describe this image." --image <path_to_image>