| license: gemma | |
| library_name: transformers | |
| tags: | |
| - mlx | |
| widget: | |
| - messages: | |
| - role: user | |
| content: How does the brain work? | |
| inference: | |
| parameters: | |
| max_new_tokens: 200 | |
| extra_gated_heading: Access Gemma on Hugging Face | |
| extra_gated_prompt: To access Gemma on Hugging Face, you’re required to review and | |
| agree to Google’s usage license. To do this, please ensure you’re logged-in to Hugging | |
| Face and click below. Requests are processed immediately. | |
| extra_gated_button_content: Acknowledge license | |
| # batmac/gemma-1.1-2b-it-mlx-4bit | |
| This model was converted to MLX format from [`google/gemma-1.1-2b-it`]() using mlx-lm version **0.12.1**. | |
| Refer to the [original model card](https://huggingface.co/google/gemma-1.1-2b-it) for more details on the model. | |
| ## Use with mlx | |
| ```bash | |
| pip install mlx-lm | |
| ``` | |
| ```python | |
| from mlx_lm import load, generate | |
| model, tokenizer = load("batmac/gemma-1.1-2b-it-mlx-4bit") | |
| response = generate(model, tokenizer, prompt="hello", verbose=True) | |
| ``` | |