Add comprehensive model card for Monet-7B

by nielsr HF Staff - opened Nov 27, 2025

←

nielsr

Nov 27, 2025

This PR adds a comprehensive model card for the Monet-7B model, linking it to the paper Monet: Reasoning in Latent Visual Space Beyond Images and Language.

It includes:

Essential metadata: license: cc-by-nc-4.0, library_name: transformers, and pipeline_tag: image-text-to-text.
Links to the official paper and the GitHub repository.
An overview image from the GitHub repository.
A sample usage code snippet demonstrating how to use the model with the transformers library, noting that trust_remote_code=True is required due to custom components.
The BibTeX citation for the paper.

This update will significantly improve discoverability and ease of use for researchers and developers on the Hugging Face Hub.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Cannot merge

This branch has merge conflicts in the following files:

· Sign up or log in to comment