Add comprehensive model card for Monet-7B

#1
by nielsr HF Staff - opened

This PR adds a comprehensive model card for the Monet-7B model and links it to the paper "Monet: Reasoning in Latent Visual Space Beyond Images and Language".

It includes:

  • Essential metadata: license: cc-by-nc-4.0, library_name: transformers, and pipeline_tag: image-text-to-text.
  • Links to the official paper and the GitHub repository.
  • An overview image from the GitHub repository.
  • A sample usage code snippet demonstrating how to use the model with the transformers library, noting that trust_remote_code=True is required due to custom components.
  • The BibTeX citation for the paper.
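
As a rough illustration of the metadata item above, the three fields would normally sit in a YAML front-matter block at the top of README.md. The sketch below renders such a block from the values listed in this PR; the helper name `render_front_matter` and the `METADATA` dictionary are hypothetical and are not taken from the PR itself.

```python
# Minimal sketch: build the YAML front matter for a model card.
# The three key/value pairs come from the PR description; the helper
# name and structure are illustrative assumptions, not the PR's code.
METADATA = {
    "license": "cc-by-nc-4.0",
    "library_name": "transformers",
    "pipeline_tag": "image-text-to-text",
}


def render_front_matter(metadata: dict[str, str]) -> str:
    """Return a front-matter block delimited by '---' lines."""
    lines = ["---"]
    # Plain scalar values like these need no YAML quoting.
    lines += [f"{key}: {value}" for key, value in metadata.items()]
    lines.append("---")
    return "\n".join(lines) + "\n"


front_matter = render_front_matter(METADATA)
print(front_matter, end="")
```

The Hub reads this block to populate the license badge, the library tag, and the task filter, so the rest of the card (links, overview image, usage snippet, citation) would follow after the closing `---` line.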

This update should make the model easier to find and easier to use for researchers and developers on the Hugging Face Hub.

Cannot merge
This branch has merge conflicts in the following files:
  • README.md
