|
|
--- |
|
|
license: apache-2.0 |
|
|
language: |
|
|
- en |
|
|
- zh |
|
|
library_name: diffusers |
|
|
base_model: |
|
|
- Qwen/Qwen-Image-Edit |
|
|
pipeline_tag: image-to-image |
|
|
tags: |
|
|
- image-editing |
|
|
- consistency |
|
|
- aesthetics |
|
|
- DiT |
|
|
- Qwen-Image |
|
|
- ValiantCat |
|
|
--- |
|
|
|
|
|
<p align="center"> |
|
|
<img src="https://ai.static.ad2.cc/banner.png" width="1000"/> |
|
|
</p> |
|
|
|
|
|
--- |
|
|
|
|
|
# 🌈 Qwen-Image-Edit-MeiTu |
|
|
|
|
|
This model — **Qwen-Image-Edit-MeiTu** — is an improved variant of [Qwen/Qwen-Image-Edit](https://huggingface.co/Qwen/Qwen-Image-Edit), built with **DiT-based architecture fine-tuning** to enhance **visual consistency**, **aesthetic quality**, and **structural alignment** in complex edits. |
|
|
|
|
|
Developed by **Valiant Cat AI Lab**, this version aims to further close the gap between high-fidelity semantic editing and coherent artistic rendering, achieving a more natural and professional output across a wide range of prompts and subjects. |
|
|
|
|
|
--- |
|
|
|
|
|
## ✨ Key Improvements |
|
|
|
|
|
* **Enhanced Consistency**: |
|
|
Utilizes DiT (Diffusion Transformer) fine-tuning to ensure **structural stability** between input and edited regions, maintaining global spatial coherence. |
|
|
|
|
|
* **Aesthetic Optimization**: |
|
|
Trained with aesthetic discriminators and curated aesthetic score datasets, producing more **pleasing colors, contrast, and light balance**. |
|
|
|
|
|
* **Better Detail Preservation**: |
|
|
Improved low-level reconstruction for fine details such as **textures, faces, and typography**. |
|
|
|
|
|
* **Broader Scene Adaptability**: |
|
|
Performs well on **portraits, environments, product photos, and illustrations**, supporting both **semantic** and **appearance-based** editing. |
|
|
|
|
|
--- |
|
|
|
|
|
## 🖼️ Showcase |
|
|
|
|
|
Below are examples of **consistency and aesthetic improvement** in complex editing scenarios: |
|
|
|
|
|
| Input & Output | |
|
|
|----------------| |
|
|
| <img src="preview/result1.png" width="800"/> | |
|
|
| <img src="preview/result2.png" width="800"/> | |
|
|
| <img src="preview/result3.png" width="800"/> | |
|
|
| <img src="preview/result4.png" width="800"/> | |
|
|
| <img src="preview/result5.png" width="800"/> | |
|
|
|
|
|
|
|
|
|
|
|
## 💬 Recommended Prompts |
|
|
|
|
|
Try these prompts to explore the model’s strengths: |
|
|
|
|
|
* “make the lighting soft and cinematic with better balance” |
|
|
* “enhance the photo’s composition and maintain realism” |
|
|
* “refine skin tone and texture consistency” |
|
|
* “improve the global color tone and aesthetic harmony” |
|
|
* “increase photo realism and clarity without changing content” |
|
|
|
|
|
--- |
|
|
|
|
|
## 🧩 Integration with ComfyUI |
|
|
|
|
|
This model works seamlessly with a modified [ComfyUI Qwen-Image-Edit workflow](https://huggingface.co/valiantcat/Qwen-Image-Edit-MeiTu/blob/main/Qwen-Edit-MeiTu.json). |
|
|
Just use this model in the **Unet node** to workflow for edit image. |
|
|
|
|
|
--- |
|
|
|
|
|
## 📥 Download Model |
|
|
|
|
|
Weights available in **Safetensors** format: |
|
|
|
|
|
👉 [Download Qwen-Image-Edit-MeiTu](https://huggingface.co/valiantcat/Qwen-Image-Edit-MeiTu) |
|
|
|
|
|
--- |
|
|
|
|
|
## 🧠 Training |
|
|
|
|
|
This model was trained and optimized by the |
|
|
**AI Laboratory of Chongqing Valiant Cat Technology Co., LTD.** |
|
|
Visit [https://vvicat.com/](https://vvicat.com/) for business collaborations or research partnerships. |
|
|
|
|
|
--- |
|
|
|
|
|
## 📜 License |
|
|
|
|
|
Licensed under **Apache 2.0**. |
|
|
|
|
|
--- |
|
|
|
|
|
## 💼 Join Us |
|
|
|
|
|
We are hiring research engineers and creative ML practitioners at |
|
|
**Chongqing Valiant Cat Technology Co., LTD** — reach out via |
|
|
📧 **[email protected]** |
|
|
|
|
|
|