valiantcat
/

Qwen-Image-Edit-MeiTu

Model card Files Files and versions

Qwen-Image-Edit-MeiTu / README.md

starsfriday's picture

Update README.md

af4082b verified 9 days ago

|

history blame contribute delete

3.33 kB

	---
	license: apache-2.0
	language:
	- en
	- zh
	library_name: diffusers
	base_model:
	- Qwen/Qwen-Image-Edit
	pipeline_tag: image-to-image
	tags:
	- image-editing
	- consistency
	- aesthetics
	- DiT
	- Qwen-Image
	- ValiantCat
	---

	<p align="center">
	<img src="https://ai.static.ad2.cc/banner.png" width="1000"/>
	</p>

	---

	# 🌈 Qwen-Image-Edit-MeiTu

	This model — Qwen-Image-Edit-MeiTu — is an improved variant of [Qwen/Qwen-Image-Edit](https://huggingface.co/Qwen/Qwen-Image-Edit), built with DiT-based architecture fine-tuning to enhance visual consistency, aesthetic quality, and structural alignment in complex edits.

	Developed by Valiant Cat AI Lab, this version aims to further close the gap between high-fidelity semantic editing and coherent artistic rendering, achieving a more natural and professional output across a wide range of prompts and subjects.

	---

	## ✨ Key Improvements

	* Enhanced Consistency:
	Utilizes DiT (Diffusion Transformer) fine-tuning to ensure structural stability between input and edited regions, maintaining global spatial coherence.

	* Aesthetic Optimization:
	Trained with aesthetic discriminators and curated aesthetic score datasets, producing more pleasing colors, contrast, and light balance.

	* Better Detail Preservation:
	Improved low-level reconstruction for fine details such as textures, faces, and typography.

	* Broader Scene Adaptability:
	Performs well on portraits, environments, product photos, and illustrations, supporting both semantic and appearance-based editing.

	---

	## 🖼️ Showcase

	Below are examples of consistency and aesthetic improvement in complex editing scenarios:

	\| Input & Output \|
	\|----------------\|
	\| <img src="preview/result1.png" width="800"/> \|
	\| <img src="preview/result2.png" width="800"/> \|
	\| <img src="preview/result3.png" width="800"/> \|
	\| <img src="preview/result4.png" width="800"/> \|
	\| <img src="preview/result5.png" width="800"/> \|



	## 💬 Recommended Prompts

	Try these prompts to explore the model’s strengths:

	* “make the lighting soft and cinematic with better balance”
	* “enhance the photo’s composition and maintain realism”
	* “refine skin tone and texture consistency”
	* “improve the global color tone and aesthetic harmony”
	* “increase photo realism and clarity without changing content”

	---

	## 🧩 Integration with ComfyUI

	This model works seamlessly with a modified [ComfyUI Qwen-Image-Edit workflow](https://huggingface.co/valiantcat/Qwen-Image-Edit-MeiTu/blob/main/Qwen-Edit-MeiTu.json).
	Just use this model in the Unet node to workflow for edit image.

	---

	## 📥 Download Model

	Weights available in Safetensors format:

	👉 [Download Qwen-Image-Edit-MeiTu](https://huggingface.co/valiantcat/Qwen-Image-Edit-MeiTu)

	---

	## 🧠 Training

	This model was trained and optimized by the
	AI Laboratory of Chongqing Valiant Cat Technology Co., LTD.
	Visit [https://vvicat.com/](https://vvicat.com/) for business collaborations or research partnerships.

	---

	## 📜 License

	Licensed under Apache 2.0.

	---

	## 💼 Join Us

	We are hiring research engineers and creative ML practitioners at
	Chongqing Valiant Cat Technology Co., LTD — reach out via
	📧 [email protected]