---
license: apache-2.0
language:
- en
- zh
base_model:
- Tongyi-MAI/Z-Image-Turbo
pipeline_tag: text-to-image
tags:
- text-to-image
- image-generation
- diffusion
- comfyui
- photorealistic
- bilingual
- chinese
- english
- 8-step
- fast-generation
---

# 🚀 Z-Image-Turbo-AIO | 8-Step Photorealistic Generation

<div align="center">

**Ultra-Fast • Bilingual Text Rendering • All-in-One • FP8 & BF16**

[License: Apache 2.0](https://opensource.org/licenses/Apache-2.0)
[ComfyUI](https://github.com/comfyanonymous/ComfyUI)

</div>

## ✨ What is Z-Image-Turbo-AIO?

Z-Image-Turbo-AIO is an **All-in-One repackage** of Z-Image-Turbo, the 6B-parameter photorealistic image generator from Alibaba's Tongyi Lab, optimized for fast 8-step generation. This version ships with the **VAE and Text Encoder integrated** into a single checkpoint, so there is nothing else to fetch: just download and generate!

### Available Versions

| Version | Size | Best For |
|---------|------|----------|
| 🟡 **FP8-AIO** | ~10GB | Most users |
| 🌟 **BF16-AIO** | ~20GB | Maximum quality |

## 🎯 Key Features

- ⚡ **8-step generation** - 10-40 seconds per image, depending on your GPU
- 📦 **All-in-One** - No separate VAE/Text Encoder downloads needed
- 📸 **Photorealistic** - Professional-quality output
- 📖 **Bilingual** - English & Chinese text rendering
- 🎯 **8GB VRAM** - Runs on GPUs with 8GB of VRAM
- 🌐 **Apache 2.0** - Open license for any use

## 🔄 Which Version Should I Choose?

### 🟡 FP8-AIO (Recommended for most users)
- ✅ Half the file size of BF16-AIO
- ✅ Faster download
- ✅ Excellent quality
- ✅ Perfect for 8GB VRAM
- ✅ Great for testing & everyday use

### 🌟 BF16-AIO (Maximum precision)
- ✅ BFloat16 weights, no FP8 quantization
- ✅ No quality loss from quantization
- ✅ Great for testing & everyday use
- ✅ Still works on 8GB VRAM

## 📥 Quick Start (ComfyUI)

### Installation

1. Download your preferred version (FP8 or BF16).
2. Place the file in `ComfyUI/models/checkpoints`.
3. Load it with the "Load Checkpoint" node.
4. Generate!

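
Step 2 can be scripted. The following is a minimal sketch, assuming the checkpoint has already been downloaded; the function name, the example filename, and the ComfyUI location are hypothetical and should be adapted to your setup.

```python
from pathlib import Path
import shutil


def install_checkpoint(downloaded_file, comfyui_root):
    """Move a downloaded checkpoint into <comfyui_root>/models/checkpoints."""
    source = Path(downloaded_file).expanduser()
    if not source.is_file():
        raise FileNotFoundError(f"Checkpoint not found: {source}")

    # The "Load Checkpoint" node lists files from this folder.
    target_dir = Path(comfyui_root).expanduser() / "models" / "checkpoints"
    target_dir.mkdir(parents=True, exist_ok=True)

    target = target_dir / source.name
    if target.exists():
        # Leave an existing file untouched rather than overwrite 10-20GB of data.
        return target

    shutil.move(str(source), str(target))
    return target
```

For example, `install_checkpoint("~/Downloads/example-fp8-aio.safetensors", "~/ComfyUI")` would return the new path of the file (the filename here is a placeholder, not the real release name). Restart or refresh ComfyUI afterwards so the node picks up the new file.
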
### Recommended Settings

| Parameter | Value |
|-----------|-------|
| Steps | 8 |
| CFG | 1.0 |
| Sampler | res_multistep |
| Scheduler | simple |
| Resolution | 1920×1088 |

**That's it! No separate VAE or Text Encoder needed!**

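
If you drive ComfyUI through its HTTP API instead of the graph editor, the settings above map onto an API-format workflow roughly as follows. This is a sketch, not the author's workflow: the checkpoint filename is whatever you saved, the node IDs are arbitrary, and the choice of `EmptySD3LatentImage` as the latent node is an assumption. The server address is ComfyUI's default.

```python
import json
import urllib.request

# Values from the Recommended Settings table.
STEPS = 8
CFG = 1.0
SAMPLER = "res_multistep"
SCHEDULER = "simple"
WIDTH, HEIGHT = 1920, 1088


def build_workflow(prompt, ckpt_name, seed=0):
    """Return a ComfyUI API-format graph: checkpoint -> prompt -> sampler -> image."""
    return {
        # "Load Checkpoint": outputs MODEL (0), CLIP (1) and VAE (2) from one file.
        "1": {"class_type": "CheckpointLoaderSimple",
              "inputs": {"ckpt_name": ckpt_name}},
        "2": {"class_type": "CLIPTextEncode",
              "inputs": {"text": prompt, "clip": ["1", 1]}},
        # KSampler requires a negative input; it is left empty because
        # negative prompts are not used at CFG 1.0.
        "3": {"class_type": "CLIPTextEncode",
              "inputs": {"text": "", "clip": ["1", 1]}},
        # Assumption: latent node choice; swap in whatever your workflow uses.
        "4": {"class_type": "EmptySD3LatentImage",
              "inputs": {"width": WIDTH, "height": HEIGHT, "batch_size": 1}},
        "5": {"class_type": "KSampler",
              "inputs": {"model": ["1", 0], "positive": ["2", 0],
                         "negative": ["3", 0], "latent_image": ["4", 0],
                         "seed": seed, "steps": STEPS, "cfg": CFG,
                         "sampler_name": SAMPLER, "scheduler": SCHEDULER,
                         "denoise": 1.0}},
        "6": {"class_type": "VAEDecode",
              "inputs": {"samples": ["5", 0], "vae": ["1", 2]}},
        "7": {"class_type": "SaveImage",
              "inputs": {"images": ["6", 0], "filename_prefix": "z_image_turbo"}},
    }


def queue_prompt(workflow, server="http://127.0.0.1:8188"):
    """POST the graph to a running ComfyUI server's /prompt endpoint."""
    body = json.dumps({"prompt": workflow}).encode("utf-8")
    request = urllib.request.Request(
        f"{server}/prompt", data=body,
        headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())
```

With ComfyUI running, `queue_prompt(build_workflow("A cozy bookstore ...", "your-checkpoint.safetensors"))` queues one image; the finished file appears in ComfyUI's output folder.
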
## 📊 Performance

All tests were run on an **RTX 4060 (8GB VRAM)** • FP8 • 1920×1088 • 8 steps

| Test | Generation Time |
|------|-----------------|
| Urban Interior | ~32s |
| Architecture | ~32-34s |
| Food Photography | ~32s |
| Bilingual Signage | ~32s |

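
To put these timings in perspective, the short calculation below converts the typical ~32s figure into time per step and pixel throughput. It assumes the reported time covers the whole generation; if it includes text encoding and VAE decoding, the per-step figure is an upper bound on pure sampling time.

```python
WIDTH, HEIGHT = 1920, 1088
STEPS = 8
TOTAL_SECONDS = 32.0  # approximate figure from the table above

megapixels = WIDTH * HEIGHT / 1_000_000
seconds_per_step = TOTAL_SECONDS / STEPS
megapixels_per_second = megapixels / TOTAL_SECONDS

print(f"{megapixels:.2f} MP per image")                # 2.09 MP per image
print(f"{seconds_per_step:.1f} s per step")            # 4.0 s per step
print(f"{megapixels_per_second:.3f} MP/s end to end")  # 0.065 MP/s end to end
```
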
## 💡 Prompting Guide

### ✅ Natural Language Works Best!

**Good Example:**
```
A cozy bookstore with floor-to-ceiling wooden shelves filled with
colorful books, comfortable reading nooks with cushions near large
windows, warm pendant lighting, peaceful afternoon atmosphere,
professional interior photography
```

**Bad Example:**
```
bookstore, books, chairs, window, cozy, warm light, interior
```

### 📖 Bilingual Text Rendering

**English Text:**
```
Neon sign reading "OPEN 24/7" in bright blue letters above entrance.
Modern sans-serif font, glowing effect against brick wall.
```

**Chinese Text:**
```
Traditional tea house entrance with sign reading "古韵茶坊" in elegant
gold Chinese calligraphy on red wooden board with ornate carved border.
```

**Both Languages:**
```
Modern cafe exterior with bilingual sign. "Morning Brew Coffee" in
white elegant script above, "晨曦咖啡" in matching Chinese characters
below. Both glowing warmly at dusk.
```

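
The examples above share one pattern: the text to render sits inside double quotes, surrounded by a plain-language description of the sign. If you generate many such prompts, a small template helper keeps the quoting consistent. This is an illustrative sketch; the function name and the fixed wording are hypothetical and simply mirror the "Both Languages" example.

```python
def bilingual_sign_prompt(scene, english_text, chinese_text):
    """Compose a natural-language prompt with both sign texts in double quotes."""
    for text in (english_text, chinese_text):
        # A stray double quote would end the quoted span early.
        if '"' in text:
            raise ValueError(f"sign text must not contain double quotes: {text!r}")
    return (
        f'{scene} with bilingual sign. "{english_text}" in white elegant '
        f'script above, "{chinese_text}" in matching Chinese characters '
        "below. Both glowing warmly at dusk."
    )


print(bilingual_sign_prompt("Modern cafe exterior", "Morning Brew Coffee", "晨曦咖啡"))
```

Called with the values shown, it reproduces the "Both Languages" prompt as a single line.
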
### 📝 Prompting Tips

| Do ✅ | Don't ❌ |
|------|---------|
| Use natural language descriptions | Use tag-style prompts (tag1, tag2) |
| Be detailed (100-300 words optimal) | Write very short prompts (<50 words) |
| Include lighting and mood | Add negative prompts (not used) |
| Describe camera angle and style | Include conflicting instructions |
| Specify materials and colors | |

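
Two of these rules, prompt length and tag-style phrasing, are easy to check automatically before queuing a batch. The sketch below is a rough heuristic, not part of the model or ComfyUI: the function name is hypothetical, the word thresholds come from the table, and the tag-style detection rule (mostly one- or two-word comma-separated fragments) is an assumption. Word counting by whitespace does not work for prompts written mainly in Chinese.

```python
import re

MIN_WORDS = 50   # "very short" threshold from the Don't column
MAX_WORDS = 300  # upper end of the "optimal" range from the Do column


def check_prompt(prompt):
    """Return a list of warnings based on the tips table (heuristics only)."""
    warnings = []
    word_count = len(prompt.split())
    if word_count < MIN_WORDS:
        warnings.append(f"short prompt: {word_count} words (<{MIN_WORDS})")
    elif word_count > MAX_WORDS:
        warnings.append(f"long prompt: {word_count} words (>{MAX_WORDS})")

    # Tag-style heuristic: most comma-separated fragments are 1-2 words long.
    fragments = [f.strip() for f in re.split(r"[,\n]", prompt) if f.strip()]
    short = [f for f in fragments if len(f.split()) <= 2]
    if len(fragments) >= 4 and len(short) / len(fragments) > 0.6:
        warnings.append("looks tag-style: rewrite as full sentences")
    return warnings
```

Running it on `"bookstore, books, chairs, window, cozy, warm light, interior"` returns both a length warning and a tag-style warning.
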
## 🙏 Credits & Acknowledgments

### Original Model
- **Developer:** Tongyi Lab (Alibaba Group)
- **Architecture:** Single-Stream Diffusion Transformer (6B parameters)
- **Algorithm:** Decoupled-DMD + DMDR
- **License:** Apache 2.0

### AIO Conversion
- **Created by:** [SeeSee21](https://huggingface.co/SeeSee21)
- **Format:** Integrated VAE + Text Encoder
- **Purpose:** Simplified single-file deployment

### Resources
- 🤗 [Original HuggingFace](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo)
- 💻 [GitHub Repository](https://github.com/Tongyi-MAI/Z-Image)
- 🎨 [ComfyUI Files](https://huggingface.co/Comfy-Org/z_image_turbo)
- 🖼️ [CivitAI Page](https://civit.ai/models/2173571)

## 📈 Version History

### v1.0 - Initial AIO Release
- FP8-AIO version (~10GB)
- BF16-AIO version (~20GB)
- Integrated VAE + Text Encoder
- Single-file deployment
- Based on Tongyi-MAI/Z-Image-Turbo
- Tested on RTX 4060 8GB
- Optimized for 1920×1088

---

<div align="center">

**Download, load with "Load Checkpoint", and generate professional photos in seconds! 🚀**

</div>