--- license: apache-2.0 language: - en - zh base_model: - Tongyi-MAI/Z-Image-Turbo pipeline_tag: text-to-image tags: - text-to-image - image-generation - diffusion - comfyui - photorealistic - bilingual - chinese - english - 8-step - fast-generation --- # 🚀 Z-Image-Turbo-AIO | 8-Step Photorealistic Generation
**Ultra-Fast ‱ Bilingual Text Rendering ‱ All-in-One ‱ FP8 & BF16** [![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0) [![ComfyUI](https://img.shields.io/badge/ComfyUI-Compatible-green.svg)](https://github.com/comfyanonymous/ComfyUI)
## ✹ What is Z-Image-Turbo-AIO? Z-Image-Turbo-AIO is an **All-in-One repackage** of Alibaba Tongyi Lab's 6B parameter photorealistic image generator, optimized for lightning-fast 8-step generation. This version includes **integrated VAE and Text Encoder** for maximum convenience - just download and generate! ### Available Versions | Version | Size | Best For | |---------|------|----------| | 🟡 **FP8-AIO** | ~10GB | For most users | | 🌟 **BF16-AIO** | ~20GB | Maximum quality | ## 🎯 Key Features - ⚡ **8-step generation** - 10-40 seconds per image, depends on your GPU - 📩 **All-in-One** - No separate VAE/Text Encoder downloads needed - 📾 **Photorealistic** - Professional quality output - 📖 **Bilingual** - English & Chinese text rendering - 🎯 **8GB VRAM** - Works on GPUs with 8GB VRAM - 🌐 **Apache 2.0** - Open license for any use ## 🔄 Which Version Should I Choose? ### 🟡 FP8-AIO (Recommended for most users) - ✅ Half the file size - ✅ Faster download - ✅ Excellent quality - ✅ Perfect for 8GB VRAM - ✅ Great for testing & everyday use ### 🌟 BF16-AIO (Maximum precision) - ✅ BFloat16 full precision - ✅ Lossless quality - ✅ Great for testing & everyday use - ✅ Still works on 8GB VRAM ## đŸ“„ Quick Start (ComfyUI) ### Installation 1. Download your preferred version (FP8 or BF16) 2. Place in `ComfyUI/models/checkpoints` 3. Load with "Load Checkpoint" node 4. Generate! ### Recommended Settings | Parameter | Value | |-----------|-------| | Steps | 8 | | CFG | 1.0 | | Sampler | res_multistep | | Scheduler | simple | | Resolution | 1920×1088 | **That's it! No separate VAE or Text Encoder needed!** ## 📊 Performance All tests on **RTX 4060 (8GB VRAM)** ‱ FP8 ‱ 1920×1088 ‱ 8 steps | Test | Generation Time | |------|-----------------| | Urban Interior | ~32s | | Architecture | ~32-34s | | Food Photography | ~32s | | Bilingual Signage | ~32s | ## 💡 Prompting Guide ### ✅ Natural Language Works Best! **Good Example:** ``` A cozy bookstore with floor-to-ceiling wooden shelves filled with colorful books, comfortable reading nooks with cushions near large windows, warm pendant lighting, peaceful afternoon atmosphere, professional interior photography ``` **Bad Example:** ``` bookstore, books, chairs, window, cozy, warm light, interior ``` ### 📖 Bilingual Text Rendering **English Text:** ``` Neon sign reading "OPEN 24/7" in bright blue letters above entrance. Modern sans-serif font, glowing effect against brick wall. ``` **Chinese Text:** ``` Traditional tea house entrance with sign reading "ć€éŸ”èŒ¶ćŠ" in elegant gold Chinese calligraphy on red wooden board with ornate carved border. ``` **Both Languages:** ``` Modern cafe exterior with bilingual sign. "Morning Brew Coffee" in white elegant script above, "æ™šæ›Šć’–ć•Ą" in matching Chinese characters below. Both glowing warmly at dusk. ``` ### 📝 Prompting Tips | Do ✅ | Don't ❌ | |------|---------| | Use natural language descriptions | Use tag-style prompts (tag1, tag2) | | Be detailed (100-300 words optimal) | Write very short prompts (<50 words) | | Include lighting and mood | Add negative prompts (not used) | | Describe camera angle and style | Include conflicting instructions | | Specify materials and colors | | ## 🙏 Credits & Acknowledgments ### Original Model - **Developer:** Tongyi Lab (Alibaba Group) - **Architecture:** Single-Stream Diffusion Transformer (6B parameters) - **Algorithm:** Decoupled-DMD + DMDR - **License:** Apache 2.0 ### AIO Conversion - **Created by:** [SeeSee21](https://huggingface.co/SeeSee21) - **Format:** Integrated VAE + Text Encoder - **Purpose:** Simplified single-file deployment ### Resources - đŸ€— [Original HuggingFace](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo) - đŸ’» [GitHub Repository](https://github.com/Tongyi-MAI/Z-Image) - 🎹 [ComfyUI Files](https://huggingface.co/Comfy-Org/z_image_turbo) - đŸ–Œïž [CivitAI Page](https://civit.ai/models/2173571) ## 📈 Version History ### v1.0 - Initial AIO Release - FP8-AIO version (10GB) - BF16-AIO version (20GB) - Integrated VAE + Text Encoder - Single-file deployment - Based on Tongyi-MAI/Z-Image-Turbo - Tested on RTX 4060 8GB - Optimized for 1920×1088 ---
**Download, load with "Load Checkpoint", and generate professional photos in seconds! 🚀**