File size: 7,276 Bytes
6f8c2e7 03ae46a 310e746 6f8c2e7 03a74f9 d1d432a 6f8c2e7 d0d32fc df90af8 d0d32fc 6f8c2e7 284eb66 6f8c2e7 d2a2487 284eb66 1a91393 d2a2487 6f8c2e7 4592100 6f8c2e7 d2a2487 6f8c2e7 72dd8d0 284eb66 747274a 6f8c2e7 747274a 6f8c2e7 747274a 6f8c2e7 747274a 6f8c2e7 747274a 6f8c2e7 747274a 6f8c2e7 0ff4462 6f8c2e7 f0329f1 6f8c2e7 f0329f1 6f8c2e7 9703ede 7c2474f 6f8c2e7 0ff4462 6f8c2e7 4592100 6f8c2e7 4592100 747274a 6f8c2e7 747274a 6f8c2e7 2c6f90f 747274a 6f8c2e7 747274a 6f8c2e7 0ff4462 747274a 6f8c2e7 747274a 6f8c2e7 747274a 6f8c2e7 747274a 6f8c2e7 747274a 6f8c2e7 747274a 6f8c2e7 817fe7d 747274a 6f8c2e7 747274a 6f8c2e7 747274a 6f8c2e7 b0c6474 6f8c2e7 747274a 6f8c2e7 747274a 6f8c2e7 9f883fd 5f02f41 747274a 6f8c2e7 747274a 6f8c2e7 f51ba67 6f8c2e7 d1d432a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 |
---
base_model:
- Alpha-VLLM/Lumina-Image-2.0
license: apache-2.0
tags:
- stable-diffusion
- text-to-image
- comfyui
- diffusion-single-file
---
[中文版模型说明](https://huggingface.co/neta-art/Neta-Lumina/blob/main/README-ZH.md)
<br>
<br>
[**Neta Lumina Tech Report**](https://neta.art/blog/neta_lumina/)
## 📽️ Flash Preview
<video controls autoplay loop muted playsinline style="max-width:100%; border-radius:8px;">
<source src="https://pages-r2.neta.art/Neta_Lumina_Flash_PV.webm" type="video/webm" />
Your browser does not support the video tag.
</video>
# Introduction
**Neta Lumina** is a high‑quality anime‑style image‑generation model developed by Neta.art Lab.
Building on the open‑source **Lumina‑Image‑2.0** released by the Alpha‑VLLM team at Shanghai AI Laboratory, we fine‑tuned the model with a vast corpus of high‑quality anime images and multilingual tag data. The preliminary result is a compelling model with powerful comprehension and interpretation abilities (thanks to Gemma text encoder), ideal for illustration, posters, storyboards, character design, and more.
## Key Features
- Optimized for diverse creative scenarios such as Furry, Guofeng (traditional‑Chinese aesthetics), pets, etc.
- Wide coverage of characters and styles, from popular to niche concepts. (Still support danbooru tags!)
- Accurate natural‑language understanding with excellent adherence to complex prompts.
- Native multilingual support, with Chinese, English, and Japanese recommended first.
## Model Versions
For models in alpha tests, requst access at https://huggingface.co/neta-art/NetaLumina_Alpha if you are interested. We will keep updating.
### neta-lumina-v1.0
- **Official Release**: overall best performance
### neta-lumina-beta-0624-raw (archived)
- **Primary Goal**: General knowledge and anime‑style optimization
- **Data Set**: >13 million anime‑style images
- **>46,000** A100 Hours
- Higher upper limit, suitable for pro users. Check [**Neta Lumina Prompt Book**](https://nieta-art.feishu.cn/wiki/RY3GwpT59icIQlkWXEfcCqIMnQd) for better results.
### neta-lumina-beta-0624-aes-experimental (archived)
- First beta release candidate
- **Primary Goal**: Enhanced aesthetics, pose accuracy, and scene detail
- **Data Set**: Hundreds of thousands of handpicked high‑quality anime images (fine‑tuned on an older version of raw model)
- User-friendly, suitable for most people.
<br>
# How to Use
[Try it at Hugging Face playground](https://huggingface.co/spaces/neta-art/NetaLumina_T2I_Playground)
## ComfyUI
Neta Lumina is built on the **Lumina2 Diffusion Transformer (DiT)** framework, please follow these steps precisely.
### Environment Requirements
Currently Neta Lumina runs only on ComfyUI:
- Latest ComfyUI installation
- ≥ 8 GB VRAM
### Downloads & Installation
**Original (component) release**
1. **Neta Lumina-Beta**
- Download link: https://huggingface.co/neta-art/Neta-Lumina/blob/main/Unet/neta-lumina-v1.0.safetensors
- Save path: `ComfyUI/models/unet/`
2. **Text Encoder (Gemma-2B)**
- Download link:https://huggingface.co/neta-art/Neta-Lumina/blob/main/Text%20Encoder/gemma_2_2b_fp16.safetensors
- Save path: `ComfyUI/models/text_encoders/`
3. **VAE Model (16-Channel FLUX VAE)**
- Download link: https://huggingface.co/neta-art/Neta-Lumina/blob/main/VAE/ae.safetensors
- Save path: `ComfyUI/models/vae/`
**Workflow**: load [`lumina_workflow.json`](https://huggingface.co/neta-art/Neta-Lumina/resolve/main/lumina_workflow.json) in ComfyUI.

- `UNETLoader` – loads the `.pth`
- `VAELoader` – loads `ae.safetensors`
- `CLIPLoader` – loads `gemma_2_2b_fp16.safetensors`
- `Text Encoder` – connects positive /negative prompts to K Sampler
**Simple merged release**
Download [`neta-lumina-v1.0-all-in-one.safetensors`](https://huggingface.co/neta-art/Neta-Lumina/blob/main/neta-lumina-v1.0-all-in-one.safetensors),
`md5sum = dca54fef3c64e942c1a62a741c4f9d8a`,
you may use ComfyUI’s simple checkpoint loader workflow.
### Recommended Settings
- **Sampler**: `res_multistep/ euler_ancestral`
- **Scheduler**: `linear_quadratic`
- **Steps**: 30
- **CFG (guidance)**: 4 – 5.5
- **EmptySD3LatentImage resolution**: 1024 × 1024, 768 × 1532, 968 × 1322, or >= 1024
<br>
# Prompt Book
Detailed prompt guidelines: [**Neta Lumina Prompt Book**](https://neta.art/blog/neta_lumina_prompt_book/)
<br>
# Community
- Discord: https://discord.com/invite/TTTGccjbEa
- QQ group: 1039442542
<br>
# Roadmap
## Model
- Continous base‑model training to raise reasoning capability.
- Aesthetic‑dataset iteration to improve anatomy, background richness, and overall appealness.
- Smarter, more versatile tagging tools to lower the creative barrier.
## Ecosystem
- LoRA training tutorials and components
- Experienced users may already fine‑tune via Lumina‑Image‑2.0’s open code.
- Development of advanced control / style‑consistency features (e.g., [Omini Control](https://arxiv.org/pdf/2411.15098)). [**Call for Collaboration!**](https://discord.com/invite/TTTGccjbEa)
<br>
# License & Disclaimer
- Neta Lumina is released under [**Apache License 2.0**](https://www.apache.org/licenses/LICENSE-2.0)
<br>
# Participants & Contributors
- Special thanks to the **Alpha‑VLLM** team for open‑sourcing **Lumina‑Image‑2.0**
- **Model development**: **Neta.art Lab (Civitai)**
- Core Trainer: **li_li** [Civitai](https://civitai.com/user/li_li) ・ [Hugging Face](https://huggingface.co/heziiiii)
<br>
- **Partners**
- **nebulae**: [Civitai](https://civitai.com/user/kitarz) ・ [Hugging Face](https://huggingface.co/NebulaeWis)
- **生姜**: [Hugging Face](https://huggingface.co/ssj0021)
- **孙一**
- [**narugo1992**](https://github.com/narugo1992) & [**deepghs**](https://huggingface.co/deepghs): open datasets, processing tools, and models
- [**Naifu**](https://github.com/Mikubill/naifu) trainer at [Mikubill](https://github.com/Mikubill)
<br>
# Community Contributors
- **Evaluators & developers**: [二小姐](https://huggingface.co/Second222), [spawner](https://github.com/spawner1145), [Rnglg2](https://civitai.com/user/Rnglg2)
- **Other contributors**: [沉迷摸鱼](https://www.pixiv.net/users/22433944), [poi](https://x.com/poi______1), AshenWitch, [十分无奈](https://www.pixiv.net/users/15750592), [GHOSTLX](https://civitai.com/user/ghostlxh), [wenaka](https://civitai.com/user/Wenaka_), [iiiiii](https://civitai.com/user/Blueberries_i), [年糕特工队](https://x.com/gaonian2331), [恩匹希](https://civitai.com/user/NPCde), 奶冻, [mumu](https://civitai.com/user/mumu520), [yizyin](https://civitai.com/user/yizyin), smile, Yang, 古神, 灵之药, [LyloGummy](https://civitai.com/user/LyloGummy), 雪时
<br>
# Appendix & Resources
- **TeaCache**: https://github.com/spawner1145/CUI-Lumina2-TeaCache
- **Advanced samplers & TeaCache guide (by spawner)**: https://docs.qq.com/doc/DZEFKb1ZrZVZiUmxw?nlc=1
- **Neta Lumina ComfyUI Manual (in Chinese)**: https://docs.qq.com/doc/DZEVQZFdtaERPdXVh
|