UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation
Abstract
UniDream is a text-to-3D generation framework using unified diffusion priors to produce realistic 3D objects with accurate relighting capabilities and improved albedo textures.
Recent advancements in text-to-3D generation technology have significantly advanced the conversion of textual descriptions into imaginative well-geometrical and finely textured 3D objects. Despite these developments, a prevalent limitation arises from the use of RGB data in diffusion or reconstruction models, which often results in models with inherent lighting and shadows effects that detract from their realism, thereby limiting their usability in applications that demand accurate relighting capabilities. To bridge this gap, we present UniDream, a text-to-3D generation framework by incorporating unified diffusion priors. Our approach consists of three main components: (1) a dual-phase training process to get albedo-normal aligned multi-view diffusion and reconstruction models, (2) a progressive generation procedure for geometry and albedo-textures based on Score Distillation Sample (SDS) using the trained reconstruction and diffusion models, and (3) an innovative application of SDS for finalizing PBR generation while keeping a fixed albedo based on Stable Diffusion model. Extensive evaluations demonstrate that UniDream surpasses existing methods in generating 3D objects with clearer albedo textures, smoother surfaces, enhanced realism, and superior relighting capabilities.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D (2023)
- ControlDreamer: Stylized 3D Generation with Multi-View ControlNet (2023)
- MetaDreamer: Efficient Text-to-3D Creation With Disentangling Geometry and Texture (2023)
- Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors (2023)
- Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion (2023)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
 AK
							AK 
					 
					 
						 
					 
					