--- base_model: - black-forest-labs/FLUX.1-dev license: apache-2.0 pipeline_tag: text-to-3d tags: - panorama - generation - perception - flow-matching - text-to-image - image-to-image library_name: diffusers --- # OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes We present [OmniX](https://arxiv.org/abs/2510.26800), a family of panoramic flow matching models for unified panorama generation, perception, and completion.
We introduce OmniX, a family of flow matching generative models that achieves unified panorama perception, generation, and completion. Using OmniX as a world generator, we can create graphics-ready 3D scenes ready for physically based rendering, relighting, and simualtion.
### Panorama Perception
Given an RGB panorama as input, OmniX can predict geometric, intrinsic, and semantic properties:
```bash
# Perception (Distance, Normal, Albedo, Roughness, Metallic, Semantic) from Panorama
python run_pano_perception.py --panorama "assets/examples/panorama.png" --output_dir "outputs/perception_from_panorama"
```
### Panorama Generation and Perception
Naturally, we can combine panorama generation and perception to obtain a panoramic image with multiple property annotations:
```bash
# Generation and Perception from Text
python run_pano_all.py --prompt "Photorealistic modern living room" --output_dir "outputs/generation_and_perception_from_text"
# Generation and Perception from Image and Text
python run_pano_all.py --image "assets/examples/image.png" --prompt "Photorealistic modern living room" --output_dir "outputs/generation_and_perception_from_image_and_text"
```
### Graphics-Ready Scene Generation (Beta)
Note that the code for graphics-ready scene reconstruction/generation is not ready and is still in progress.
```bash
# Generation from Text
python run_scene_generation.py --prompt "Photorealistic modern living room" --output_dir "outputs/construction_from_text"
# Generation from Text (Fast)
python run_scene_generation.py --prompt "Photorealistic modern living room" --output_dir "outputs/construction_fast_from_text" --rgb_as_albedo --disable_normal --use_default_pbr --fill_invalid_depth
# Generation from Image and Text
python run_scene_generation.py --image "assets/examples/image.png" --prompt "Photorealistic modern living room" --output_dir "outputs/construction_from_image_and_text"
# Generation from Image and Text (Fast)
python run_scene_generation.py --image "assets/examples/image.png" --prompt "Photorealistic modern living room" --output_dir "outputs/construction_fast_from_image_and_text" --rgb_as_albedo --disable_normal --use_default_pbr --fill_invalid_depth
# Generation from Panorama
python run_scene_generation.py --panorama "assets/examples/panorama.png" --output_dir "outputs/construction_from_panorama"
# Generation from Panorama (Fast)
python run_scene_generation.py --panorama "assets/examples/panorama.png" --output_dir "outputs/construction_fast_from_panorama" --rgb_as_albedo --disable_normal --use_default_pbr --fill_invalid_depth
```
## 👏 Acknowledgement
This repository is based on many amazing research works and open-source projects: [PanFusion](https://github.com/chengzhag/PanFusion), [DreamCube](https://github.com/Yukun-Huang/DreamCube), [WorldGen](https://github.com/ZiYang-xie/WorldGen), [diffusers](https://github.com/huggingface/diffusers), [equilib](https://github.com/haruishi43/equilib), etc. Thanks all the authors for their selfless contributions to the community!
## 😉 Citation
If you find this repository helpful for your work, please consider citing it as follows:
```bib
@article{omnix,
title={OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes},
author={Huang, Yukun and Yu, Jiwen and Zhou, Yanning and Wang, Jianan and Wang, Xintao and Wan, Pengfei and Liu, Xihui},
journal={arXiv preprint arXiv:2510.26800},
year={2025}
}
```