Update README.md
Browse files
README.md
CHANGED
|
@@ -9,7 +9,7 @@ tags:
|
|
| 9 |
|
| 10 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 11 |
This model is a 3D VAE that encodes video into a compact latent space conditioned on a content frame. It compresses a video by a factor of \\(\frac{T}{4}\times\frac{H}{32}\times\frac{W}{32}\\), enabling 4096x downsampling.
|
| 12 |
-
It is part of the [Reducio-DiT](https://arxiv.org/abs/
|
| 13 |
|
| 14 |
|
| 15 |
## Model Details
|
|
@@ -19,7 +19,7 @@ It is part of the [Reducio-DiT](https://arxiv.org/abs/xxxx), which is a video ge
|
|
| 19 |
<!-- Provide the basic links for the model. -->
|
| 20 |
|
| 21 |
- **Repository:** [GitHub Repository](https://github.com/microsoft/Reducio-VAE)
|
| 22 |
-
- **Paper:** [arXiv](https://arxiv.org/abs/
|
| 23 |
|
| 24 |
## Uses
|
| 25 |
|
|
@@ -62,7 +62,7 @@ Metrics on 1K Pexels validation set and UCF-101:
|
|
| 62 |
@article{tian2024reducio,
|
| 63 |
title={REDUCIO! Generating 1024*1024 Video within 16 Seconds using Extremely Compressed Motion Latents},
|
| 64 |
author={Tian, Rui and Dai, Qi and Bao, Jianmin and Qiu, Kai and Yang, Yifan and Luo, Chong and Wu, Zuxuan and Jiang, Yu-Gang},
|
| 65 |
-
journal={arXiv preprint arXiv:
|
| 66 |
year={2024}
|
| 67 |
}
|
| 68 |
```
|
|
|
|
| 9 |
|
| 10 |
<!-- Provide a quick summary of what the model is/does. -->
|
| 11 |
This model is a 3D VAE that encodes video into a compact latent space conditioned on a content frame. It compresses a video by a factor of \\(\frac{T}{4}\times\frac{H}{32}\times\frac{W}{32}\\), enabling 4096x downsampling.
|
| 12 |
+
It is part of the [Reducio-DiT](https://arxiv.org/abs/2411.13552), which is a video generation method. Codebase available [here](https://github.com/microsoft/Reducio-VAE).
|
| 13 |
|
| 14 |
|
| 15 |
## Model Details
|
|
|
|
| 19 |
<!-- Provide the basic links for the model. -->
|
| 20 |
|
| 21 |
- **Repository:** [GitHub Repository](https://github.com/microsoft/Reducio-VAE)
|
| 22 |
+
- **Paper:** [arXiv](https://arxiv.org/abs/2411.13552)
|
| 23 |
|
| 24 |
## Uses
|
| 25 |
|
|
|
|
| 62 |
@article{tian2024reducio,
|
| 63 |
title={REDUCIO! Generating 1024*1024 Video within 16 Seconds using Extremely Compressed Motion Latents},
|
| 64 |
author={Tian, Rui and Dai, Qi and Bao, Jianmin and Qiu, Kai and Yang, Yifan and Luo, Chong and Wu, Zuxuan and Jiang, Yu-Gang},
|
| 65 |
+
journal={arXiv preprint arXiv:2411.13552},
|
| 66 |
year={2024}
|
| 67 |
}
|
| 68 |
```
|