Update README.md
Browse files
README.md
CHANGED
|
@@ -1,43 +1,8 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: openrail++
|
| 3 |
-
tags:
|
| 4 |
-
- stable-diffusion
|
| 5 |
-
inference: false
|
| 6 |
-
---
|
| 7 |
-
|
| 8 |
-
# Stable Diffusion x4 upscaler model card
|
| 9 |
-
This model card focuses on the model associated with the Stable Diffusion Upscaler, available [here](https://github.com/Stability-AI/stablediffusion).
|
| 10 |
-
This model is trained for 1.25M steps on a 10M subset of LAION containing images `>2048x2048`. The model was trained on crops of size `512x512` and is a text-guided [latent upscaling diffusion model](https://arxiv.org/abs/2112.10752).
|
| 11 |
-
In addition to the textual input, it receives a `noise_level` as an input parameter, which can be used to add noise to the low-resolution input according to a [predefined diffusion schedule](configs/stable-diffusion/x4-upscaling.yaml).
|
| 12 |
-
|
| 13 |
-

|
| 14 |
-
|
| 15 |
-
- Use it with the [`stablediffusion`](https://github.com/Stability-AI/stablediffusion) repository: download the `x4-upscaler-ema.ckpt` [here](https://huggingface.co/stabilityai/stable-diffusion-x4-upscaler/resolve/main/x4-upscaler-ema.ckpt).
|
| 16 |
-
- Use it with 🧨 [`diffusers`](https://huggingface.co/stabilityai/stable-diffusion-x4-upscaler#examples)
|
| 17 |
-
|
| 18 |
-
|
| 19 |
-
## Model Details
|
| 20 |
-
- **Developed by:** Robin Rombach, Patrick Esser
|
| 21 |
-
- **Model type:** Diffusion-based text-to-image generation model
|
| 22 |
-
- **Language(s):** English
|
| 23 |
-
- **License:** [CreativeML Open RAIL++-M License](https://huggingface.co/stabilityai/stable-diffusion-2/blob/main/LICENSE-MODEL)
|
| 24 |
-
- **Model Description:** This is a model that can be used to generate and modify images based on text prompts. It is a [Latent Diffusion Model](https://arxiv.org/abs/2112.10752) that uses a fixed, pretrained text encoder ([OpenCLIP-ViT/H](https://github.com/mlfoundations/open_clip)).
|
| 25 |
-
- **Resources for more information:** [GitHub Repository](https://github.com/Stability-AI/).
|
| 26 |
-
- **Cite as:**
|
| 27 |
-
|
| 28 |
-
@InProceedings{Rombach_2022_CVPR,
|
| 29 |
-
author = {Rombach, Robin and Blattmann, Andreas and Lorenz, Dominik and Esser, Patrick and Ommer, Bj\"orn},
|
| 30 |
-
title = {High-Resolution Image Synthesis With Latent Diffusion Models},
|
| 31 |
-
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
|
| 32 |
-
month = {June},
|
| 33 |
-
year = {2022},
|
| 34 |
-
pages = {10684-10695}
|
| 35 |
-
}
|
| 36 |
|
| 37 |
|
| 38 |
## Examples
|
| 39 |
|
| 40 |
-
Using the [🤗's Diffusers library](https://github.com/huggingface/diffusers)
|
| 41 |
|
| 42 |
```bash
|
| 43 |
pip install diffusers transformers accelerate scipy safetensors
|
|
@@ -47,12 +12,12 @@ pip install diffusers transformers accelerate scipy safetensors
|
|
| 47 |
import requests
|
| 48 |
from PIL import Image
|
| 49 |
from io import BytesIO
|
| 50 |
-
from diffusers import
|
| 51 |
import torch
|
| 52 |
|
| 53 |
# load model and scheduler
|
| 54 |
-
model_id = "
|
| 55 |
-
pipeline =
|
| 56 |
pipeline = pipeline.to("cuda")
|
| 57 |
|
| 58 |
# let's download an image
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
|
| 2 |
|
| 3 |
## Examples
|
| 4 |
|
| 5 |
+
Using the [🤗's Diffusers library](https://github.com/huggingface/diffusers) in a simple and efficient manner.
|
| 6 |
|
| 7 |
```bash
|
| 8 |
pip install diffusers transformers accelerate scipy safetensors
|
|
|
|
| 12 |
import requests
|
| 13 |
from PIL import Image
|
| 14 |
from io import BytesIO
|
| 15 |
+
from diffusers import StableDiffusionUpscaleLDM3DPipeline
|
| 16 |
import torch
|
| 17 |
|
| 18 |
# load model and scheduler
|
| 19 |
+
model_id = "Intel/ldm3d-hr"
|
| 20 |
+
pipeline = StableDiffusionUpscaleLDM3DPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
|
| 21 |
pipeline = pipeline.to("cuda")
|
| 22 |
|
| 23 |
# let's download an image
|