lamm-mit/SDXL-leaf-inspired

	@@ -148,4 +148,115 @@ grid
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/623ce1c6b66fedf374859fe7/R7sr9kAwZjRk_80oMY54h.png)
+## Fine-tuning script
+Download this script: [SDXL DreamBooth-LoRA_Fine-Tune.ipynb](https://huggingface.co/lamm-mit/SDXL-leaf-inspired/resolve/main/SDXL_DreamBooth_LoRA_Fine-Tune.ipynb)
+Create a local folder ```leaf_concept_dir_SDXL``` and copy the leaf images into it (they are provided in a subfolder of this repository).
+The code will automatically download the training script.
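
Before captioning, it can help to confirm that the folder exists and actually contains the images. The following is a minimal sketch, not part of the notebook: the helper name `list_training_images` and the set of accepted file extensions are assumptions made for illustration.

```python
from pathlib import Path

# Assumption: training images use one of these extensions.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def list_training_images(folder):
    """Create `folder` if it is missing and return its image files, sorted by name."""
    root = Path(folder)
    root.mkdir(parents=True, exist_ok=True)
    return sorted(
        path
        for path in root.iterdir()
        if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES
    )
```

Calling `list_training_images("./leaf_concept_dir_SDXL")` returns an empty list until the leaf images have been copied in, which makes a missing dataset easy to spot before training starts.
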
+The training script accepts a custom prompt for each image; here these prompts are generated with BLIP and prefixed with the instance prompt.
+For the images used here, the resulting captions include:
+```raw
+['<leaf microstructure>, a close up of a green plant with a lot of small holes',
+ '<leaf microstructure>, a close up of a leaf with a small insect on it',
+ '<leaf microstructure>, a close up of a plant with a lot of green leaves',
+ '<leaf microstructure>, a close up of a green plant with a yellow light',
+ '<leaf microstructure>, a close up of a green plant with a white center',
+ '<leaf microstructure>, arafed leaf with a white line on the center',
+ '<leaf microstructure>, a close up of a leaf with a yellow light shining through it',
+ '<leaf microstructure>, arafed image of a green plant with a yellow cross']
+```
+Training then proceeds as:
+```python
+HF_username = 'lamm-mit'
+pretrained_model_name_or_path="stabilityai/stable-diffusion-xl-base-1.0"
+pretrained_vae_model_name_or_path="madebyollin/sdxl-vae-fp16-fix"
+instance_prompt ="<leaf microstructure>"
+instance_data_dir = "./leaf_concept_dir_SDXL/"
+val_prompt = "a vase that resembles a <leaf microstructure>, high quality"
+val_epochs = 100
+instance_output_dir="leaf_LoRA_SDXL_V10" #for checkpointing
+```
+Dataset generation with custom per-image captions:
+```python
+import glob
+import json
+import os
+import torch
+from PIL import Image
+from transformers import AutoProcessor, BlipForConditionalGeneration
+device = "cuda" if torch.cuda.is_available() else "cpu"
+# load the processor and the captioning model
+blip_processor = AutoProcessor.from_pretrained("Salesforce/blip-image-captioning-large")
+blip_model = BlipForConditionalGeneration.from_pretrained("Salesforce/blip-image-captioning-large",torch_dtype=torch.float16).to(device)
+# collect (path, PIL.Image) pairs; assumes .jpg/.jpeg files sit directly inside instance_data_dir
+imgs_and_paths = [(path, Image.open(path).convert("RGB")) for path in sorted(glob.glob(f"{instance_data_dir}*.jp*g"))]
+# captioning utility: returns the BLIP caption for a single PIL image
+def caption_images(input_image):
+    inputs = blip_processor(images=input_image, return_tensors="pt").to(device, torch.float16)
+    pixel_values = inputs.pixel_values
+    generated_ids = blip_model.generate(pixel_values=pixel_values, max_length=50)
+    generated_caption = blip_processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
+    return generated_caption
+# prefix every caption with the instance prompt and write one JSON object per line
+caption_prefix = f"{instance_prompt}, "
+with open(f'{instance_data_dir}metadata.jsonl', 'w') as outfile:
+    for img_path, image in imgs_and_paths:
+        caption = caption_prefix + caption_images(image).split("\n")[0]
+        entry = {"file_name": os.path.basename(img_path), "prompt": caption}
+        json.dump(entry, outfile)
+        outfile.write('\n')
+```
+This produces a JSON Lines file, ```metadata.jsonl```, in the ```instance_data_dir``` directory, with one entry per image:
+```json
+{"file_name": "0.jpeg", "prompt": "<leaf microstructure>, a close up of a green plant with a lot of small holes"}
+{"file_name": "1.jpeg", "prompt": "<leaf microstructure>, a close up of a leaf with a small insect on it"}
+{"file_name": "2.jpeg", "prompt": "<leaf microstructure>, a close up of a plant with a lot of green leaves"}
+{"file_name": "3.jpeg", "prompt": "<leaf microstructure>, a close up of a leaf with a yellow substance in it"}
+{"file_name": "87.jpg", "prompt": "<leaf microstructure>, a close up of a green plant with a yellow light"}
+{"file_name": "88.jpg", "prompt": "<leaf microstructure>, a close up of a green plant with a white center"}
+{"file_name": "90.jpg", "prompt": "<leaf microstructure>, arafed leaf with a white line on the center"}
+{"file_name": "91.jpg", "prompt": "<leaf microstructure>, arafed image of a green leaf with a white spot"}
+{"file_name": "92.jpg", "prompt": "<leaf microstructure>, a close up of a leaf with a yellow light shining through it"}
+{"file_name": "94.jpg", "prompt": "<leaf microstructure>, arafed image of a green plant with a yellow cross"}
+```
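
Since training reads its captions from this file, a quick consistency check can catch a missing image or a caption without the instance prompt. The sketch below is an illustration rather than part of the notebook; the function name `check_metadata` is hypothetical, and it assumes the layout shown above (a `file_name` and a `prompt` key on every line, images stored next to `metadata.jsonl`).

```python
import json
from pathlib import Path


def check_metadata(data_dir, instance_prompt):
    """Return a list of problems found in <data_dir>/metadata.jsonl (empty if none)."""
    root = Path(data_dir)
    problems = []
    with open(root / "metadata.jsonl", encoding="utf-8") as handle:
        for line_no, line in enumerate(handle, start=1):
            if not line.strip():
                continue  # tolerate blank lines
            entry = json.loads(line)
            file_name = entry.get("file_name")
            prompt = entry.get("prompt", "")
            # every entry should point at an image that exists in the folder
            if not file_name or not (root / file_name).is_file():
                problems.append(f"line {line_no}: missing image {file_name!r}")
            # every caption should start with the instance prompt
            if not prompt.startswith(instance_prompt):
                problems.append(f"line {line_no}: prompt lacks the instance prompt")
    return problems
```

For the dataset above, `check_metadata("./leaf_concept_dir_SDXL/", "<leaf microstructure>")` should return an empty list once all images are in place.
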
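+The training run is then launched from the notebook; the ```{...}``` placeholders are filled in from the Python variables defined above:
+```raw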
+!accelerate launch train_dreambooth_lora_sdxl.py \
+  --pretrained_model_name_or_path="{pretrained_model_name_or_path}" \
+  --pretrained_vae_model_name_or_path="{pretrained_vae_model_name_or_path}" \
+  --dataset_name="{instance_data_dir}" \
+  --output_dir="{instance_output_dir}" \
+  --caption_column="prompt" \
+  --mixed_precision="fp16" \
+  --instance_prompt="{instance_prompt}" \
+  --validation_prompt="{val_prompt}" \
+  --validation_epochs="{val_epochs}" \
+  --resolution=1024 \
+  --train_batch_size=1 \
+  --gradient_accumulation_steps=3 \
+  --gradient_checkpointing \
+  --learning_rate=1e-4 \
+  --snr_gamma=5.0 \
+  --lr_scheduler="constant" \
+  --lr_warmup_steps=0 \
+  --use_8bit_adam \
+  --max_train_steps=500 \
+  --checkpointing_steps=500 \
+  --seed="0"
+```
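
To see what these settings imply, the short calculation below estimates the effective batch size and the number of epochs. It is a rough sketch under two assumptions: that the dataset holds the 10 images listed in ```metadata.jsonl```, and that one optimizer update is counted per group of ```gradient_accumulation_steps``` batches, with a partial group at the end of an epoch rounded up. The function name `training_schedule` is hypothetical.

```python
import math


def training_schedule(num_images, train_batch_size, gradient_accumulation_steps, max_train_steps):
    """Estimate effective batch size, optimizer updates per epoch, and total epochs."""
    batches_per_epoch = math.ceil(num_images / train_batch_size)
    # assumption: a partial accumulation group at the end of an epoch counts as one update
    updates_per_epoch = math.ceil(batches_per_epoch / gradient_accumulation_steps)
    return {
        "effective_batch_size": train_batch_size * gradient_accumulation_steps,
        "updates_per_epoch": updates_per_epoch,
        "epochs": math.ceil(max_train_steps / updates_per_epoch),
    }


schedule = training_schedule(
    num_images=10,                  # entries in metadata.jsonl above
    train_batch_size=1,
    gradient_accumulation_steps=3,
    max_train_steps=500,
)
print(schedule)
# {'effective_batch_size': 3, 'updates_per_epoch': 4, 'epochs': 125}
```

Under these assumptions, the 500 update steps correspond to roughly 125 passes over the 10 images, with gradients accumulated over 3 images per update.
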