Update README.md
Browse files
README.md
CHANGED
|
@@ -25,6 +25,89 @@ The Kontext-CAM-Left-View is an experimental adapter for black-forest-lab's FLUX
|
|
| 25 |
|
| 26 |
> You modified the prompt, altering its properties and subjective elements. Note: this is an experimental adapter and may contain artifacts.
|
| 27 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 28 |
## Trigger words
|
| 29 |
|
| 30 |
You should use `[photo content]` to trigger the image generation.
|
|
|
|
| 25 |
|
| 26 |
> You modified the prompt, altering its properties and subjective elements. Note: this is an experimental adapter and may contain artifacts.
|
| 27 |
|
| 28 |
+
---
|
| 29 |
+
|
| 30 |
+
## **Sample Inferences : Demo**
|
| 31 |
+
|
| 32 |
+
<table style="width:100%; border-collapse:collapse;">
|
| 33 |
+
<tr>
|
| 34 |
+
<td style="width:50%; text-align:center;">
|
| 35 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/lZ8asnkoamFUH1ClFgn6H.jpeg"
|
| 36 |
+
alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/>
|
| 37 |
+
</td>
|
| 38 |
+
<td style="width:50%; text-align:center;">
|
| 39 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/F92WuRNLReDYS-nXXBLUz.webp"
|
| 40 |
+
alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/>
|
| 41 |
+
</td>
|
| 42 |
+
</tr>
|
| 43 |
+
</table>
|
| 44 |
+
|
| 45 |
+
<table style="width:100%; border-collapse:collapse;">
|
| 46 |
+
<tr>
|
| 47 |
+
<td style="width:50%; text-align:center;">
|
| 48 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/Txk4Mnk7q6wkGFdpe276J.jpeg"
|
| 49 |
+
alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/>
|
| 50 |
+
</td>
|
| 51 |
+
<td style="width:50%; text-align:center;">
|
| 52 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/uHdGFGI-4plezer-JSAjs.webp"
|
| 53 |
+
alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/>
|
| 54 |
+
</td>
|
| 55 |
+
</tr>
|
| 56 |
+
</table>
|
| 57 |
+
|
| 58 |
+
---
|
| 59 |
+
|
| 60 |
+
## Parameter Settings
|
| 61 |
+
|
| 62 |
+
| Setting | Value |
|
| 63 |
+
| ------------------------ | ------------------------ |
|
| 64 |
+
| Module Type | Adapter |
|
| 65 |
+
| Base Model | FLUX.1 Kontext Dev - fp8 |
|
| 66 |
+
| Trigger Words | [photo content], render the image from the left-side perspective, keeping consistent lighting, textures, and proportions. Maintain the realism of all surrounding elements while revealing previously unseen left-side details consistent with the object’s or scene’s structure. |
|
| 67 |
+
| Image Processing Repeats | 42 |
|
| 68 |
+
| Epochs | 22 |
|
| 69 |
+
| Save Every N Epochs | 1 |
|
| 70 |
+
|
| 71 |
+
Labeling: DeepCaption-VLA-7B(natural language & English)
|
| 72 |
+
|
| 73 |
+
Total Images Used for Training : 800 Image Pairs (400 Start, 400 End)
|
| 74 |
+
|
| 75 |
+
## Training Parameters
|
| 76 |
+
|
| 77 |
+
| Setting | Value |
|
| 78 |
+
| --------------------------- | --------- |
|
| 79 |
+
| Seed | - |
|
| 80 |
+
| Clip Skip | - |
|
| 81 |
+
| Text Encoder LR | 0.00001 |
|
| 82 |
+
| UNet LR | 0.00005 |
|
| 83 |
+
| LR Scheduler | constant |
|
| 84 |
+
| Optimizer | AdamW8bit |
|
| 85 |
+
| Network Dimension | 64 |
|
| 86 |
+
| Network Alpha | 32 |
|
| 87 |
+
| Gradient Accumulation Steps | - |
|
| 88 |
+
|
| 89 |
+
## Label Parameters
|
| 90 |
+
|
| 91 |
+
| Setting | Value |
|
| 92 |
+
| --------------- | ----- |
|
| 93 |
+
| Shuffle Caption | - |
|
| 94 |
+
| Keep N Tokens | - |
|
| 95 |
+
|
| 96 |
+
## Advanced Parameters
|
| 97 |
+
|
| 98 |
+
| Setting | Value |
|
| 99 |
+
| ------------------------- | ----- |
|
| 100 |
+
| Noise Offset | 0.03 |
|
| 101 |
+
| Multires Noise Discount | 0.1 |
|
| 102 |
+
| Multires Noise Iterations | 10 |
|
| 103 |
+
| Conv Dimension | - |
|
| 104 |
+
| Conv Alpha | - |
|
| 105 |
+
| Batch Size | - |
|
| 106 |
+
| Steps | 3300 & 400(warm up) |
|
| 107 |
+
| Sampler | euler |
|
| 108 |
+
|
| 109 |
+
---
|
| 110 |
+
|
| 111 |
## Trigger words
|
| 112 |
|
| 113 |
You should use `[photo content]` to trigger the image generation.
|