You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
incorporate the findings of Ting Chens new paper, ability to noise at higher levels (and normalize variance of noised image automatically). they show good results using RIN at higher resolution images using this technique. also redo all the schedules to be gamma-centric
Copy file name to clipboardExpand all lines: README.md
+14-2Lines changed: 14 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -12,6 +12,8 @@ Additionally, we will try adding an extra linear attention on the main branch as
12
12
13
13
The insight of being able to self-condition on any hidden state of the network as well as the newly proposed sigmoid noise schedule are the two main findings.
14
14
15
+
This repository also contains the ability to <ahref="https://arxiv.org/abs/2301.10972">noise higher resolution images more</a>, using the `scale` keyword argument on the `GaussianDiffusion` class. It also contains the simple linear gamma schedule proposed in that paper.
16
+
15
17
## Appreciation
16
18
17
19
- <ahref="https://stability.ai/">Stability.ai</a> for the generous sponsorship to work on cutting edge artificial intelligence research
@@ -42,7 +44,8 @@ diffusion = GaussianDiffusion(
42
44
image_size=128,
43
45
use_ddim=False,
44
46
timesteps=400,
45
-
train_prob_self_cond=0.9# how often to self condition on latents
47
+
train_prob_self_cond=0.9, # how often to self condition on latents
48
+
scale=1. # this will be set to < 1. for more noising and leads to better convergence when training on higher resolution images (512, 1024) - input noised images will be auto variance normalized
46
49
).cuda()
47
50
48
51
trainer = Trainer(
@@ -81,7 +84,8 @@ diffusion = GaussianDiffusion(
81
84
model,
82
85
image_size=128,
83
86
timesteps=1000,
84
-
train_prob_self_cond=0.9
87
+
train_prob_self_cond=0.9,
88
+
scale=1.
85
89
)
86
90
87
91
training_images = torch.randn(8, 3, 128, 128) # images are normalized from 0 to 1
0 commit comments