+ "description": "This paper introduces weight normalization, a simple reparameterization technique that decouples a neural network's weight vectors into their direction and magnitude by expressing w = (g/||v||)v, where g is a scalar and v is a vector. The key insight is that this decoupling improves optimization by making the conditioning of the gradient better - the direction and scale of weight updates can be learned somewhat independently, which helps avoid problems with pathological curvature in the optimization landscape. While inspired by batch normalization, weight normalization is deterministic and doesn't add noise to gradients or create dependencies between minibatch examples, making it well-suited for scenarios like reinforcement learning and RNNs where batch normalization is problematic. The authors also propose a data-dependent initialization scheme where g and bias terms are initialized to normalize the initial pre-activations of neurons, helping ensure good scaling of activations across layers at the start of training.",