Skip to content

Commit 2b53ce0

Browse files
committed
Updated on 2024-08-31
1 parent 03c1f75 commit 2b53ce0

File tree

2 files changed

+10
-1
lines changed

2 files changed

+10
-1
lines changed

index.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@ <h3>
3939
When?
4040
</h3>
4141
<p>
42-
Last time this was edited was 2024-08-30 (YYYY/MM/DD).
42+
Last time this was edited was 2024-08-31 (YYYY/MM/DD).
4343
</p>
4444
<small><a href="misc.html">misc</a></small>
4545
</body>

papers/list.json

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,14 @@
11
[
22

3+
{
4+
"title": "Drawing Early-Bird Tickets: Towards More Efficient Training of Deep Networks",
5+
"author": "Haoran You et al",
6+
"year": "2020",
7+
"topic": "early-bird, lottery-hypothesis, pruning, low-precision",
8+
"venue": "ICLR",
9+
"description": "The authors show that there exist early-bird (EB) tickets: small, but critical subnetworks for dense randomly intialized networks, that can be found using low-cost training schemes (low precision, early stopping). They also design a practical low compute method for finding these. They use mask distance. Basically, for each pruning iteration, a binary mask is created. This mask represents which parts of the network are kept (the \"ticket\", or pruned subnet) and which parts are removed. They then consider the scaling factor \"r\" in BN layers as indicators of significance. This r is learned during training and is used to scale normalized activations. The magnitude of r is an indicator of how important the channel is to the network's performance. After deciding which channels to prune based on r, the binary mask is created. If the channel is kept (not pruned), marked as 1 in the mask. Else, 0. For any two subnets, they then compute the \"mask distance\" (AKA Hamming distance) between the two ticketmasks. They measure the mask distance between consequtive epochs and draw EB tickets when such distance is smaller than some threshold.",
10+
"link": "https://arxiv.org/pdf/1909.11957"
11+
},
312
{
413
"title": "Learning both Weights and Connections for Efficient Neural Networks",
514
"author": "Song Han et al",

0 commit comments

Comments
 (0)