-
Notifications
You must be signed in to change notification settings - Fork 275
Open
Description
hi @nmaac @megvii-model , in ShuffleNetV1, there are two different modifications compared paper with this
one is for channel shuffle before/after dwconv, this is fixed in #16 #40
two is the usage of relu in shufflenetv1_unit.py here
if self.stride == 1:
return F.relu(x + x_proj)
elif self.stride == 2:
return torch.cat((self.branch_proj(x_proj), F.relu(x)), 1)
when stride=2, there is different with paper description, and this also different with our common usage. I want to know this is better design or just a mistake. Look forward to your reply
Metadata
Metadata
Assignees
Labels
No labels