You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
According to Table 2a and Table 3 in the paper of ViTDet, the bottleneck seems to be an alternative to global attention. Especially in Table 3, the way of bottleneck use less memory and less time than the way of global. However, in the code of ViTDet, the bottleneck seems to be an option instead of an alternative, meaning that the global attn will always be performed.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
According to Table 2a and Table 3 in the paper of ViTDet, the bottleneck seems to be an alternative to global attention. Especially in Table 3, the way of bottleneck use less memory and less time than the way of global. However, in the code of ViTDet, the bottleneck seems to be an option instead of an alternative, meaning that the global attn will always be performed.
detectron2/detectron2/modeling/backbone/vit.py
Lines 211 to 230 in 2b98c27
Table 3:
Beta Was this translation helpful? Give feedback.
All reactions