Is the '''image-level feature''' implemented incorrectly? It should be a global feature, computed by pooling over the whole feature map and then unpooling back to the original spatial size.
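
To make the question concrete, here is a minimal, dependency-free sketch of what I mean by a global feature with pooling and unpooling. It assumes the feature map is a C x H x W nested list, that the pooling is a global average, and that the unpooling simply replicates each pooled value to every spatial position (a real implementation might use bilinear upsampling and a 1x1 convolution instead). The function names `global_average_pool`, `unpool_to_size`, and `image_level_feature` are hypothetical and are not taken from the code in question.

```python
def global_average_pool(feature_map):
    """Reduce a C x H x W feature map (nested lists) to one mean per channel."""
    pooled = []
    for channel in feature_map:
        total = sum(sum(row) for row in channel)
        count = sum(len(row) for row in channel)
        pooled.append(total / count)
    return pooled


def unpool_to_size(pooled, height, width):
    """Replicate each per-channel value over a full H x W map (assumed unpooling)."""
    return [[[value] * width for _ in range(height)] for value in pooled]


def image_level_feature(feature_map):
    """Global pooling followed by unpooling back to the input's spatial size."""
    height = len(feature_map[0])
    width = len(feature_map[0][0])
    pooled = global_average_pool(feature_map)
    return unpool_to_size(pooled, height, width)


if __name__ == "__main__":
    # Two channels, 2 x 2 spatial size.
    fm = [
        [[1.0, 2.0], [3.0, 4.0]],
        [[0.0, 0.0], [0.0, 8.0]],
    ]
    print(image_level_feature(fm))
```

With this definition, every spatial position in a channel carries the same value, so the result depends on the whole image rather than on a local window.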