How to use MOCO for a segmentation task?

Thank you for sharing code for moco. I have two questions for using MOCO in a segmentation task?

1. In a segmentation task, the input of a network is H\*W\*3, and the last two feature map should be: H\*W\*C ---> H\*W\*1. The last feature map of segmentation task is two dimension rather than one dimension in classification task. How can we calculate the InfoNCE loss for this two dimension feature map?
If the feature map is one dimension, the f_q and f_k are both 1\*n, and f_q\*f_k.T will return one value. But if the f_q and f_k are m\*n, the f_q\*f_k.T will return a matrix.
In my thought, we can transform the f_q/f_k from m\*n to 1\*mn, and then f_q\*f_k.T will return one value. Am I right? Please correct me if my thought is wrong.

2. In the classification task, the output should be the same for any augmented image. For a image X and a rotated image Y, the output is feature_X, feature_Y, and feature_X should be equal as feature_Y because they are the same class. In this way, feature_X\*feature_Y.T should return 1.
However, if imageY=rotation(imageX), feature_Y should be rotation(feature_X). Then, feature_X\*feature_Y.T should not be 1. How can we deal with this problem? Or should we inversely rotate feature_Y as irotated_feature_Y, and calculate feature_X\*irotated_feature_Y.T?

Any suggestion is appreciated.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to use MOCO for a segmentation task? #27

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

How to use MOCO for a segmentation task? #27

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions