Question about inference and auroc measure

The way the inference.py script generates predicted labels does not seem to account for multi-label classification. The ground truth labels contain probabilities for multiple lesions (this is a multi-label classification task), but the script uses `np.argmax` to select only a single lesion out of the 14 possible lesions in the ChestX-ray14 dataset.

In summary, is the inference.py script wrong for this task? Or is the task itself not intended to be multi-label classification?
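
To make the concern concrete, here is a minimal sketch of the difference between the two decoding strategies. It is not the code from inference.py: the arrays, the 0.5 threshold, and the `binary_auroc` helper are all hypothetical, and only 3 classes are used instead of 14 to keep it short.

```python
import numpy as np

# Hypothetical toy data: 4 images, 3 classes (the real task has 14).
# Rows are images, columns are classes; an image may have 0, 1, or several labels.
y_true = np.array([
    [1, 0, 1],
    [0, 1, 0],
    [1, 1, 0],
    [0, 0, 0],
])

# Hypothetical per-class scores, e.g. independent sigmoid outputs.
y_score = np.array([
    [0.9, 0.2, 0.7],
    [0.1, 0.8, 0.3],
    [0.6, 0.7, 0.4],
    [0.2, 0.1, 0.1],
])

# Single-label decoding: argmax forces exactly one class per image.
argmax_pred = np.zeros_like(y_true)
argmax_pred[np.arange(len(y_score)), y_score.argmax(axis=1)] = 1

# Multi-label decoding: each class is thresholded independently
# (0.5 is an assumed threshold, not one taken from the repository).
threshold_pred = (y_score >= 0.5).astype(int)


def binary_auroc(labels, scores):
    """AUROC for one class: the fraction of (positive, negative) pairs
    ranked correctly, with ties counted as half."""
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    if len(pos) == 0 or len(neg) == 0:
        return float("nan")  # undefined when only one class is present
    diff = pos[:, None] - neg[None, :]
    return float(((diff > 0).sum() + 0.5 * (diff == 0).sum()) / diff.size)


# Per-class AUROC is computed from the raw scores, not from decoded labels.
per_class_auroc = [
    binary_auroc(y_true[:, c], y_score[:, c]) for c in range(y_true.shape[1])
]

print(argmax_pred)
print(threshold_pred)
print(per_class_auroc)
```

In this toy case the argmax decoding always assigns exactly one label, so it drops the second lesion on images 1 and 3 and invents one for the image with no findings, while per-class thresholding can recover every label. AUROC itself only needs the per-class scores, so it can be computed without either decoding step.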

Question about inference and auroc measure #7

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Question about inference and auroc measure #7

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions