Skip to content

Question about inference and auroc measure #7

@reflelia

Description

@reflelia

The process used to generate predicted labels in the inference.py script does not seem to take multi-label classification into account. The ground truth labels contain probabilities for multiple lesions (since this is a multi-label classification task), but the script uses np.argmax to select only a single lesion out of 14 possible lesions (ChestX-ray 14 dataset).

In summary, is the inference.py script wrong for this task? Or is the task itself not intended to be multi-label classification?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions