Describe the bug
Pointing out a few things:
- A) it looks like the val split in the current YAML config of EfficientAd is cheating : )
- B) in its Lightning module, the validation phase does all the work at the start, then behaves like a test phase
- C) the set of images used for the penalty in $L_{\text{ST}}$ is a small sample of ImageNet
A) normalization params should depend on a split from the train set
The default YAML config uses a random split from the test set, which is usual for models that only do evaluation on the val set. But in EfficientAd (from http://arxiv.org/abs/2303.14535, Section 3.4, page 6, left column), the validation set is like a 2nd phase (or 3rd, counting the distillation) of the training: it is where the quantiles used to normalize the anomaly maps are estimated.
It should be possible to split the train set and use it there, as in the sketch below.
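A rough sketch of what I mean, in plain torch (the helper name and the defaults are mine, not anomalib's API):

```python
import torch
from torch.utils.data import DataLoader, Dataset, random_split


def split_train_for_validation(
    train_dataset: Dataset,
    val_fraction: float = 0.1,
    seed: int = 42,
) -> tuple[DataLoader, DataLoader]:
    """Reserve part of the (normal-only) train set for the map-normalization
    phase, instead of borrowing images from the test set."""
    n_val = max(1, int(len(train_dataset) * val_fraction))
    n_train = len(train_dataset) - n_val
    train_subset, val_subset = random_split(
        train_dataset,
        [n_train, n_val],
        generator=torch.Generator().manual_seed(seed),  # reproducible split
    )
    train_loader = DataLoader(train_subset, batch_size=16, shuffle=True)
    val_loader = DataLoader(val_subset, batch_size=16)  # only for fitting the quantiles
    return train_loader, val_loader
```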
B) validation start/step design is weird
I think Lightning's design exists essentially to avoid hand-written loops over the batches. By implementing validation_step(batch) you define what the model should do with that data -- which, for EfficientAd, is to find the normalization quantiles. Instead, the current implementation does everything at validation start in the function map_norm_quantiles(), which re-does this loop-over-batches structure, while validation_step() is just predicting (which would rather happen in test_step() or predict_step()).
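For reference, a minimal sketch of the shape I'd expect, assuming the torch model returns one anomaly map per image (class and attribute names here are mine; this is a simplification, not a patch):

```python
import torch
from lightning.pytorch import LightningModule


class EfficientAdValidationSketch(LightningModule):
    """Sketch: Lightning drives the batch loop; validation_step() handles
    one batch, on_validation_epoch_end() fits the normalization quantiles."""

    def __init__(self, model: torch.nn.Module) -> None:
        super().__init__()
        self.model = model  # assumption: returns one anomaly map per image

    def on_validation_epoch_start(self) -> None:
        self._val_maps: list[torch.Tensor] = []

    def validation_step(self, batch: dict, batch_idx: int) -> None:
        # One batch per call -- no explicit loop, no tqdm inside the model.
        anomaly_map = self.model(batch["image"])
        self._val_maps.append(anomaly_map.flatten().cpu())

    def on_validation_epoch_end(self) -> None:
        maps = torch.cat(self._val_maps)
        # The paper's 0.9 / 0.995 quantiles for map normalization.
        self.q_start = torch.quantile(maps, 0.9)
        self.q_end = torch.quantile(maps, 0.995)
```

The point is that Lightning drives the loop: each batch passes through validation_step() exactly once, progress reporting comes from the trainer, and the quantile fitting happens once at the end of the epoch.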
Side note: is it a good idea to have tqdm inside the model?
C) small ImageNet
The data used in the penalty term of the student's loss function should be the same as the data used for the teacher distillation (ImageNet in the paper).
The code is currently downloading a reduced version with 10 classes.
The user should be able to point it to a custom folder (with ImageNet already set up), e.g. as sketched below.
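Something along these lines would cover it; `imagenet_dir` is a hypothetical parameter, not something anomalib exposes today, and the transform is deliberately minimal:

```python
from pathlib import Path

from torch.utils.data import DataLoader
from torchvision import transforms
from torchvision.datasets import ImageFolder


def make_penalty_loader(
    imagenet_dir: str,
    image_size: int = 256,
    batch_size: int = 16,
) -> DataLoader:
    """Build the loader feeding the L_ST penalty term from a user-provided
    ImageNet folder (standard class-subfolder layout) instead of the
    auto-downloaded 10-class subset."""
    transform = transforms.Compose([
        transforms.Resize((image_size, image_size)),
        transforms.ToTensor(),
    ])
    dataset = ImageFolder(root=str(Path(imagenet_dir)), transform=transform)
    return DataLoader(dataset, batch_size=batch_size, shuffle=True)
```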
Code of Conduct
- I agree to follow this project's Code of Conduct