Tutorial 9: One dataset for test & dev #3586
-
Hello, in Tutorial 9 when training a retriever with data from facebook-ai there is only one dataset for testing and evaluation. Is there a reason why? They are defined as:
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hi @nicir, I think it is a common practice to get validation accuracy. In most real-world applications/competitions, test data doesn't come with labels and it is just used for generating model inferences that would be evaluated later by humans or automatically in case of competitions. Dev data is anyway not used for loss computation, so there's no harm in using it for testing the model's accuracy. Although, it should be the same as dev/validation accuracy. Does this answer your question? Please let me know if I misunderstood anything :) |
Beta Was this translation helpful? Give feedback.
Hi @nicir,
I think it is a common practice to get validation accuracy.
In most real-world applications/competitions, test data doesn't come with labels and it is just used for generating model inferences that would be evaluated later by humans or automatically in case of competitions. Dev data is anyway not used for loss computation, so there's no harm in using it for testing the model's accuracy. Although, it should be the same as dev/validation accuracy.
Does this answer your question? Please let me know if I misunderstood anything :)