Evaluation of a QA System Clarification #6256
I am building a pure retriever and want to evaluate its performance. For a subset of documents I have question-answer pairs, including the location of the answer in the text, and I would like to use these QA pairs to evaluate the retriever.

I don't want to use the Haystack annotation tool, so I cannot generate the required SQuAD JSON file that way. I could write it by hand, but what do I put in the "id" field? The evaluation tutorial (https://haystack.deepset.ai/tutorials/05_evaluation) mentions an "Alternative: Define queries and labels directly" and goes on to create a MultiLabel object. There is an id field there as well that I don't understand how to fill out. If anyone could help me figure this out, I would be very thankful!

More generally, I am somewhat confused about the roles of doc_index, label_index, and add_eval_data(). If anyone could explain, or link to an explanation of, what exactly these indices do and how they fit into the picture, that would also be greatly appreciated. My sincere thanks in advance to anyone taking the time to help.
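For reference, this is roughly the file I would be writing by hand (my understanding of the SQuAD-v2-style format that add_eval_data() expects; the document, question, and answer below are placeholders from my own data, and the per-question "id" is exactly the part I'm unsure about):

```python
import json
import uuid

# Minimal SQuAD-style eval file as I understand the expected layout.
# All titles, texts, and offsets below are illustrative placeholders.
eval_data = {
    "data": [
        {
            "title": "my_document.txt",  # placeholder document title
            "paragraphs": [
                {
                    "context": "Paris is the capital of France.",
                    "qas": [
                        {
                            "id": str(uuid.uuid4()),  # <-- what should this be?
                            "question": "What is the capital of France?",
                            "answers": [
                                {
                                    "text": "Paris",
                                    "answer_start": 0,  # character offset in context
                                }
                            ],
                            "is_impossible": False,
                        }
                    ],
                }
            ],
        }
    ]
}

with open("my_eval_data.json", "w") as f:
    json.dump(eval_data, f, indent=2)
```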
Replies: 1 comment
Managed to figure it out. I just put random numbers in the "id" field and it works.
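For anyone finding this later, here is roughly what I ended up with when defining labels directly (a sketch against the Haystack 1.x schema used in the tutorial; adjust to your version):

```python
from uuid import uuid4

from haystack.schema import Answer, Document, Label, MultiLabel

# The id just needs to be a unique string -- I put random values in and it
# works; a uuid is tidier. Dummy question/answer content for illustration.
doc = Document(
    content="Paris is the capital of France.",
    id=str(uuid4()),  # any unique string works here
)

label = Label(
    query="What is the capital of France?",
    answer=Answer(answer="Paris", type="extractive"),
    document=doc,
    is_correct_answer=True,
    is_correct_document=True,
    origin="gold-label",
)

eval_labels = [MultiLabel(labels=[label])]
```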