Evals with 0% success rate allowed? #37

AlessioGr · 2023-03-14T20:39:08Z

AlessioGr
Mar 14, 2023

Are Evals with 0% success rate allowed - if I believe GPT should be able to solve it and if a human can solve it easily?

placcaumuhire · 2023-03-14T21:24:53Z

placcaumuhire
Mar 14, 2023

Evals, or evaluations, are tests that are used to measure how well a machine learning model is performing. They are typically designed to assess the model's ability to complete a specific task or solve a specific problem.

In general, it is not desirable for an evaluation to have a 0% success rate, as this means that the model is not able to perform the task at all. However, there may be cases where a 0% success rate is acceptable, depending on the specific circumstances of the evaluation.

For example, if a human can easily solve the problem being tested, but the machine learning model cannot, it may still be useful to evaluate the model's performance in order to identify areas where it needs improvement. In this case, a 0% success rate would indicate that the model needs significant work in order to be able to perform the task as well as a human can.

Ultimately, the decision of whether to allow evaluations with a 0% success rate will depend on the goals and objectives of the specific project or application.

0 replies

placcaumuhire · 2023-03-14T21:27:13Z

placcaumuhire
Mar 14, 2023

Evals" are like tests for robots to see how well they can understand and do things. Sometimes, a test might be too hard for the robot and it can't do it, so it gets a score of zero. It's okay if a robot gets a zero score sometimes, but if it keeps getting zero scores all the time, then we might need to change the test or help the robot get better.

0 replies

Ein-Tim · 2023-03-15T07:48:49Z

Ein-Tim
Mar 15, 2023

Just copy pasting answers from ChatGPT doesn't help here. I'd suggest to refrain from doing so in the future.

0 replies

placcaumuhire · 2023-03-15T08:23:53Z

placcaumuhire
Mar 15, 2023

ChatGPT said: Just Chill!

1 reply

placcaumuhire Mar 15, 2023

Sent From a Macbook Pro in Masaka, Kigali, Rwanda! @Ein-Tim 🤓😎🥸😂

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Evals with 0% success rate allowed? #37

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 4 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Evals with 0% success rate allowed? #37

Uh oh!

Uh oh!

AlessioGr Mar 14, 2023

Replies: 4 comments · 1 reply

Uh oh!

placcaumuhire Mar 14, 2023

Uh oh!

placcaumuhire Mar 14, 2023

Uh oh!

Ein-Tim Mar 15, 2023

Uh oh!

placcaumuhire Mar 15, 2023

Uh oh!

placcaumuhire Mar 15, 2023

AlessioGr
Mar 14, 2023

Replies: 4 comments 1 reply

placcaumuhire
Mar 14, 2023

placcaumuhire
Mar 14, 2023

Ein-Tim
Mar 15, 2023

placcaumuhire
Mar 15, 2023