Replies: 1 comment
-
I'm closing this discussion as I have transferred it (back) into an issue, for details see #390 (comment). |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Eval description
An eval checking whether GPT-3.5 & GPT-4 can accurately match given lyrics to the song name.
Problem and motivation
After some testing with GPT-3.5 & GPT-4 (via ChatGPT+) I was honestly disappointed at how bad the LLM performed on these tasks, especially because a simple Google search of the lyrics nearly always brought up the correct song.
Examples
Example one: ❌❌
Example two: ❌❌
Example three: ❌✅
Example eval prompt
Is this something you're interested in working on
I'd really like to provide this eval however I currently neither have the time to do so nor sufficient technical skills. Thus, I'm sharing my idea here and hope that someone will provide a PR based on this idea!
Beta Was this translation helpful? Give feedback.
All reactions