In many of the MedHELM scenarios, the data is redacted for compliance reasons. This makes the benchmark hard to understand for users exploring the leaderboard.
The request is to add, possibly in the header of the benchmark page (see below), an example of what the model is being asked to do.
For reference, this is what we are including in the appendix of the paper to illustrate the task being evaluated.