I think I may have spotted a typo on page 10 of the PDF.
In the screenshot below, I have highlighted the word "harm" in yellow; I think it should be changed to "harmful", since I assume it modifies the word "content".
The sentence would then read:
... the RLHF algorithm is particularly useful to mitigate the issues of generating harmful or toxic content for LLMs ...