Commit eb1a271

Update RLHF_with__PPO.md
1 parent c1b8228 commit eb1a271

1 file changed: +2 -1 lines changed


docs/RLHF_with__PPO.md

Lines changed: 2 additions & 1 deletion
@@ -1,8 +1,9 @@
 # Reinforcement Learning from Human Feedback with PPO
 
-![Uploading image.png…]()
 
 
+![Uploading Screenshot 2024-11-21 083539.png…]()
+
 
 What is it, and why is it so confusing? Well, in this file, I will take you on a new adventure, and we will learn what **RLHF** with **PPO** actually means.
 