I have two questions

1.reward是什么？这似乎是强化学习的内容，论文里也没提到过，我想知道reward是怎么定义的？
2.link_prediction里并没有用到bias，这是为什么呢？

1. What is reward? it seems to be the content of reinforcement learning, and it is not mentioned in the paper. I want to know how to get the reward?
2. Bias is not used in link_prediction, why is this?