1. What is reward? It looks like a reinforcement learning concept, but the paper never mentions it. How is the reward defined, and how is it computed?
2. link_prediction does not use bias. Why is that?