Skip to content

Sampling from Probability Distributions #4

@rayush7

Description

@rayush7

Hi @wgcban

Thank you for your paper and code for AdaMAE.

In this line a Multinomial distribution is used for sampling the indices for the visible tokens given the probability p_x. Could you please explain if this operation would be differentiable during back-propagation?

From what I understand REINFORCE is applied in this part (from Line 71 to Line 80). Is there any connection between sampling from a Categorical distribution in this part and the one from Multinomial distribution above? I am a bit confused. Could you please clarify?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions