Replies: 1 comment
-
Hi, I also had difficulty using MaskedCategorical (and MaskedOneHotCategorical -- I'm not sure of the difference) with a ProbabilisticActor using CompositeDistribution, and was not able to get it to work. I ended up using OneHotCategorical (https://pytorch.org/rl/0.6/reference/generated/torchrl.modules.OneHotCategorical.html#torchrl.modules.OneHotCategorical) and applied the masking myself in the forward pass (replacing masked locations with float('-inf')). Note that this may require passing both the input and the mask as separate inputs to your nn.Module, which to the best of my knowledge is not possible with nn.Sequential (so you must use the functional API instead).
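The manual masking described above can be sketched roughly like this. This is a minimal illustration, not the poster's actual code: the helper name `mask_invalid_logits` and the toy tensors are made up, and in a real actor the logits would come from the network's forward pass before being handed to OneHotCategorical.

```python
import torch

def mask_invalid_logits(logits: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    """Replace logits of invalid actions with -inf so softmax assigns them zero probability."""
    return logits.masked_fill(~mask, float("-inf"))

# Toy example: 4 discrete actions, action index 1 is currently invalid.
logits = torch.tensor([1.0, 2.0, 3.0, 4.0])
mask = torch.tensor([True, False, True, True])

probs = torch.softmax(mask_invalid_logits(logits, mask), dim=-1)
# The masked action gets probability exactly 0; the remaining
# probabilities renormalize over the valid actions.
```

The masked logits tensor can then be passed to the distribution constructor in place of the raw logits.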
-
I'm working on a reinforcement learning scenario with discrete action types and continuous parameters, using a ProbabilisticActor with a CompositeDistribution. Initially, I used Categorical for the discrete action type and masked invalid actions directly in the logits. With that approach, the KL divergence started to explode during training.
I'm now considering switching from Categorical to torchrl.modules.MaskedCategorical. However, the mask does not seem to be passed through correctly.
Question: Has anyone successfully used MaskedCategorical with a ProbabilisticActor and could share some hints?