-
Couldn't load subscription status.
- Fork 154
Open
Labels
Description
Dyna-Q is a conceptual algorithm that illustrates how real and simulated experience can be combined in building a policy. Planning in RL terminology refers to using simulated experience generated by a model to find or improve a policy for interacting with a modeled environment
Any plans on having this agent in mushrrom rl ?
Additional context
Add any other context or screenshots about the feature request here.