Skip to content

[Rust] GrpoRewards: Implement Shaping Functions #18

@kyjohnso

Description

@kyjohnso

@nikg4 @taenin what other functions should we put into the GrpoRewards struct? other rewards functions? Here is a first cut from sonnet

  1. Shaping Functions:
    • potential_based_shaping(state, next_state, gamma) - Implement potential-based reward shaping
    • curiosity_reward(state, next_state, prediction) - Generate intrinsic motivation rewards
    • constraint_penalty(state, action) - Apply penalties for constraint violations

Implement potential_based_shaping, curiosity_reward, constraint_penalty functions

Originally posted by @kyjohnso in #13

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions