[Rust] GrpoRewards: Implement Shaping Functions

> @nikg4 @taenin what other functions should we put into the GrpoRewards struct? other rewards functions? Here is a first cut from sonnet
> 
> 
> 3. **Shaping Functions**:
>    - `potential_based_shaping(state, next_state, gamma)` - Implement potential-based reward shaping
>    - `curiosity_reward(state, next_state, prediction)` - Generate intrinsic motivation rewards
>    - `constraint_penalty(state, action)` - Apply penalties for constraint violations
> 

Implement potential_based_shaping, curiosity_reward, constraint_penalty functions

 _Originally posted by @kyjohnso in [#13](https://github.com/oumi-ai/roumi/issues/13#issuecomment-2745276290)_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Rust] GrpoRewards: Implement Shaping Functions #18

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Rust] GrpoRewards: Implement Shaping Functions #18

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions