imperfect-reward-function

Star

Here are 2 public repositories matching this topic...

Facebear-ljx / RGM

Star

The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)

pytorch offline-reinforcement-learning imperfect-reward-function

Updated Mar 3, 2023
Python

Brezy024 / Mind-the-Gap

Star

# Mind-the-GapMind the Gap aims to enhance Chain of Thought (CoT) tuning for better AI performance. Join us in exploring innovative solutions and contributing to the project! 🐙🌟

android genomics anime range image-translation open-set domain-adaptation styletransfer interval-set debruijn-graph open-set-recognition open-set-domain-adaptation stylegan2 stylegan-image-manipulation offline-reinforcement-learning iclr2022 single-shot-domain-adaptation imperfect-reward-function

Updated Aug 20, 2025
Python

Improve this page

Add a description, image, and links to the imperfect-reward-function topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the imperfect-reward-function topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

imperfect-reward-function

Here are 2 public repositories matching this topic...

Facebear-ljx / RGM

Brezy024 / Mind-the-Gap

Improve this page

Add this topic to your repo