Skip to content

Fix dqn model evals #381

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
May 6, 2023
Merged

Fix dqn model evals #381

merged 4 commits into from
May 6, 2023

Conversation

sdpkjc
Copy link
Collaborator

@sdpkjc sdpkjc commented May 6, 2023

Description

Fixes #380

Types of changes

  • Bug fix
  • New feature
  • New algorithm
  • Documentation

Checklist:

  • I've read the CONTRIBUTION guide (required).
  • I have ensured pre-commit run --all-files passes (required).
  • I have updated the tests accordingly (if applicable).
  • I have updated the documentation and previewed the changes via mkdocs serve.
    • I have explained note-worthy implementation details.
    • I have explained the logged metrics.
    • I have added links to the original paper and related papers.

If you need to run benchmark experiments for a performance-impacting changes:

  • I have contacted @vwxyzjn to obtain access to the openrlbenchmark W&B team.
  • I have used the benchmark utility to submit the tracked experiments to the openrlbenchmark/cleanrl W&B project, optionally with --capture-video.
  • I have performed RLops with python -m openrlbenchmark.rlops.
    • For new feature or bug fix:
      • I have used the RLops utility to understand the performance impact of the changes and confirmed there is no regression.
    • For new algorithm:
      • I have created a table comparing my results against those from reputable sources (i.e., the original paper or other reference implementation).
    • I have added the learning curves generated by the python -m openrlbenchmark.rlops utility to the documentation.
    • I have added links to the tracked experiments in W&B, generated by python -m openrlbenchmark.rlops ....your_args... --report, to the documentation.

@vercel
Copy link

vercel bot commented May 6, 2023

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
cleanrl ✅ Ready (Inspect) Visit Preview 💬 Add feedback May 6, 2023 9:56pm

@sdpkjc
Copy link
Collaborator Author

sdpkjc commented May 6, 2023

Next -> Add test cases

@sdpkjc
Copy link
Collaborator Author

sdpkjc commented May 6, 2023

Can we modify the existing test cases to test them, or create a new test file for them?

def test_dqn_jax():
    subprocess.run(
        "python cleanrl/dqn_atari_jax.py --save-model True --learning-starts 10 --total-timesteps 16 --buffer-size 10 --batch-size 4",
        shell=True,
        check=True,
    )

@sdpkjc
Copy link
Collaborator Author

sdpkjc commented May 6, 2023

The model evaluation dependency environment is related to the algorithm dependency environment. If we create a new test file for the model evaluation, it will multiply the number of test files. I suggest just adding --save-model True, or something simple way.

@vwxyzjn
Copy link
Owner

vwxyzjn commented May 6, 2023

The model evaluation dependency environment is related to the algorithm dependency environment. If we create a new test file for the model evaluation, it will multiply the number of test files. I suggest just adding --save-model True, or something simple way.

This sounds good to me!

Copy link
Owner

@vwxyzjn vwxyzjn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Feel free to merge when you are ready

Copy link
Owner

@vwxyzjn vwxyzjn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Feel free to merge when you are ready

@sdpkjc
Copy link
Collaborator Author

sdpkjc commented May 6, 2023

Thanks for your review. 👌🫡

@sdpkjc sdpkjc merged commit e19f858 into vwxyzjn:master May 6, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Bug of cleanrl_utils/evals
2 participants