Fix dqn model evals #381

sdpkjc · 2023-05-06T21:14:29Z

Description

Fixes #380

Types of changes

Bug fix
New feature
New algorithm
Documentation

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the tests accordingly (if applicable).
I have updated the documentation and previewed the changes via mkdocs serve.
- I have explained note-worthy implementation details.
- I have explained the logged metrics.
- I have added links to the original paper and related papers.

If you need to run benchmark experiments for performance-impacting changes:

I have contacted @vwxyzjn to obtain access to the openrlbenchmark W&B team.
I have used the benchmark utility to submit the tracked experiments to the openrlbenchmark/cleanrl W&B project, optionally with --capture-video.
I have performed RLops with python -m openrlbenchmark.rlops.
- For new feature or bug fix:
  - I have used the RLops utility to understand the performance impact of the changes and confirmed there is no regression.
- For new algorithm:
  - I have created a table comparing my results against those from reputable sources (i.e., the original paper or other reference implementation).
- I have added the learning curves generated by the python -m openrlbenchmark.rlops utility to the documentation.
- I have added links to the tracked experiments in W&B, generated by python -m openrlbenchmark.rlops ....your_args... --report, to the documentation.

vercel · 2023-05-06T21:14:33Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
cleanrl	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	May 6, 2023 9:56pm

sdpkjc · 2023-05-06T21:15:01Z

Next -> Add test cases

sdpkjc · 2023-05-06T21:23:21Z

Should we modify the existing test cases to cover model evaluation, or create a new test file for it?

import subprocess

def test_dqn_jax():
    # Tiny smoke run with model saving enabled.
    subprocess.run(
        "python cleanrl/dqn_atari_jax.py --save-model True --learning-starts 10 --total-timesteps 16 --buffer-size 10 --batch-size 4",
        shell=True,
        check=True,
    )

sdpkjc · 2023-05-06T21:29:41Z

Model evaluation depends on the same environment as the algorithm it evaluates. Creating a separate test file for each model evaluation would multiply the number of test files. I suggest simply adding --save-model True to the existing tests, or something similarly simple.
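As a rough sketch of that suggestion, the existing smoke-test command could be built by a small helper that appends --save-model True, so no extra test file is needed. The helper names build_command and run_smoke_test are hypothetical and not part of CleanRL; the flags are the ones from the test case quoted earlier in this thread, and it is assumed that --save-model True is what exercises the model evaluation path.

```python
import shlex
import subprocess


def build_command(script: str, save_model: bool = True) -> str:
    """Hypothetical helper: build the smoke-test command line for one script."""
    args = ["python", script]
    if save_model:
        # Assumption: enabling model saving also runs the evaluation code path.
        args += ["--save-model", "True"]
    # Tiny settings copied from the existing test so the run finishes quickly.
    args += [
        "--learning-starts", "10",
        "--total-timesteps", "16",
        "--buffer-size", "10",
        "--batch-size", "4",
    ]
    return shlex.join(args)


def run_smoke_test(script: str) -> None:
    """Hypothetical helper: run the script and raise if it exits non-zero."""
    subprocess.run(build_command(script), shell=True, check=True)
```

An existing test such as test_dqn_jax would then only need to call run_smoke_test("cleanrl/dqn_atari_jax.py"), which keeps one test file per dependency environment.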

vwxyzjn · 2023-05-06T22:20:28Z

Model evaluation depends on the same environment as the algorithm it evaluates. Creating a separate test file for each model evaluation would multiply the number of test files. I suggest simply adding --save-model True to the existing tests, or something similarly simple.

This sounds good to me!

vwxyzjn

LGTM. Feel free to merge when you are ready

sdpkjc · 2023-05-06T22:39:27Z

Thanks for your review. 👌🫡

fix dqn model evals

eb991ff

vercel bot deployed to Preview May 6, 2023 21:14 View deployment

add eval model test cases

34f68ed

vercel bot deployed to Preview May 6, 2023 21:42 View deployment

fix pre-commit

ae20018

vercel bot deployed to Preview May 6, 2023 21:45 View deployment

fix tests ci

30dd75e

vercel bot deployed to Preview May 6, 2023 21:56 View deployment

sdpkjc requested a review from vwxyzjn May 6, 2023 21:57

vwxyzjn approved these changes May 6, 2023

sdpkjc merged commit e19f858 into vwxyzjn:master May 6, 2023
