1 file changed, +5 -4
@@ -226,11 +226,12 @@ def main(cfg: "DictConfig"):  # noqa: F821
  current_frames = tensordict.numel()
  collected_frames += current_frames
 
- # Compared to the original paper, the replay buffer is not temporally sampled. We fill it with trajectories of length batch_length.
- # To be closer to the paper, we would need to fill it with trajectories of lentgh 1000 and then sample subsequences of length batch_length.
+ # Compared to the original paper, the replay buffer is not temporally
+ # sampled. We fill it with trajectories of length batch_length.
+ # To be closer to the paper, we would need to fill it with trajectories
+ # of length 1000 and then sample subsequences of length batch_length.
 
- # tensordict = tensordict.reshape(-1, cfg.batch_length)
- print(tensordict.shape)
+ tensordict = tensordict.reshape(-1, cfg.batch_length)
  replay_buffer.extend(tensordict.cpu())
  logger.log_scalar(
      "r_training",
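
The hunk above contrasts two ways of filling the replay buffer: cutting the collected frames into fixed rows of `batch_length` (what the re-enabled `reshape(-1, cfg.batch_length)` does) and storing long trajectories and sampling windows from them later (what the comment describes as closer to the paper). The sketch below illustrates the difference in plain Python, with lists standing in for the tensordict. The helper names `chunk_frames` and `sample_subsequence`, and the toy sizes 12, 4 and 50, are hypothetical and do not appear in the training script.

```python
import random


def chunk_frames(frames, batch_length):
    """Split a flat list of frames into rows of batch_length.

    Pure-Python stand-in for tensordict.reshape(-1, batch_length). Like a
    reshape, it rejects a frame count that is not a multiple of batch_length.
    """
    if len(frames) % batch_length != 0:
        raise ValueError(
            f"{len(frames)} frames cannot be split into rows of {batch_length}"
        )
    return [
        frames[start:start + batch_length]
        for start in range(0, len(frames), batch_length)
    ]


def sample_subsequence(trajectory, batch_length, rng=random):
    """Draw one contiguous window of batch_length from a longer trajectory.

    This is the temporal sampling the code comment describes; it is an
    illustration, not the script's implementation.
    """
    if len(trajectory) < batch_length:
        raise ValueError("trajectory is shorter than batch_length")
    start = rng.randrange(len(trajectory) - batch_length + 1)
    return trajectory[start:start + batch_length]


# What the diff does: fixed, non-overlapping chunks go into the buffer.
frames = list(range(12))        # 12 collected frames (toy data)
rows = chunk_frames(frames, 4)  # analogous to reshape(-1, cfg.batch_length)

# What the comment describes: keep a long trajectory, sample windows later.
long_trajectory = list(range(1000))
window = sample_subsequence(long_trajectory, 50, random.Random(0))
```

With fixed chunks, window boundaries are decided once at collection time. With temporal sampling, any of the `len(trajectory) - batch_length + 1` start positions can be drawn, so the same stored trajectory yields many overlapping training sequences.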