Replies: 1 comment
-
@jamartinh These features might be handy and helpful, however, it is low priority for me as long as it only calculates statistics.
Any calculations have impact to some extent, so that it is better to choice worthwhile features to put inside. FYI: DeepMind's Reverb can limit sampling speed by specifying replay ratio, which is more meaningful, but which requires more works. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi @ymd-h I was thinking on a useful feature I think could be more or less simple to implement and also very useful for RL.
Adding basic stats inside replay buffers.
E.g. for instance
Perhaps many measures that are used that are usually calculated outside can be incorporated without penalizing performance such as sample age, ratio of priority updates and so on.
As an idea maybe, but not sure, a creation_timestamp and modification_timestamp can be incorporated also to any sample in the buffer, not sure on this, just thinking.
These kind of stats will help to track then training stats such as throughput and the ratio between samples_insertions / samples_consumed for instance and also can be combined with gradient_steps for other useful ratios or measures used in RL algos to trace performance and adjust learning.
Beta Was this translation helpful? Give feedback.
All reactions