RNN-RL

RNN+Transformer based reinforcement learning. Currently implemented for solving procedurally generated knapsack problems (https://en.wikipedia.org/wiki/Knapsack_problem)

Features:

Transformer-based input encoder.
GRU combines encodings with action history, using hidden layer for RL state representation.
Pytorch only DQL implementation. With memory buffer and double-Q implementation.
Bayesian Q-value output. Use MDN representation of Q-value for a richer understanding of expected reward. Could be used for custom exploration or inference strategies.

Documentation and code-cleaning work in progress.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
MDN @ 2449f1d		MDN @ 2449f1d
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
datamanager.py		datamanager.py
model.pt		model.pt
models.py		models.py
strats.py		strats.py

Provide feedback