Skip to content

Implementation of Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning

Notifications You must be signed in to change notification settings

thanhkaist/mimo_Q_network

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 

Repository files navigation

Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning for Accelerated Offline RL

PyTorch implementation of
Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning
[Paper]


πŸ“˜ Overview

Offline reinforcement learning (RL) eliminates the need for interactive data collection, making it a safe and scalable alternative for real-world applications. However, it suffers from extrapolation errors caused by out-of-distribution (OOD) data in static datasets.

This paper proposes an Uncertainty-Aware Rank-One MIMO Q Network Framework, a novel framework that addresses these challenges through:

  • βœ… Uncertainty quantification over OOD samples using a single network
  • πŸ“ˆ Lower confidence bound optimization to train robust policies
  • πŸ’‘ Rank-One Multi-Input Multi-Output (MIMO) architecture that mimics ensemble behavior with minimal computational cost

This framework significantly improves sample efficiency, robustness, and speed while outperforming prior methods on the D4RL benchmark.


πŸš€ Features

  • πŸ” Uncertainty-aware training objective based on lower confidence bound of Q-values
  • 🧠 Lightweight Rank-One MIMO Q-Network for modeling aleatoric and epistemic uncertainty
  • πŸ§ͺ Plug-and-play architecture compatible with standard offline RL pipelines (e.g., CQL, TD3+BC)
  • πŸ“Š State-of-the-art performance on D4RL tasks with high computational efficiency

πŸ§ͺ Installation

git clone https://github.com/thanhkaist/mimo_Q_network.git
cd mimo_Q_network
pip install -r requirements.txt

πŸ“š Citation

If you use this codebase or refer to the method in your research, please cite:

@article{nguyen2024uncertainty,
  title={Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning},
  author={Nguyen, Thanh and Luu, Tung M and Ton, Tri and Kim, Sungwoong and Yoo, Chang D},
  journal={IEEE Access},
  volume={12},
  pages={100972--100982},
  year={2024},
  publisher={IEEE}
}

About

Implementation of Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published