Yahoo! news article recommendation system by linUCB
-
Updated
Feb 1, 2018 - Python
Yahoo! news article recommendation system by linUCB
Bandit algorithms
A gymnasium-compatible framework to create reinforcement learning (RL) environment for solving the optimal power flow (OPF) problem. Contains five OPF benchmark environments for comparable research.
Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volatile multi-armed bandit setting.
Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".
Contextual bandit implementation using Keras
An illustrative project including some multi-armed bandit algorithms and contextual bandit algorithms
A Reinforcement Learning approach to a contextual bandit problem.
Add a description, image, and links to the contextual-bandit topic page so that developers can more easily learn about it.
To associate your repository with the contextual-bandit topic, visit your repo's landing page and select "manage topics."