# gradient-bandit

Here are 3 public repositories matching this topic...

SanketAgrawal / ReinforcementLearning

Chapter-wise implementation and analysis of all the algorithms in RL: An Introduction by Richard S. Sutton and Andrew G. Barto

reinforcement-learning artificial-intelligence epsilon-greedy python-3 ucb k-armed-bandit gradient-bandit optimistic-inital-values

Updated Jul 18, 2020
Jupyter Notebook

hritikb / Reinforcement-Learning-Algorithms

reinforcement-learning q-learning grid-world epsilon-greedy sarsa dynamic-programming multi-armed-bandits policy-iteration value-iteration monte-carlo-methods temporal-differencing-learning upper-confidence-bound gradient-bandit optimistic-inital-values greedy-policy

Updated Jun 29, 2023
Jupyter Notebook

MehranTaghian / policy-gradient-methods

Implementation of some of the policy gradient methods in PyTorch.

pytorch policy-gradient reinforce actor-critic ppo online-supervised-learning gradient-bandit batch-reinforce

Updated Jul 27, 2022
Python
