upper-confidence-bound

Here are 23 public repositories matching this topic...

Bin-Cao / Bgolearn

A Bayesian global optimization package for material design ｜ Adaptive Learning | Active Learning

material-design materials knowledge-gradient adaptive-learning materials-science materials-informatics active-learning expected-improvement upper-confidence-bound opportunity-cost bayesian-global-optimization predictive-entropy-search mlmd probability-of-improvement least-confidence margin-sampling entropy-based-approach augmented-expected-improvement trail-path bgolearn

Updated Oct 12, 2024
Jupyter Notebook

kapshaul / OnlineLearning

Star

Repository of Online Learning algorithms, including Bandits, UCB, and more.

machine-learning linear-regression bandit-learning online-learning adversarial-learning bandit upper-confidence-bound adaptive-ad

Updated Sep 28, 2024
Python

Evil0ctal / Upper-Confidence-Bound-Pywebio

Sponsor

Star

该仓库包含基于 PyWebIO 的 UCB（上置信界）算法在线演示，UCB 算法常用于多臂老虎机问题，以优化决策并最大化累积奖励。演示包括自动 UCB 算法模拟和交互式手动策略对比。

demo upper-confidence-bound pywebio

Updated Sep 22, 2024
Python

n-ferrante / MonteCarloTreeSearchCheckers

Star

This repository contains an implementation of checkers where different agents play against each other using different algorithms including Monte Carlo Tree Search, Alpha-Beta Pruning, and Minimax.

python reinforcement-learning monte-carlo-tree-search upper-confidence-bound checkers-ai

Updated Jun 21, 2024
Python

Retr0-code / Pong-RL

Star

Reinforcement learning used in the game of pong

cmake reinforcement-learning cpp q-learning ucb pong-game cpp20 boost-test upper-confidence-bound the-game-of-pong

Updated May 20, 2024
C++

loraalex / LoBook

Star

LoRa@FIIT algorithms comparison using jupyter notebooks

iot analysis lora ucb adr upper-confidence-bound lorafiit adaptive-data-rate

Updated Dec 10, 2023
Jupyter Notebook

Jayavathsan / MachineLearning-SciKitLearn

Star

Using SciKit Learn few Deep Learning Rules and Algorithms are implemented

reinforcement-learning clustering svm naive-bayes model-selection thompson-sampling xgboost classification dimensionality-reduction apriori k-means association-rules decision-tree principal-component-analysis linear-discriminant-analysis k-nearest-neighbor eclat upper-confidence-bound

Updated Aug 9, 2023
Jupyter Notebook

hritikb / Reinforcement-Learning-Algorithms

Star

reinforcement-learning q-learning grid-world epsilon-greedy sarsa dynamic-programming multi-armed-bandits policy-iteration value-iteration monte-carlo-methods temporal-differencing-learning upper-confidence-bound gradient-bandit optimistic-inital-values greedy-policy

Updated Jun 29, 2023
Jupyter Notebook

simonZhou86 / Tr_LinUCB

Star

Code for the paper "Truncated LinUCB for Stochastic Linear Bandits"

linear-bandits contextual-bandits upper-confidence-bound

Updated Jun 2, 2023
Python

aashish22bansal / Best-Ads-Predictor

Star

Predicting the best Ad from the given Ads.

reinforcement-learning thompson-sampling upper-confidence-bound

Updated May 26, 2022
Jupyter Notebook

lionelsamrat10 / Machine-learning-a-to-z

Star

This repo contains code templates of all the machine learning algorithms that are used, like Regression, Classification, Clustering, etc.

python machine-learning natural-language-processing reinforcement-learning deep-learning random-forest clustering naive-bayes machine-learning-algorithms regression thompson-sampling neural-networks classification dimensionality-reduction logistic-regression convolutional-neural-networks predictive-analytics artificial-neural-network principal-component-analysis upper-confidence-bound

Updated Feb 17, 2022
Jupyter Notebook

Nikronic / Machine-Learning-Models

Star

In This repository I made some simple to complex methods in machine learning. Here I try to build template style code.

Updated Nov 7, 2021
Python

krishnaaxo / Reinforcement-UCB-ThompsonSampling

Star

machine-learning reinforcement-learning thompson-sampling reinforcement-learning-algorithms upper-confidence-bounds upper-confidence-bound

Updated Jun 12, 2021
Jupyter Notebook

liuanji / WU-UCT

Star

A novel parallel UCT algorithm with linear speedup and negligible performance loss.

parallel-algorithm monte-carlo-tree-search upper-confidence-bound upper-confidence-trees

Updated Apr 26, 2021
Python

taylorjg / k-armed-bandit

Star

Web visualisation of the k-armed bandit problem

react web-worker epsilon-greedy multi-armed-bandit webworker k-armed-bandit upper-confidence-bound

Updated Feb 6, 2021
JavaScript

antoine-hochart / bandit_algo_evaluation

Star

Offline evaluation of multi-armed bandit algorithms

thompson-sampling epsilon-greedy policy-evaluation multi-armed-bandit upper-confidence-bound

Updated Dec 1, 2020
Python

salimandre / Monte-Carlo-Tree-Search-for-checkers-game

Star

We compare different policies for the checkers game using reinforcement learning algorithms.

python reinforcement-learning turtle-graphics ucb monte-carlo-tree-search checkers-game upper-confidence-bound mcts-algorithm

Updated Aug 24, 2020
Python

prabormukherjee / CTR_Testing

Star

Checking CTR(Click Thorugh Rate) of an ad using Thompson Sampling (Reinforcement Lrearning)

reinforcement-learning ml upper-confidence-bound ctr-testing

Updated Aug 12, 2020
Python

salimandre / Monte-Carlo-Tree-Search

Star

We implemented a Monte Carlo Tree Search (MCTS) from scratch and we successfully applied it to Tic-Tac-Toe game.

reinforcement-learning graphics mcts ucb monte-carlo-tree-search tic-tac-toe-game upper-confidence-bound

Updated Jul 9, 2020
Python

Lazarus789 / Reinforcement-Models

Star

thompson-sampling upper-confidence-bound

Updated Aug 12, 2019

Improve this page

Add a description, image, and links to the upper-confidence-bound topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the upper-confidence-bound topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

upper-confidence-bound

Here are 23 public repositories matching this topic...

Bin-Cao / Bgolearn

kapshaul / OnlineLearning

Evil0ctal / Upper-Confidence-Bound-Pywebio

n-ferrante / MonteCarloTreeSearchCheckers

Retr0-code / Pong-RL

loraalex / LoBook

Jayavathsan / MachineLearning-SciKitLearn

hritikb / Reinforcement-Learning-Algorithms

simonZhou86 / Tr_LinUCB

aashish22bansal / Best-Ads-Predictor

lionelsamrat10 / Machine-learning-a-to-z

Nikronic / Machine-Learning-Models

krishnaaxo / Reinforcement-UCB-ThompsonSampling

liuanji / WU-UCT

taylorjg / k-armed-bandit

antoine-hochart / bandit_algo_evaluation

salimandre / Monte-Carlo-Tree-Search-for-checkers-game

prabormukherjee / CTR_Testing

salimandre / Monte-Carlo-Tree-Search

Lazarus789 / Reinforcement-Models

Improve this page

Add this topic to your repo