Bandits

implement famous bandit algorithms

Requirements

I checked algorithms in the following environment:

OS X (10.9+), CPU
python (2.7.11)
numpy (1.11.3)
matplotlib (1.4.3)

K-armed Bernoulli Bandits

Epsilon-greedy
UCB
KL-UCB
Thompson sampling

Simple test can be done as follows:

python models.py

The result is shown as follows:

Contextual Bandits (incomplete)

1: Thompson sampling action selection with online logistic regression , Algorithm 3 at Chapelle, O., & Li, L. (2011). An empirical evaluation of thompson sampling. In Advances in neural information processing systems (pp. 2249-2257).

2: Epsilon greedy action selection with online logistic regression

3: Random action selection with online logistic regression (for comparison)

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
figures		figures
results		results
.python-version		.python-version
README.md		README.md
models.py		models.py
tmp_contextual_bandit.py		tmp_contextual_bandit.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bandits

Requirements

K-armed Bernoulli Bandits

Contextual Bandits (incomplete)

About

Releases

Packages

Languages

fullflu/bandit

Folders and files

Latest commit

History

Repository files navigation

Bandits

Requirements

K-armed Bernoulli Bandits

Contextual Bandits (incomplete)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages