GitHub - SamNPowers/large-scale-curiosity: An extension of the paper "Large-Scale Study of Curiosity-Driven Learning" to deal with stochastic environments

Status: Archive (code is provided as-is, no updates expected)

Large-Scale Study of Curiosity-Driven Learning

[Project Website] [Demo Video]

Yuri Burda*, Harri Edwards*, Deepak Pathak*,
Amos Storkey, Trevor Darrell, Alexei A. Efros
(* alphabetical ordering, equal contribution)

University of California, Berkeley
OpenAI
University of Edinburgh

This is a TensorFlow based implementation for our paper on large-scale study of curiosity-driven learning across 54 environments. Curiosity is a type of intrinsic reward function which uses prediction error as reward signal. In this paper, We perform the first large-scale study of purely curiosity-driven learning, i.e. without any extrinsic rewards, across 54 standard benchmark environments. We further investigate the effect of using different feature spaces for computing prediction error and show that random features are sufficient for many popular RL game benchmarks, but learned features appear to generalize better (e.g. to novel game levels in Super Mario Bros.). If you find this work useful in your research, please cite:

@inproceedings{largeScaleCuriosity2018,
    Author = {Burda, Yuri and Edwards, Harri and
              Pathak, Deepak and Storkey, Amos and
              Darrell, Trevor and Efros, Alexei A.},
    Title = {Large-Scale Study of Curiosity-Driven Learning},
    Booktitle = {arXiv:1808.04355},
    Year = {2018}
}

Installation and Usage

The following command should train a pure exploration agent on Breakout with default experiment parameters.

python run.py

To use more than one gpu/machine, use MPI (e.g. mpiexec -n 8 python run.py should use 1024 parallel environments to collect experience instead of the default 128 on an 8 gpu machine).

Data for plots in paper

Data for Figure-2: contains raw game score data along with the plotting code to generate Figure-2 in the paper.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
roboenvs		roboenvs
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
auxiliary_tasks.py		auxiliary_tasks.py
cnn_policy.py		cnn_policy.py
cppo_agent.py		cppo_agent.py
dynamics.py		dynamics.py
mpi_utils.py		mpi_utils.py
recorder.py		recorder.py
rollouts.py		rollouts.py
run.py		run.py
utils.py		utils.py
vec_env.py		vec_env.py
wrappers.py		wrappers.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Large-Scale Study of Curiosity-Driven Learning

[Project Website] [Demo Video]

Installation and Usage

Data for plots in paper

Other helpful pointers

About

Releases

Packages

Languages

SamNPowers/large-scale-curiosity

Folders and files

Latest commit

History

Repository files navigation

Large-Scale Study of Curiosity-Driven Learning

[Project Website] [Demo Video]

Installation and Usage

Data for plots in paper

Other helpful pointers

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages