Constrained Exploration and Recovery from Experience Shaping is a model-free reinforcement learning algorithm that actively reshapes the agent's action space during training, so that reward-driven exploration stays within safety limits.
This repository accompanies the following paper on arXiv: https://arxiv.org/abs/1809.08925
*(Figure: unconstrained random exploration vs. constrained random exploration.)*
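To make the contrast above concrete, here is a toy, self-contained sketch (illustrative only, not code from this repository): unconstrained exploration samples actions anywhere in the action space, while constrained exploration only keeps actions satisfying a set of linear constraints `A a <= b`:

```python
import numpy as np

def unconstrained_random_action(act_dim):
    """Plain random exploration: sample anywhere in the action space."""
    return np.random.uniform(-1.0, 1.0, size=act_dim)

def constrained_random_action(A, b, act_dim, max_tries=100):
    """Toy constrained exploration: keep only actions with A @ a <= b.
    (Rejection sampling for illustration; the learned-constraint and
    QP-correction sketches below show a more direct mechanism.)"""
    for _ in range(max_tries):
        a = np.random.uniform(-1.0, 1.0, size=act_dim)
        if np.all(A @ a <= b):
            return a
    return np.zeros(act_dim)  # fall back to a conservative null action
```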
This implementation requires Python 3 and relies on TensorFlow for building and training constraint networks. Depending on your setup, run:

```
pip install tensorflow-gpu
```

if you have a CUDA-compatible device, or:

```
pip install tensorflow
```
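As a rough illustration of what a constraint network can compute (a hypothetical sketch, not the exact architecture used in the paper), here is a small TensorFlow model that maps an observation to a set of linear constraints `(A, b)` on the action, meaning `A a <= b`:

```python
import tensorflow as tf

def build_constraint_network(obs_dim, act_dim, n_constraints, hidden=64):
    """Hypothetical sketch: an MLP predicting n_constraints linear
    constraints A a <= b on the action from the current observation."""
    obs = tf.keras.layers.Input(shape=(obs_dim,))
    h = tf.keras.layers.Dense(hidden, activation="tanh")(obs)
    h = tf.keras.layers.Dense(hidden, activation="tanh")(h)
    A = tf.keras.layers.Dense(n_constraints * act_dim)(h)
    A = tf.keras.layers.Reshape((n_constraints, act_dim))(A)
    b = tf.keras.layers.Dense(n_constraints)(h)
    return tf.keras.Model(inputs=obs, outputs=[A, b])
```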
For training constraint networks together with control policies, we build on top of the OpenAI Baselines framework. Install it with:

```
pip install baselines
```
We will maintain compatibility with the OpenAI Baselines master branch (last verified commit: 2018-09-08); feel free to create an issue if you notice something wrong.
Quadratic programs are solved using quadprog. First install Cython:

```
pip install Cython
```

Then:

```
pip install quadprog
```
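As a hedged sketch of where quadprog fits in (the repository's actual interface may differ), an action proposed by the policy can be corrected by projecting it onto the constraint set `A a <= b`, i.e. solving `min ||a - a_raw||^2` subject to `A a <= b`:

```python
import numpy as np
import quadprog

def project_action(a_raw, A, b):
    """Project a_raw onto {a : A a <= b} by solving a small QP.
    quadprog.solve_qp minimizes 0.5 x^T G x - q^T x s.t. C^T x >= c,
    so we set G = I, q = a_raw, C = -A^T, c = -b."""
    n = a_raw.shape[0]
    G = np.eye(n)
    q = a_raw.astype(np.float64)
    C = -A.T.astype(np.float64)
    c = -b.astype(np.float64)
    x, *_ = quadprog.solve_qp(G, q, C, c)
    return x
```

Correcting the action through a QP projection (rather than rejection sampling as in the toy example above) keeps the corrected action as close as possible to the policy's proposal while still satisfying every constraint.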
Finally, clone this repository and install the local package with pip:

```
git clone git@github.com:IBM/constrained-rl.git
cd constrained-rl
pip install -e .
```
Examples and reference data are provided in the `examples` directory:
- Learning action space constraints from positive and negative demonstrations: fixed maze (one possible training signal is sketched after this list)
- Learning action space constraints from scratch: random obstacles with position and force control
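One simple way to turn positive (safe) and negative (unsafe) demonstrations into a training signal for a constraint network is a hinge loss that pushes safe actions inside the predicted constraints and unsafe actions outside. This is our own illustrative sketch, not necessarily the loss used in the paper:

```python
import tensorflow as tf

def constraint_hinge_loss(A, b, actions, is_safe, margin=0.1):
    """Hypothetical hinge loss on predicted constraints A a <= b.
    A: [batch, n_constraints, act_dim], b: [batch, n_constraints],
    actions: [batch, act_dim], is_safe: [batch] boolean labels."""
    # Signed violation of the most-violated constraint for each sample.
    slack = tf.reduce_max(
        tf.einsum("bca,ba->bc", A, actions) - b, axis=-1)
    safe_loss = tf.nn.relu(slack + margin)    # safe: satisfy all constraints
    unsafe_loss = tf.nn.relu(margin - slack)  # unsafe: violate at least one
    return tf.reduce_mean(tf.where(is_safe, safe_loss, unsafe_loss))
```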
The Constrained Exploration and Recovery from Experience Shaping project is released under the MIT license.
Full details of how to contribute to this project are documented in the CONTRIBUTING.md file.
The project's maintainers are responsible for reviewing and merging all pull requests, and they guide the overall technical direction of the project.