Safe Model-Based Reinforcement Learning via Ensembles and Model Predictive Control

This is an implementation of a model-based reinforcement learning agent that predicts the safety and scores of action sequences by learning the environment's dynamics with an ensemble of neural networks. On the PointGoal-v1 task, the algorithm achieves higher sample efficiency than, and results comparable to, those reported in Safety Gym.
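The idea of scoring action sequences with an ensemble can be illustrated with a minimal sketch. This is not the code in this repository: the linear toy models stand in for the learned neural networks, random shooting stands in for whatever optimizer the planner actually uses, and every name here (make_toy_ensemble, rollout, plan, the goal, hazard and cost_limit parameters) is hypothetical.

```python
import numpy as np


def make_toy_ensemble(num_models, state_dim, action_dim, rng):
    """Build a hypothetical ensemble of linear dynamics models s' = A s + B a.

    These are stand-ins for learned neural networks; each member gets slightly
    different parameters to mimic disagreement between ensemble members.
    """
    ensemble = []
    for _ in range(num_models):
        A = np.eye(state_dim) + 0.01 * rng.standard_normal((state_dim, state_dim))
        B = 0.1 * np.eye(state_dim, action_dim)
        B = B + 0.01 * rng.standard_normal((state_dim, action_dim))
        ensemble.append((A, B))
    return ensemble


def rollout(ensemble, state, action_sequences, goal, hazard, hazard_radius):
    """Score every candidate action sequence under every ensemble member.

    Returns (returns, costs), each of shape (num_models, num_candidates).
    """
    num_candidates, horizon, _ = action_sequences.shape
    returns = np.zeros((len(ensemble), num_candidates))
    costs = np.zeros((len(ensemble), num_candidates))
    for m, (A, B) in enumerate(ensemble):
        states = np.tile(state, (num_candidates, 1))
        for t in range(horizon):
            states = states @ A.T + action_sequences[:, t] @ B.T
            # Toy reward: negative distance to the goal.
            returns[m] -= np.linalg.norm(states - goal, axis=1)
            # Toy cost: 1 for each step spent inside the hazard region.
            costs[m] += np.linalg.norm(states - hazard, axis=1) < hazard_radius
    return returns, costs


def plan(ensemble, state, goal, hazard, hazard_radius, rng,
         horizon=10, num_candidates=256, cost_limit=0.0):
    """Random-shooting MPC: return the first action of the best safe sequence."""
    action_dim = ensemble[0][1].shape[1]
    candidates = rng.uniform(-1.0, 1.0, (num_candidates, horizon, action_dim))
    returns, costs = rollout(ensemble, state, candidates, goal, hazard, hazard_radius)
    mean_return = returns.mean(axis=0)
    worst_cost = costs.max(axis=0)  # pessimistic over ensemble members
    safe = worst_cost <= cost_limit
    if safe.any():
        # Pick the highest-scoring sequence among those predicted safe.
        scores = np.where(safe, mean_return, -np.inf)
        best = int(np.argmax(scores))
    else:
        # No candidate is predicted safe: fall back to the least unsafe one.
        best = int(np.argmin(worst_cost))
    return candidates[best, 0], bool(safe[best])
```

In a control loop, plan would be called at every step with the current state, only the returned first action would be executed, and planning would then repeat from the next observed state. Taking the worst predicted cost over ensemble members is one conservative design choice among several; averaging the cost instead would be less cautious.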

To reproduce results

Install Safety Gym, then clone this repository and run pip install . from its root, preferably inside a conda or virtualenv Python environment.

To run all of the experiments, use:

cd scripts
bash run_experiments.sh

Note that this command launches all 12 experiments at once, so consider running each experiment separately.
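One way to run experiments one at a time is a small driver that waits for each command to finish before starting the next. The sketch below is an assumption, not part of this repository: run_sequentially is a hypothetical helper, and the commands are placeholders because the contents of run_experiments.sh are not shown here.

```python
import subprocess
import sys


def run_sequentially(commands):
    """Run each command to completion before starting the next.

    Returns a list of (command, return_code) pairs so failed runs are easy to spot.
    """
    results = []
    for command in commands:
        completed = subprocess.run(command, check=False)
        results.append((command, completed.returncode))
    return results


if __name__ == "__main__":
    # Placeholder commands: substitute the individual experiment commands
    # you want to run, one list of arguments per experiment.
    placeholder_commands = [
        [sys.executable, "-c", "print('experiment 1 placeholder')"],
        [sys.executable, "-c", "print('experiment 2 placeholder')"],
    ]
    for command, code in run_sequentially(placeholder_commands):
        print(code, command)
```

A non-zero return code does not stop the remaining runs, so one failed experiment does not block the others.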

To visualize experiment results, run:

python plot_results.py --data_path <your_data_path>

Hyperparameters and configuration files can be found in the config directory.

yardenas/ethz-safe-learning
