Plan-to-Explore implementation in PyTorch (Again)

Plan-to-explore

This repo implements the Plan-to-explore algorithm from "Planning to Explore via Self-Supervised World Models", built on top of the PlaNet-Pytorch codebase. It has been confirmed working on DeepMind Control Suite/MuJoCo environments. Hyperparameters are taken from the paper.

Installation

To install all dependencies with Anaconda, first create and activate a conda environment, then run the following command:

pip install -r requirements.txt

Training (e.g. DMC walker-walk zero-shot)

Zero-shot

python main.py --algo p2e --env walker-walk --action-repeat 2 --id name-of-experiment --zero-shot

Few-shot

python main.py --algo p2e --env walker-walk --action-repeat 2 --id name-of-experiment
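The two commands above differ only in the `--zero-shot` flag. A minimal sketch of a launcher that makes this explicit is shown below. The helper name `build_train_command` is hypothetical and not part of this repo; the flags are copied from the commands above, and no other options are assumed.

```python
import shlex


def build_train_command(env, experiment_id, zero_shot, action_repeat=2):
    """Build the argument list for main.py (hypothetical helper, not part of the repo).

    Only the flags shown in the README commands are used.
    """
    cmd = [
        "python", "main.py",
        "--algo", "p2e",
        "--env", env,
        "--action-repeat", str(action_repeat),
        "--id", experiment_id,
    ]
    if zero_shot:
        # The zero-shot and few-shot invocations differ only by this flag.
        cmd.append("--zero-shot")
    return cmd


if __name__ == "__main__":
    for zero_shot in (True, False):
        cmd = build_train_command("walker-walk", "name-of-experiment", zero_shot)
        # Print a copy-pasteable shell command.
        print(shlex.join(cmd))
```

To launch a run from Python instead of printing it, the returned list could be passed to `subprocess.run(cmd, check=True)` from the repository root.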

For best performance with DeepMind Control Suite, try setting the environment variable MUJOCO_GL=egl (see the rendering section of the dm_control documentation for instructions and details).
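The variable can be exported in the shell before running main.py, or set from Python. The sketch below takes the second route, assuming it runs before dm_control is imported, since the rendering backend is selected at import time. The helper name `configure_mujoco_rendering` is hypothetical and not part of this repo.

```python
import os


def configure_mujoco_rendering(backend="egl"):
    """Set MUJOCO_GL unless the user already chose a backend.

    Hypothetical helper: returns the value that is active afterwards.
    """
    # setdefault keeps any value already exported in the shell.
    os.environ.setdefault("MUJOCO_GL", backend)
    return os.environ["MUJOCO_GL"]


# Call this before any dm_control import so the backend choice takes effect.
configure_mujoco_rendering()
```

Using `setdefault` rather than a plain assignment means a backend chosen in the shell (for example `osmesa` on a machine without EGL support) is not overridden.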

We used Weights & Biases for logging the runs.

The performance of the zero-shot/few-shot trained policy is reported under the test/episode_reward metric.

wabbajack1/plan2explore-again

Folders and files

Latest commit

History

Repository files navigation

Plan-to-Explore implementation in PyTorch (Again)

Plan-to-explore

Installation

Training (e.g. DMC walker-walk zero-shot)

Links

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages