Reproducing CURL

By Gijs Koning and Chiel de Vries

This repository houses a reproduction of CURL (Contrastive Unsupervised Representations for Reinforcement Learning), a method that trains a neural network encoder to learn a useful representation of image data for reinforcement learning.
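
To make the idea concrete: as described in the CURL paper, the encoder is trained with a contrastive objective. Two augmented views (random crops) of the same observation are encoded as a query and a key, and the query must pick out its own key among the keys of the other observations in the batch, using a bilinear similarity. The block below is a minimal pure-Python sketch of that loss under these assumptions. It is not taken from this repository; the function name `info_nce_loss` and its arguments are hypothetical, and a real implementation would use a tensor library and a momentum-updated key encoder.

```python
import math


def info_nce_loss(queries, keys, W):
    """Contrastive (InfoNCE-style) loss with bilinear similarity q^T W k.

    queries, keys: lists of B feature vectors (lists of floats), each of length D.
    W: D x D matrix given as a list of rows (hypothetical learned parameter).
    keys[i] is the positive for queries[i]; all other keys act as negatives.
    """
    batch_size = len(queries)
    dim = len(W[0])
    total = 0.0
    for i, q in enumerate(queries):
        # Row vector q^T W, of length D.
        qW = [sum(q[r] * W[r][c] for r in range(len(q))) for c in range(dim)]
        # Similarity of this query with every key in the batch.
        logits = [sum(a * b for a, b in zip(qW, k)) for k in keys]
        # Numerically stable log-sum-exp over the logits.
        m = max(logits)
        log_sum_exp = m + math.log(sum(math.exp(l - m) for l in logits))
        # Cross-entropy with the matching key (index i) as the correct class.
        total += log_sum_exp - logits[i]
    return total / batch_size
```

With an identity `W`, the loss is close to zero when every query is most similar to its own key, and it grows when queries align with the wrong keys, which is the signal that pushes the encoder to produce features that are consistent across augmentations.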

Motivation

Reinforcement learning is a promising area of machine learning and is important for the future of robotics and industrial automation. We therefore want to learn more about the subject by reproducing this paper. Furthermore, CURL learns its representation without supervision, meaning that the encoder is trained without ground-truth labels. We believe this makes unsupervised learning a very important topic: labelled data is expensive and time-consuming to acquire, so a neural net that can learn its task without labelled data is much cheaper to train.

The project is part of the "Seminar Computer Vision by Deep Learning" course (CS4245) at TU Delft. Our work is relevant to this course because it concerns learning a useful representation of image data, a classic computer vision task that is tackled here with state-of-the-art neural networks.

Finally, we want to push ourselves by documenting the process thoroughly. We think this can help us greatly in our studies and teach us not only about deep learning, but also about ourselves.

Relevant Pages

Other related papers and information

Soft Actor Critic (SAC) (Haarnoja et al., 2018)
Documentation
Implementation used by the paper (Yarats et al., 2019)
Reinforcement Learning with Augmented Data
Learning Invariant Representations for Reinforcement Learning without Reconstruction
Decoupling Representation Learning from Reinforcement Learning
Data-Efficient Reinforcement Learning with Self-Predictive Representations
CURL GitHub
CURL for Atari

Start training

conda env create -f conda_env.yml
conda activate curl
With CURL encoder: bash scripts/run.sh
Without encoder: bash scripts/run_identity.sh
Visualize training (for cartpole): tensorboard --logdir tmp/cartpole --reload_interval 30 --host 0.0.0.0


gijskoning/ReproducingCURL
