Reproducing CURL

By Gijs Koning and Chiel de Vries

This repository houses a reproduction of CURL (Contrastive Unsupervised Representations for Reinforcement Learning), a neural network that aims to learn a useful representation of image data for use in reinforcement learning.
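As background, CURL trains its representation with a contrastive objective: two random crops of the same observation should score as more similar to each other than to crops of other observations in the batch. The following is a minimal pure-Python sketch of such an InfoNCE-style loss with a bilinear similarity. The function name `info_nce_loss`, the plain-list inputs, and the toy vectors are illustrative assumptions, not code from this repository.

```python
import math


def info_nce_loss(queries, keys, W):
    """Mean InfoNCE loss over a batch (hypothetical helper, for illustration).

    queries[i] and keys[i] are embeddings of two augmented views of the same
    observation (the positive pair); every other key acts as a negative.
    W is a square matrix defining the bilinear similarity q^T W k.
    """
    n = len(queries)
    d = len(W)
    total = 0.0
    for i in range(n):
        q = queries[i]
        # Similarity of query i against every key in the batch.
        logits = [
            sum(q[a] * W[a][b] * k[b] for a in range(d) for b in range(d))
            for k in keys
        ]
        # Numerically stable log-sum-exp over the logits.
        m = max(logits)
        log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
        # Cross-entropy with the positive pair (index i) as the target class.
        total += log_sum - logits[i]
    return total / n


# Toy usage: with an identity W, matched pairs give a lower loss than swapped ones.
identity = [[1.0, 0.0], [0.0, 1.0]]
views = [[5.0, 0.0], [0.0, 5.0]]
aligned = info_nce_loss(views, views, identity)
swapped = info_nce_loss(views, views[::-1], identity)
print(aligned < swapped)
```

In a real training setup the embeddings would come from an image encoder and `W` would be learned; both are fixed by hand here to keep the sketch self-contained.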

Motivation

Reinforcement learning is a promising area of machine learning and is important for the future of robotics and industrial automation. We therefore want to learn more about the subject by reproducing this paper. Furthermore, CURL is trained in an unsupervised way, meaning that the network learns without ground-truth labels. We believe this makes unsupervised learning a very important topic: labelled data is expensive and time-consuming to acquire, so a neural net that can learn its task without labelled data is much cheaper to train.

The project is part of the "Seminar Computer Vision by Deep Learning" course (CS4245) at TU Delft. Our work is relevant to this course because it concerns learning a useful representation of image data, a classic computer vision task that CURL tackles with state-of-the-art neural nets.

Finally, we want to push ourselves by documenting the process thoroughly. We think this can help us greatly in our studies and teach us not only about deep learning, but also about ourselves.

Relevant Pages

Other related papers and information

Start training

  • conda env create -f conda_env.yml
  • conda activate curl
  • With CURL encoder: bash scripts/run.sh
  • Without encoder: bash scripts/run_identity.sh
  • Visualize training (for cartpole): tensorboard --logdir tmp/cartpole --reload_interval 30 --host 0.0.0.0
