
An implementation of the state-of-the-art reinforcement learning technique for continuous action spaces known as Twin Delayed Deep Deterministic Policy Gradient (TD3).


Husain0007/Reinforcement-Learning-with-TD3


Reinforcement Learning with TD3

The Twin Delayed Deep Deterministic Policy Gradient algorithm, or TD3 for short, extends the ideas behind traditional Deep Q-Learning, which is relegated to discrete action spaces, to continuous action spaces.
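The TD3 algorithm was introduced by Fujimoto, van Hoof, and Meger in the 2018 paper "Addressing Function Approximation Error in Actor-Critic Methods" (here). It builds on DDPG, which was developed at Google DeepMind.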

In this project, I implement the TD3 algorithm in PyTorch to train a quadruped to run across a field generated in the PyBullet environment.
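To make the "twin" and "delayed" parts of the name concrete, here is a minimal, framework-free sketch of the three ingredients TD3 adds on top of DDPG: clipped double-Q targets, target policy smoothing, and delayed actor updates with soft target updates. It is written in plain Python rather than PyTorch so the arithmetic is easy to follow, and it is not the code from this repository. The function names, the callable critics `q1_target` and `q2_target`, and the default hyperparameters (commonly used values) are all assumptions made for illustration.

```python
import random


def td3_target(reward, done, next_state, next_action, q1_target, q2_target,
               gamma=0.99, noise_std=0.2, noise_clip=0.5, max_action=1.0):
    """Critic target for one transition (illustrative sketch, not the repo's code).

    next_action is assumed to be the target actor's output for next_state.
    q1_target and q2_target are hypothetical callables: (state, action) -> float.
    """
    # Target policy smoothing: add clipped Gaussian noise to each action
    # dimension, then clip the result back into the valid action range.
    smoothed = []
    for a in next_action:
        noise = random.gauss(0.0, noise_std)
        noise = max(-noise_clip, min(noise_clip, noise))
        smoothed.append(max(-max_action, min(max_action, a + noise)))

    # Clipped double-Q ("twin"): take the smaller of the two target critics'
    # estimates to counter overestimation bias.
    q_min = min(q1_target(next_state, smoothed), q2_target(next_state, smoothed))

    # Bootstrapped target; terminal transitions (done == 1.0) do not bootstrap.
    return reward + gamma * (1.0 - done) * q_min


def is_actor_update_step(step, policy_delay=2):
    """Delayed updates: the actor is updated once every policy_delay critic updates."""
    return step % policy_delay == 0


def soft_update(target_params, online_params, tau=0.005):
    """Polyak-average the online parameters into the slowly moving target copy."""
    return [(1.0 - tau) * t + tau * p
            for t, p in zip(target_params, online_params)]
```

In a PyTorch version the same steps would typically operate on batched tensors, with the critics as neural networks and the target computed without tracking gradients.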
