
phuongboi/drone-control-using-reinforcement-learning


Control a drone in gym-pybullet-drones using PPO

Hovering a quadcopter at predefined positions in the gym-pybullet-drones environment, using the PPO algorithm from PPO-PyTorch.

25/09/2024 Update drone racing

06/09/2024 Update fly through the gate

  • I tested the FlyThruGateAviary environment with PPO, with some modifications to the reward function. I created a gate model in Tinkercad and added it to the PyBullet scene.
  • To train: python train_thrugate.py; to test: python test_thrugate.py

20/02/2024 Note about the PPO implementation

  • Recently, I figured out that the drone's jitter around the hover position may come from the fixed action_std of this PPO implementation: it sets action_std_init = 0.6 and decays this value during training, but in inference mode there is no mechanism to reduce or remove the variance, so the control output keeps fluctuating. Some implementations of Soft Actor-Critic instead use an extra layer to learn the action std alongside the action mean (see the sketch below).
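A minimal PyTorch sketch of that SAC-style idea. The class and names below are hypothetical illustrations, not PPO-PyTorch's or this repo's actual code:

```python
import torch
import torch.nn as nn

class GaussianActor(nn.Module):
    """Hypothetical actor: learns a state-dependent log-std via an extra
    head, instead of using a fixed, externally decayed action_std."""

    def __init__(self, obs_dim, act_dim, hidden=64):
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.Tanh(),
            nn.Linear(hidden, hidden), nn.Tanh(),
        )
        self.mean_head = nn.Linear(hidden, act_dim)
        self.log_std_head = nn.Linear(hidden, act_dim)  # SAC-style extra layer

    def forward(self, obs):
        h = self.body(obs)
        mean = torch.tanh(self.mean_head(h))          # actions in [-1, 1]
        log_std = self.log_std_head(h).clamp(-20, 2)  # keep std numerically sane
        return mean, log_std.exp()

# At inference, acting deterministically with the mean removes the residual
# variance entirely: mean, _ = actor(obs); action = mean
```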

13/01/2024 Update hovering with some constraints

  • Added some constraints to the naive reward; the drone looks more stable at the hover position (reference: paper). A sketch of the idea follows below.
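For illustration only, here is what such a constrained hover reward might look like. The coefficients and the exact observation layout are assumptions, not the repo's actual reward function:

```python
import numpy as np

def hover_reward(state, target=np.array([0.0, 0.0, 1.0])):
    """Hypothetical constrained hover reward (coefficients are made up)."""
    pos = state[0:3]        # x, y, z position
    rpy = state[7:10]       # roll, pitch, yaw
    ang_vel = state[13:16]  # body angular velocity

    reward = -np.linalg.norm(target - pos) ** 2  # naive distance-to-target term
    reward -= 0.1 * np.linalg.norm(rpy[:2])      # constraint: penalize tilt
    reward -= 0.05 * np.linalg.norm(ang_vel)     # constraint: penalize spinning
    return reward
```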

30/12/2023 Update training results

28/12/2023 Initial commit

  • Changed the reward function and the episode-termination computation (sketched below)
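A minimal sketch of what such a termination check could look like; the thresholds and names are illustrative, not taken from this repo:

```python
import numpy as np

def is_terminated(pos, rpy, target=np.array([0.0, 0.0, 1.0])):
    """Hypothetical early-termination check for a hover episode."""
    too_far = np.linalg.norm(pos - target) > 2.0      # drifted away from target
    flipped = abs(rpy[0]) > 0.8 or abs(rpy[1]) > 0.8  # excessive tilt (radians)
    return bool(too_far or flipped)
```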
Fly through the gate

[demo GIF]

Hover at (0, 0, 1) position

[demo GIF]

Hover at (0, 1, 1) position

[demo GIF]

How to use

  • Follow the author's guide to install the gym-pybullet-drones environment
  • To train: python train_hover.py
  • To test a pretrained model: python test_hover.py (a minimal evaluation-loop sketch follows below)
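For orientation, a minimal evaluation-loop sketch. This is not test_hover.py itself, and it assumes the gymnasium-style API of recent gym-pybullet-drones releases:

```python
from gym_pybullet_drones.envs.HoverAviary import HoverAviary

# Roll out one episode with random actions; swap in the trained PPO policy
# from test_hover.py to evaluate a real checkpoint.
env = HoverAviary(gui=True)
obs, info = env.reset(seed=0)
done = False
while not done:
    action = env.action_space.sample()  # placeholder for policy(obs)
    obs, reward, terminated, truncated, info = env.step(action)
    done = terminated or truncated
env.close()
```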
