This repository has been archived by the owner on Oct 27, 2020. It is now read-only.

How to finetune? #1

Open
fanbbbb opened this issue Sep 4, 2019 · 3 comments

Comments


fanbbbb commented Sep 4, 2019

I have trained a model for the soccer P-DQN task, and I want to fine-tune a new model based on the trained one. What should I do?

cycraig (Owner) commented Sep 4, 2019

> finetune a new work based on the trained model

Hi, are you trying to do transfer learning to apply the trained model to a similar task in the HFO (soccer) environment, or just optimising the hyperparameters for (M)P-DQN on HFO?

fanbbbb (Author) commented Sep 9, 2019

Yes. I had trained a model with 1 offense agent and 0 defense NPCs, and I transferred it to 1 offense agent and 1 defense NPC by modifying some layers in PyTorch with random initialization. It worked! However, there is a new issue I would like to ask you about: I am now trying to use this model in a task with 2 offense agents and 2 defense agents (a multi-agent setting), and I have no idea how to approach it. Could you please leave me an email address if it is convenient? There are some details I would like to consult you on. Thanks a lot.
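
For anyone reading later, here is a minimal PyTorch sketch of the kind of layer transfer described above: load the parameters of the network trained on the smaller task, copy every tensor whose shape still matches, and leave the mismatched layers (e.g. the input layer, whose size grows with the state space) randomly initialised. The QNetwork class, layer names, and feature sizes below are placeholders, not taken from this repository.

```python
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    """Toy stand-in for a P-DQN network; only the layer shapes matter here."""
    def __init__(self, state_dim, hidden_dim=128, num_actions=3):
        super().__init__()
        self.input_layer = nn.Linear(state_dim, hidden_dim)
        self.hidden_layer = nn.Linear(hidden_dim, hidden_dim)
        self.output_layer = nn.Linear(hidden_dim, num_actions)

    def forward(self, state):
        x = torch.relu(self.input_layer(state))
        x = torch.relu(self.hidden_layer(x))
        return self.output_layer(x)

# Placeholder state sizes: the 1v1 task observes more features than the 1v0 task.
old_net = QNetwork(state_dim=59)   # trained on 1 offense agent, 0 defense NPCs
new_net = QNetwork(state_dim=68)   # target task: 1 offense agent, 1 defense NPC

# Copy every parameter whose shape still matches; the rest stay randomly initialised.
old_params = old_net.state_dict()
new_params = new_net.state_dict()
compatible = {k: v for k, v in old_params.items()
              if k in new_params and v.shape == new_params[k].shape}
new_params.update(compatible)
new_net.load_state_dict(new_params)
```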

cycraig (Owner) commented Sep 10, 2019

Nice work 👍

P-DQN in its original form really only considers single, independent agents. There has been some work in multi-agent reinforcement learning with parameterised actions, such as https://arxiv.org/abs/1903.04959. Your best bet would be to use their algorithm, which is designed with multiple agents in mind, or to extend P-DQN in a similar fashion.

It's also a bit difficult to transfer models trained using fewer agents on HFO since the state space increases with every agent added to the environment. If you want to discuss this more you can email me at: mpdqn at pm.me
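
One hedged way to soften that state-space mismatch, assuming the original features keep their positions at the front of the larger state vector, is to copy the old input-layer weights into the matching columns of the new, wider input layer so that only the columns for the added agents start from random initialisation. This is a sketch, not code from this repository, and the sizes are placeholders.

```python
import torch
import torch.nn as nn

old_state_dim, new_state_dim, hidden_dim = 59, 68, 128  # placeholder sizes

old_input = nn.Linear(old_state_dim, hidden_dim)  # trained on the smaller state space
new_input = nn.Linear(new_state_dim, hidden_dim)  # for the environment with more agents

with torch.no_grad():
    # Assumes the first old_state_dim features of the new state keep their old meaning.
    new_input.weight[:, :old_state_dim] = old_input.weight
    new_input.bias.copy_(old_input.bias)
```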
