Wolpertinger architecture [3] for power grid operation [7]
1. Tavakoli, A., Pardo, F. & Kormushev, P. Action Branching Architectures for Deep Reinforcement Learning. in Thirty-Second AAAI Conference on Artificial Intelligence (2018).
2. Lillicrap, T. P. et al. Continuous control with deep reinforcement learning. arXiv:1509.02971 [cs, stat] (2019).
3. Dulac-Arnold, G. et al. Deep Reinforcement Learning in Large Discrete Action Spaces. arXiv:1512.07679 [cs, stat] (2016).
4. Hausknecht, M. & Stone, P. Deep Reinforcement Learning in Parameterized Action Space. arXiv:1511.04143 [cs] (2016).
5. Chandak, Y., Theocharous, G., Kostas, J., Jordan, S. & Thomas, P. S. Learning Action Representations for Reinforcement Learning. arXiv:1902.00183 [cs, stat] (2019).
6. Donnot, B., Guyon, I., Schoenauer, M., Panciatici, P. & Marot, A. Introducing machine learning for power system operation support. arXiv:1709.09527 [cs, stat] (2017).
7. Marot, A. et al. L2RPN: Learning to Run a Power Network in a Sustainable World. 40.