An efficient off-policy reinforcement learning algorithm for the continuous-time LQR problem
Kategorien |
Konferenzbeiträge |
Jahr | 2023 |
Autorinnen/Autoren | Lopez, V. G. & Müller, M. A. |
Veröffentlicht in | Accepted for IEEE 62nd Conference on Decision and Control (CDC) |
DOI | 10.1109/CDC49753.2023.10384256 |
arXiv | 2303.17819 |