An efficient off-policy reinforcement learning algorithm for the continuous-time LQR problem
Kategorien |
Konferenzbeiträge |
Jahr | 2023 |
Autorinnen/Autoren | Lopez, V. G. & Müller, M. A. |
Veröffentlicht in | Accepted for IEEE 62nd Conference on Decision and Control (CDC) |
arXiv | 2303.17819 |