An efficient off-policy reinforcement learning algorithm for the continuous-time LQR problem
| Kategorien |
Konferenzbeiträge |
| Jahr | 2023 |
| Autorinnen/Autoren | Lopez, V. G. & Müller, M. A. |
| Veröffentlicht in | Accepted for IEEE 62nd Conference on Decision and Control (CDC) |
| DOI | 10.1109/CDC49753.2023.10384256 |
| arXiv | 2303.17819 |