How are policy gradient methods affected by the limits of control?

We study stochastic policy gradient methods from the perspective of control-theoretic limitations. Our main result is that ill-conditioned linear systems in the sense of Doyle inevitably lead to noisy gradient estimates. We also give an example of a class of stable systems in which policy gradient ...

Publication status

published

External links

https://doi.org/10.1109/CDC51059.2022.9992612

Book title

2022 IEEE 61st Conference on Decision and Control (CDC)

Pages / Article No.

5992 - 5999

Publisher

IEEE

Event

61st IEEE Conference on Decision and Control (CDC 2022), Cancun, Mexico, December 6-9, 2022

ETH Bibliography

yes
