Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space
Open access
Date
2023
Type
Conference Paper
ETH Bibliography
yes
Abstract
We consider the reinforcement learning (RL) problem with general utilities, which consists of maximizing a function of the state-action occupancy measure. Beyond the standard cumulative-reward RL setting, this problem includes constrained RL, pure exploration, and learning from demonstrations, among others, as particular cases. For this problem, we propose a simpler single-loop parameter-free normalized policy gradient algorithm. Implementing a recursive momentum variance reduction mechanism, our algorithm achieves $\tilde{O}(\epsilon^{-3})$ and $\tilde{O}(\epsilon^{-2})$ sample complexities for $\epsilon$-first-order stationarity and $\epsilon$-global optimality, respectively, under adequate assumptions. We further address the setting of large finite state-action spaces via linear function approximation of the occupancy measure and show an $\tilde{O}(\epsilon^{-4})$ sample complexity for a simple policy gradient method with a linear regression subroutine.
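The recursive momentum mechanism described in the abstract follows the general pattern of STORM-type variance reduction combined with a normalized gradient step. Below is a minimal illustrative sketch of one such update, assuming a generic parameterized policy; the function name, signature, and exact update form are assumptions for illustration and not the authors' implementation.

```python
import numpy as np

def npg_storm_step(theta, d_prev, grad_est, grad_est_prev, eta, beta):
    """One normalized policy-gradient ascent step with STORM-style
    recursive momentum (illustrative sketch, not the paper's exact algorithm).

    theta         : current policy parameters (np.ndarray)
    d_prev        : previous momentum direction
    grad_est      : stochastic gradient estimate at theta
    grad_est_prev : stochastic gradient estimate at the previous parameters,
                    computed on the SAME trajectory batch as grad_est
    eta           : step size
    beta          : momentum coefficient in (0, 1]
    """
    # Recursive momentum: correct the old direction with the gradient
    # difference evaluated on shared samples (the variance-reduction step).
    d = grad_est + (1.0 - beta) * (d_prev - grad_est_prev)
    # Normalized ascent update: the step length is eta regardless of the
    # gradient scale, which removes the need to tune to problem constants.
    theta_next = theta + eta * d / max(np.linalg.norm(d), 1e-12)
    return theta_next, d
```

In this sketch the normalization is what makes the step size insensitive to the gradient magnitude, consistent with the "parameter-free" claim; the single-loop structure comes from updating the momentum direction and the parameters together at every iteration rather than in nested epochs.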
Permanent link
https://doi.org/10.3929/ethz-b-000638297
Publication status
published
Book title
Proceedings of the 40th International Conference on Machine Learning
Journal / series
Proceedings of Machine Learning Research
Publisher
PMLR
Subject
reinforcement learning; policy gradient methods; convex RL; global convergence
Organisational unit
09729 - He, Niao / He, Niao
02219 - ETH AI Center / ETH AI Center