Hallucinated Adversarial Control for Conservative Offline Policy Evaluation

We study the problem of conservative off-policy evaluation (COPE) where given an offline dataset of environment interactions, collected by other agents, we seek to obtain a (tight) lower bound on a policy’s performance. This is crucial when deciding whether a given policy satisfies certain minimal Show more

Publication status

published

External links

https://proceedings.mlr.press/v216/rothfuss23a.html

Editor

Evans, Robin J.

Shpitser, Ilya

Book title

Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence

Journal / series

Proceedings of Machine Learning Research

Volume

216

Pages / Article No.

1774 - 1784

Publisher

PMLR

Event

39th Conference on Uncertainty in Artificial Intelligence (UAI 2023), Pittsburgh, PA, USA, July 31 - August 4, 2023

Organisational unit

03908 - Krause, Andreas / Krause, Andreas

Funding

815943 - Reliable Data-Driven Decision Making in Cyber-Physical Systems (EC)

180545 - NCCR Automation (phase I) (SNF)

More

Show all metadata

ETH Bibliography

yes

Altmetrics

Research Collection

Search

Hallucinated Adversarial Control for Conservative Offline Policy Evaluation Mendeley CSV RIS BibTeX

Hallucinated Adversarial Control for Conservative Offline Policy Evaluation

Mendeley

CSV

RIS

BibTeX