Metadata only
Date
2011-05
Type
Conference Paper
ETH Bibliography
yes
Abstract
We present a dynamic programming-based solution to the problem of maximizing the probability of attaining a target set before hitting a cemetery set for a discrete-time Markov control process. Under mild hypotheses we establish that there exists a deterministic stationary policy that achieves the maximum value of this probability. We demonstrate how the maximization of this probability can be computed through the maximization of an expected total reward until the first hitting time to either the target or the cemetery set. Martingale characterizations of thrifty, equalizing, and optimal policies in the context of our problem are also established.
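For a finite state and action space, the dynamic programming idea in the abstract can be illustrated with value iteration. The following is a minimal sketch under stated assumptions, not the paper's own construction: the function name `max_reach_avoid_probability`, the transition array layout `P[a, x, y]`, and the four-state example are all hypothetical and chosen only for illustration. The value is fixed at 1 on the target set and 0 on the cemetery set, which corresponds to collecting a reward only when the target is hit first.

```python
import numpy as np


def max_reach_avoid_probability(P, target, cemetery, tol=1e-12, max_iter=10_000):
    """Approximate the maximal probability of hitting `target` before `cemetery`.

    Assumed layout (illustrative): P has shape
    (num_actions, num_states, num_states) with P[a, x, y] = Prob(next = y | x, a).
    """
    num_states = P.shape[1]
    is_target = np.zeros(num_states, dtype=bool)
    is_target[list(target)] = True
    is_cemetery = np.zeros(num_states, dtype=bool)
    is_cemetery[list(cemetery)] = True

    # Start from the indicator of the target set and iterate upward.
    V = is_target.astype(float)
    for _ in range(max_iter):
        Q = P @ V                    # Q[a, x] = sum_y P[a, x, y] * V[y]
        V_new = Q.max(axis=0)        # maximize over actions in each state
        V_new[is_target] = 1.0       # target reached: probability one
        V_new[is_cemetery] = 0.0     # cemetery reached: probability zero
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new
    return V


# Hypothetical example: state 2 is the target, state 3 is the cemetery.
P_example = np.zeros((2, 4, 4))
P_example[0, 0] = [0.0, 0.5, 0.0, 0.5]   # action 0 in state 0
P_example[1, 0] = [0.0, 0.0, 0.3, 0.7]   # action 1 in state 0
P_example[0, 1] = [0.0, 0.0, 0.8, 0.2]   # action 0 in state 1
P_example[1, 1] = [1.0, 0.0, 0.0, 0.0]   # action 1 in state 1
P_example[:, 2, 2] = 1.0                 # target is absorbing
P_example[:, 3, 3] = 1.0                 # cemetery is absorbing

values = max_reach_avoid_probability(P_example, target={2}, cemetery={3})
print(values)
```

In this toy example the best choice in state 0 is to move toward state 1 rather than to attempt the target directly, so the computed values should be close to 0.4 for state 0 and 0.8 for state 1.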
Publication status
published
Book title
Special Issue related to IFAC Conference on Analysis and Design of Hybrid Systems (ADHS'09)
Journal / series
Nonlinear Analysis. Hybrid Systems
Volume
Pages / article number
Publisher
Elsevier
Conference
Subject
Dynamic programming; Probability maximization; Markov control processes
Organisational unit
03751 - Lygeros, John / Lygeros, John