Asymptotic optimality of finite model approximations for partially observed Markov decision processes with discounted cost

Saldı, Naci; Yuksel, S.; Linder, T.

doi:10.1109/TAC.2019.2907172

dc.contributor.author	Saldı, Naci
dc.contributor.author	Yuksel, S.
dc.contributor.author	Linder, T.
dc.contributor.department	Natural and Mathematical Sciences
dc.contributor.ozuauthor	SALDI, Naci
dc.date.accessioned	2021-02-10T12:28:01Z
dc.date.available	2021-02-10T12:28:01Z
dc.date.issued	2020-01
dc.description.abstract	We consider finite model approximations of discrete-time partially observed Markov decision processes (POMDPs) under the discounted cost criterion. After converting the original partially observed stochastic control problem to a fully observed one on the belief space, the finite models are obtained through uniform quantization of the state and action spaces of the belief space Markov decision process (MDP). Under mild assumptions on the components of the original model, it is established that the policies obtained from these finite models are nearly optimal for the belief space MDP, and hence for the original partially observed problem. The assumptions essentially require that the belief space MDP satisfy a mild weak continuity condition. We provide an example and introduce explicit approximation procedures for the quantization of the set of probability measures on the state space of the POMDP (i.e., the belief space).
dc.description.sponsorship	Natural Sciences and Engineering Research Council of Canada (NSERC)
dc.identifier.doi	10.1109/TAC.2019.2907172
dc.identifier.endpage	142
dc.identifier.issn	0018-9286
dc.identifier.issue	1
dc.identifier.scopus	2-s2.0-85077786832
dc.identifier.startpage	130
dc.identifier.uri	http://hdl.handle.net/10679/7292
dc.identifier.uri	https://doi.org/10.1109/TAC.2019.2907172
dc.identifier.volume	65
dc.identifier.wos	000506851100010
dc.language.iso	eng
dc.peerreviewed	yes
dc.publicationstatus	Published
dc.publisher	IEEE
dc.relation.ispartof	IEEE Transactions on Automatic Control
dc.relation.publicationcategory	International Refereed Journal
dc.rights	restrictedAccess
dc.subject.keywords	Aerospace electronics
dc.subject.keywords	Convergence
dc.subject.keywords	Quantization (signal)
dc.subject.keywords	Markov processes
dc.subject.keywords	Computational modeling
dc.subject.keywords	Cost function
dc.subject.keywords	Approximations
dc.subject.keywords	Markov decision processes
dc.subject.keywords	Non-linear filtering
dc.subject.keywords	Quantization
dc.subject.keywords	Stochastic control
dc.title	Asymptotic optimality of finite model approximations for partially observed Markov decision processes with discounted cost
dc.type	article
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	7a8a2b87-4f48-440a-a491-3c0b2888cbca
relation.isOrgUnitOfPublication.latestForDiscovery	7a8a2b87-4f48-440a-a491-3c0b2888cbca
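
Illustrative note on the abstract: the abstract refers to quantizing the belief space, i.e., the set of probability measures on the POMDP state space. As a minimal sketch only, and assuming a finite state space, one common way to do this is to round a belief vector onto the grid of probability vectors whose entries are multiples of 1/n. The function names `quantize_belief` and `grid_size` and the rounding rule below are illustrative assumptions, not necessarily the procedure used in the article.

```python
from math import comb, floor


def quantize_belief(belief, n):
    """Round a probability vector to a nearby point of the grid
    {(k_1/n, ..., k_m/n) : k_i nonnegative integers, sum of k_i = n}.

    Hypothetical helper for illustration; assumes a finite state space.
    """
    if n <= 0:
        raise ValueError("n must be a positive integer")
    if any(p < 0 for p in belief) or abs(sum(belief) - 1.0) > 1e-9:
        raise ValueError("belief must be a probability vector")

    scaled = [n * p for p in belief]
    # Round each scaled entry to the nearest integer (halves round up).
    counts = [floor(s + 0.5) for s in scaled]

    # Independent rounding may leave the counts summing to more or less than n.
    excess = sum(counts) - n
    if excess != 0:
        # Indices sorted by rounding error, from most rounded-down to most rounded-up.
        order = sorted(range(len(counts)), key=lambda i: counts[i] - scaled[i])
        if excess > 0:
            # Take one unit back from the entries that were rounded up the most.
            for i in order[-excess:]:
                counts[i] -= 1
        else:
            # Give one unit to the entries that were rounded down the most.
            for i in order[:-excess]:
                counts[i] += 1

    return [k / n for k in counts]


def grid_size(num_states, n):
    """Number of grid points: compositions of n into num_states nonnegative parts."""
    return comb(n + num_states - 1, num_states - 1)


if __name__ == "__main__":
    print(quantize_belief([0.5, 0.3, 0.2], 4))
    print(grid_size(3, 4))
```

Increasing n refines the grid, so the quantized belief space stays finite while approximating the original one more closely; the grid size grows quickly with both n and the number of states.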

Collections

Natural and Mathematical Sciences
