Convergence of Markov decision processes with constraints and state-action dependent discount factor

Source: Science China Mathematics (English edition)
This paper is concerned with the convergence of a sequence of discrete-time Markov decision processes (DTMDPs) with constraints, state-action dependent discount factors, and possibly unbounded costs. Using the convex analytic approach under mild conditions, we prove that the optimal values and optimal policies of the original DTMDPs converge to those of the "limit" one. Furthermore, we show that any countable-state DTMDP can be approximated by a sequence of finite-state DTMDPs, which are constructed using the truncation technique. Finally, we illustrate the approximation by solving a controlled queueing system numerically, and give the corresponding error bound of the approximation.
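The truncation idea in the abstract can be illustrated with a rough sketch. The code below is not the paper's method: it drops the constraints and the convex analytic (occupation measure) formulation, and instead runs plain value iteration on a toy single-server queue cut off at `n_states` states. The arrival and service probabilities, the cost structure, and the form of the state-action dependent discount factor `discount(x, a)` are all hypothetical choices made only for illustration.

```python
def truncated_queue_value(n_states, tol=1e-10, max_iter=100_000):
    """Value iteration on a toy queue truncated to states 0..n_states-1.

    All model parameters are hypothetical and are not taken from the paper.
    Constraints are ignored; only the discounted cost is minimised.
    """
    arrival = 0.3              # assumed probability of an arrival per step
    service = (0.4, 0.6)       # assumed service probability for actions 0 and 1
    action_cost = (0.0, 1.0)   # assumed extra cost of the faster action

    def discount(x, a):
        # Assumed state-action dependent discount factor, always in [0.8, 0.9].
        return 0.85 + 0.05 / (x + 1) - 0.05 * a

    v = [0.0] * n_states
    for _ in range(max_iter):
        new_v = []
        for x in range(n_states):
            best = float("inf")
            for a in (0, 1):
                up = min(x + 1, n_states - 1)   # truncation: arrivals at the cap stay put
                down = max(x - 1, 0)
                mu = service[a] if x > 0 else 0.0
                expected = (arrival * v[up] + mu * v[down]
                            + (1.0 - arrival - mu) * v[x])
                # One-step cost is holding cost x plus the action cost.
                best = min(best, x + action_cost[a] + discount(x, a) * expected)
            new_v.append(best)
        gap = max(abs(p - q) for p, q in zip(new_v, v))
        v = new_v
        if gap < tol:
            break
    return v


# Value at the empty-queue state for increasingly large truncation levels.
for n in (5, 10, 20, 40):
    print(n, truncated_queue_value(n)[0])
```

In this toy model the value at state 0 should settle as the truncation level grows, which is the qualitative behaviour the abstract describes. The error bound the paper derives is not reproduced here.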
Other literature
In this paper, we give a new genus-4 topological recursion relation for Gromov-Witten invariants of compact symplectic manifolds via Pixton's relations on the moduli space of curves. As an application, we prove that Pixton's relations imply a known top
Linear factor models are familiar tools used in many fields. Several pioneering works established foundational theoretical results on the quasi-maximum likelihood estimator for high-dimensional linear factor models. Their results are based on a critic
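This paper is dedicated to studying the following elliptic system of Hamiltonian type: $$\begin{cases} -\varepsilon^2\Delta u + u + V(x)v = Q(x)F_v(u,v), & x\in\mathbb{R}^N,\\ -\varepsilon^2\Delta v + v + V(x)u = Q(x)F_u(u,v), & x\in\mathbb{R}^N,\\ |u(x)|+|v(x)|\to 0 & \text{as } |x|\to\infty, \end{cases}$$ where $N\geq 3$, $V, Q\in C(\mathbb{R}^N,\mathbb{R})$, $V(x)$ is allowed to be sign-changing and $\inf Q>0$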
Natural aquifers usually exhibit complex physical and chemical heterogeneities, which are key factors complicating kinetic processes, such as contaminant transport and transformation, posing a great challenge in the remediation of contaminated groundwater. Aq
In this paper, we study steady Ricci solitons with a linear decay of sectional curvature. In particular, we give a complete classification of 3-dimensional steady Ricci solitons and 4-dimensional κ-noncollapsed steady Ricci solitons with non-negative section
We derive averaging formulas for the Lefschetz coincidence numbers, the Nielsen coincidence numbers and the Reidemeister coincidence numbers of maps on infra-solvmanifolds modeled on a connected and simply connected solvable Lie group of type (R). As an app
Dear Editor, The genetic mechanism of large-scale interspecies traits, including evolutionary novelties and the characteristics of high taxa, is a central issue in evolutionary biology. At present, genome-wide association studies (GWAS) are known as one of the
Extreme Meiyu rainfall occurred over the Yangtze River valley (YRV) in 2020, from early June to the end of July, with the largest accumulated precipitation on record since 1961. The present study aims to examine the possible effect of sea surfac
Eastern China experienced excessive Meiyu rainfall in the summer of 2020, with a long rainy season and frequent extreme rainfall events. Extreme rainfall occurred on daily to monthly time scales. In particular, persistent heavy rainfall events occurred; e.g., t