【摘 要】
:
This talk concerns with the constrained optimality problem(COP)of first passage discrete-time Markov decision processes with multi-constraints,state-depende
【机 构】
:
Sun Yat-sen University,PRC
【出 处】
:
The 11th Workshop on Markov Processes and Related topics(第十一
论文部分内容阅读
This talk concerns with the constrained optimality problem(COP)of first passage discrete-time Markov decision processes with multi-constraints,state-dependent discount factors,and possibly unbounded costs.By means of the properties of a so-called occupation measure of a policy,we show that the constrained optimality problem is equivalent to an(infinitedimensional)linear programming on the set of occupation measures,and thus prove the existence of an optimal policy under suitable conditions.Furthermore,using the equivalence between the constrained optimality problem and the linear programming,we obtain an exact form of an optimal policy for the case of finite states and actions.Finally,as an example,a controlled queueing system is given to illustrate our results.
其他文献
采用液相无焰燃烧合成尖晶石型LiCu0.05Mn1.95O4锂离子电池正极材料,500℃下燃烧反应3h所得产物,再经700℃二次焙烧3h,通过XRD检测分析其晶相组成,并通过SEM观察产物的微
以碳酸锂、碳酸锰、硝酸铝为原料、葡萄糖为燃料,采用固相燃烧合成在 600 ℃温度下进行二次焙烧6 h制备尖晶石型LiMn2O4和LiAl0.04Mn1.96O4正极材料.
Earlier work by Diaconis and Saloff-Coste gives a spectral criterion for a maximum separation cutoff to occur for birth and death chains.Ding,Lubetzky and P
A well known theorem of Delmotte is that Gaussian bounds,parabolic Harnack inequality,and the combination of volume doubling and Poincaré inequality are eq
We derive Euler-Poincaré equations for stochastic processes defined on semidirect product Lie algebras.
We prove an ergodic theorem and a mean ergodic theorem in the random periodic regime on a Polish space.
We study the eigenvalues of a Laplace-Beltrami operator defined on the set of the symmetric polynomials,where the eigenvalues are expressed in terms of part
Under hypercontractivity and Lp-integrability of transition density for some p > 1,we use the perturbation theory of linear operators to obtain the long tim
Suppose heterogeneous customers arrive at an observable queueing system with different preference of service.A system administrator controls the queue lengt
In this talk,I will present a construction of obliquely reflected Brownian motions in all bounded simply connected planar domains,including non-smooth domai