FIRST PASSAGE MARKOV DECISION PROCESSES WITH CONSTRAINTS AND VARYING DISCOUNT FACTORS

来源 :The 11th Workshop on Markov Processes and Related topics(第十一 | 被引量 : 0次 | 上传用户:winbourbit
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  This talk concerns with the constrained optimality problem(COP)of first passage discrete-time Markov decision processes with multi-constraints,state-dependent discount factors,and possibly unbounded costs.By means of the properties of a so-called occupation measure of a policy,we show that the constrained optimality problem is equivalent to an(infinitedimensional)linear programming on the set of occupation measures,and thus prove the existence of an optimal policy under suitable conditions.Furthermore,using the equivalence between the constrained optimality problem and the linear programming,we obtain an exact form of an optimal policy for the case of finite states and actions.Finally,as an example,a controlled queueing system is given to illustrate our results.
其他文献
  采用液相无焰燃烧合成尖晶石型LiCu0.05Mn1.95O4锂离子电池正极材料,500℃下燃烧反应3h所得产物,再经700℃二次焙烧3h,通过XRD检测分析其晶相组成,并通过SEM观察产物的微
  以碳酸锂、碳酸锰、硝酸铝为原料、葡萄糖为燃料,采用固相燃烧合成在 600 ℃温度下进行二次焙烧6 h制备尖晶石型LiMn2O4和LiAl0.04Mn1.96O4正极材料.
  Earlier work by Diaconis and Saloff-Coste gives a spectral criterion for a maximum separation cutoff to occur for birth and death chains.Ding,Lubetzky and P
  A well known theorem of Delmotte is that Gaussian bounds,parabolic Harnack inequality,and the combination of volume doubling and Poincaré inequality are eq
会议
  We derive Euler-Poincaré equations for stochastic processes defined on semidirect product Lie algebras.
会议
  We prove an ergodic theorem and a mean ergodic theorem in the random periodic regime on a Polish space.
  We study the eigenvalues of a Laplace-Beltrami operator defined on the set of the symmetric polynomials,where the eigenvalues are expressed in terms of part
会议
  Under hypercontractivity and Lp-integrability of transition density for some p > 1,we use the perturbation theory of linear operators to obtain the long tim
  Suppose heterogeneous customers arrive at an observable queueing system with different preference of service.A system administrator controls the queue lengt
会议
  In this talk,I will present a construction of obliquely reflected Brownian motions in all bounded simply connected planar domains,including non-smooth domai
会议