A Mean-Variance Problem for Discounted Markov Decision Processes

来源 :The Third IMS-China International Conference on Statistics a | 被引量 : 0次 | 上传用户:phoenixs
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  In this talk we introduce a mean-variance problem for Markov decision processes (MDPs).Different from the usual goal in MDPs toward an optimal policy for a specified class of policies, we aim to obtain a so-called mean-variance optimal policy that minimizes the variance over a set of policies with a given expected reward.
其他文献
  We propose parametric and nonparametric data-driven approaches to model covariance structures for longitudinal data.Based on a modified Cholesky decompositi
【摘 要】自我效能感的概念一经Bandura.A.提出,便引起广泛注意,有许多心理学家也对此进行了研究。近年来,随着世界范围内对教育发展的重视,教师的自我效能感也成为研究的热点问题。教师自我效能感不仅直接影响着教师教育行为,导引着教师知觉到自身的教育行为与结果,而且影响到学生的学业和个人成长。  【关键词】自我效能感;教师  Bandura认为,人们对其能力的判断在其自我调节体统中起主要作用,将
  In this short article, we shall study one-dimensional local Dirichlet spaces associated with linear diffusions.One result, which has its independent interes
会议
  Backward Stochastic Differential Equation (BSDE) has been well studied and widely applied.The main difference from the original stochastic differential equa
会议
  The urn model is a popular in many disciplines.In particular, it is extensively used in treatment allocation schemes in clinical trials.It is a hard work to
会议
  Brownian motion with darning (BMD in abbreviation) is a diffusion process obtained from Brownian motion by rendering each hole in the space into one point.I
会议
利用一块曲率半径适当的凹面板可使各种反射图象达到难以觉察的程度。本文将论述这一发展,并引证它给图象、亮度或清晰度带来的损失是最小的。一台电视机,根据其屏幕尺寸,有
系统“锁死”问题 磁带机在不同的使用时间和场合,不可能从开始加上电一直到主机系统运行结束才关电,特别是在实时系统中应该允许磁带机随时联机或者脱机。可是,目前常常因
  In this paper, with the notion of independent identically distributed (IID) random variables under sub-linear expectations initiated by Peng, we investigate
会议
回归共同体主义与拯救德性——现代德性伦理学评介龚群(中国人民大学哲学系100872)现代西方伦理学以元伦理学、规范伦理学和应用伦理学而三分天下。不过,随着近20年来规范伦理学理论的发