A Mean-Variance Problem for Discounted Markov Decision Processes

来源 :The Third IMS-China International Conference on Statistics a | 被引量 : 0次 | 上传用户：phoenixs

【摘要】

：

　　In this talk we introduce a mean-variance problem for Markov decision processes (MDPs).Different from the usual goal in MDPs toward an optimal policy for a

【作者】

：

XianpingGuo[1]LiuerYe[1]G.Yin[2]

【机构】

：

Zhongshan University, CHINA

【出处】

：

The Third IMS-China International Conference on Statistics a

【发表日期】

：

2011年期

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

　　In this talk we introduce a mean-variance problem for Markov decision processes (MDPs).Different from the usual goal in MDPs toward an optimal policy for a specified class of policies, we aim to obtain a so-called mean-variance optimal policy that minimizes the variance over a set of policies with a given expected reward.

其他文献

Parametric and Nonparametric Regression of Covariance Structures in Longitudinal Studies

　　We propose parametric and nonparametric data-driven approaches to model covariance structures for longitudinal data.Based on a modified Cholesky decompositi

会议

Covariance modellingParametric and Non-parametric Regression ModelsGen-eralize

浅析怎样培养教师自我效能感

【摘要】自我效能感的概念一经Bandura.A.提出，便引起广泛注意，有许多心理学家也对此进行了研究。近年来，随着世界范围内对教育发展的重视，教师的自我效能感也成为研究的热点问题。教师自我效能感不仅直接影响着教师教育行为，导引着教师知觉到自身的教育行为与结果，而且影响到学生的学业和个人成长。　　【关键词】自我效能感；教师　　Bandura认为，人们对其能力的判断在其自我调节体统中起主要作用，将

期刊

自我效能感教师

Representation of Dirichlet form of 1-dim diffusions

　　In this short article, we shall study one-dimensional local Dirichlet spaces associated with linear diffusions.One result, which has its independent interes

会议

Statistical inference for BSDE and related problems

　　Backward Stochastic Differential Equation (BSDE) has been well studied and widely applied.The main difference from the original stochastic differential equa

会议

Asymptotic properties of urn models with applications to adaptive designs in clinical trials

　　The urn model is a popular in many disciplines.In particular, it is extensively used in treatment allocation schemes in clinical trials.It is a hard work to

会议

Brownian motion with darning and Komatu-Leowner equation for multiply connected domains

　　Brownian motion with darning (BMD in abbreviation) is a diffusion process obtained from Brownian motion by rendering each hole in the space into one point.I

会议

不晃眼的阴极射线管

利用一块曲率半径适当的凹面板可使各种反射图象达到难以觉察的程度。本文将论述这一发展,并引证它给图象、亮度或清晰度带来的损失是最小的。一台电视机,根据其屏幕尺寸,有

期刊

曲率半径观看者光学透镜屏幕尺寸镜面反射拍摄距离玻壳无限远抗反射膜反射镜面

消除因磁带机导致的计算机系统的“锁死”现象

系统“锁死”问题磁带机在不同的使用时间和场合,不可能从开始加上电一直到主机系统运行结束才关电,特别是在实时系统中应该允许磁带机随时联机或者脱机。可是,目前常常因

期刊

计算机系统外部设备主机系统实时系统错码通断零压正弦电压交流电源可控硅电路

A law of the iterated logarithm sublinear expectations

　　In this paper, with the notion of independent identically distributed (IID) random variables under sub-linear expectations initiated by Peng, we investigate

会议

回归共同体主义与拯救德性——现代德性伦理学评介

回归共同体主义与拯救德性——现代德性伦理学评介龚群（中国人民大学哲学系１００８７２）现代西方伦理学以元伦理学、规范伦理学和应用伦理学而三分天下。不过，随着近２０年来规范伦理学理论的发

期刊

德性伦理学规范伦理学麦金太尔龚群道义论道德规则哲学系道德价值德性论中国人民大学

A Mean-Variance Problem for Discounted Markov Decision Processes

与本文相关的学术论文