Variance minimization for finite continuous-time markov decision processes

来源 :IMS-China International Conference on Statistics and Probabi | 被引量 : 0次 | 上传用户：X446873887

【摘要】

：

　　This talk concerns finite continuous-time Markov decision processes (CTMDPs) with the long-run average variance minimization (AV) criterion, the goal of whi

【作者】

：

Xianping Guo

【机构】

：

ZhongshanUniversity

【出处】

：

IMS-China International Conference on Statistics and Probabi

【发表日期】

：

2008年6期

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

　　This talk concerns finite continuous-time Markov decision processes (CTMDPs) with the long-run average variance minimization (AV) criterion, the goal of which is to find a policy with minimum AV over a class of expected average optimal policies.We first show that in general an AV minimization policy may exist by using examples.Furthermore, in order to obtain an AV minimization policy, we then consider a spccial but important class of CTMDPs larger than the class of ergodic CTMDPs.By using a martingale technique we prove that the AV criterion for the class of CTMDPs can be transformed into an equivalent expected average criterion, and thus the existence and calculation of a AV minimization policy are obtained by a policy iteration algorithm in an finite munber of iterations.

其他文献

Simple models for the description of molecular conductors

会议

Comparison simulationexperiment in the Low Energy-Loss regionapplication to lithium battery material

会议

Ultra-thin oxide filmsnew materials with unprecedented properties

会议

ab-initio Density Matrix Renormalization Group and Tensor Network wavefunctions

会议

Theoretical studies of weak Si…H interactions

会议

Multiple Scale Modeling of Supramolecular Architectures in Solution by 3D molecular theory of salvat

会议

On simple semi-classical tools to improve the capability of classical mechanics to describe molecula

会议

Dynamics of single-file water chains inside nanoscale channelsphysics,biological significance and ap

会议

Asymptotic properties of the nonparametric maximum likelihood estimation for doubly interval-censore

　　We consider asymptotic properties of the nonparametric maximum likelihood estimate (NPMLE) of a failure time distribution function based on doubly interval

会议

Finite Markov chain imbedding and its application to matching probability between two MAarkov depend

　　The length of the longest matching between two DNA sequences plays an im portant rule in genomic studies.The exact distribution of longest matching remains

会议

Variance minimization for finite continuous-time markov decision processes

与本文相关的学术论文