Variance minimization for finite continuous-time markov decision processes

来源 :IMS-China International Conference on Statistics and Probabi | 被引量 : 0次 | 上传用户:X446873887
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  This talk concerns finite continuous-time Markov decision processes (CTMDPs) with the long-run average variance minimization (AV) criterion, the goal of which is to find a policy with minimum AV over a class of expected average optimal policies.We first show that in general an AV minimization policy may exist by using examples.Furthermore, in order to obtain an AV minimization policy, we then consider a spccial but important class of CTMDPs larger than the class of ergodic CTMDPs.By using a martingale technique we prove that the AV criterion for the class of CTMDPs can be transformed into an equivalent expected average criterion, and thus the existence and calculation of a AV minimization policy are obtained by a policy iteration algorithm in an finite munber of iterations.
其他文献
会议
会议
会议
会议
会议
会议
会议
会议
  We consider asymptotic properties of the nonparametric maximum likelihood estimate (NPMLE) of a failure time distribution function based on doubly interval
会议
  The length of the longest matching between two DNA sequences plays an im portant rule in genomic studies.The exact distribution of longest matching remains
会议