Approximate dynamic programming solutions with a single network adaptive critic for a class of nonli

Source: Control Theory and Applications (English Edition)
The approximate dynamic programming (ADP) formulation, implemented with an adaptive critic (AC)-based neural network (NN) structure, has evolved into a powerful technique for solving the Hamilton-Jacobi-Bellman (HJB) equations. As interest in ADP and AC solutions grows, there is a pressing need to consider the factors that enable their implementation. A typical AC structure consists of two interacting NNs, which is computationally expensive. In this paper, a new architecture, called the "cost-function-based single network adaptive critic (J-SNAC)", is presented, which eliminates one of the networks in a typical AC structure. This approach is applicable to a wide class of nonlinear systems in engineering. To demonstrate the benefits of J-SNAC and the control synthesis it supports, two problems have been solved with both the AC and the J-SNAC approaches. The results show that J-SNAC saves about 50% of the computational cost while matching the accuracy of the dual-network structure in solving for the optimal control. Furthermore, convergence of the J-SNAC iterations, which reduce to a least-squares problem, is discussed; for linear systems, the iterative process is shown to reduce to solving the familiar algebraic Riccati equation.
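The linear-system remark can be illustrated with a small numerical sketch. It is not the paper's J-SNAC algorithm: it assumes a discrete-time linear system x[k+1] = A x[k] + B u[k] with quadratic stage cost x'Qx + u'Ru, a quadratic cost-to-go J(x) = x'Px, and example matrices chosen only for illustration. Under those assumptions, repeatedly backing up the cost-to-go is the Riccati recursion, and its fixed point satisfies the discrete-time algebraic Riccati equation.

```python
import numpy as np


def riccati_value_iteration(A, B, Q, R, tol=1e-10, max_iter=100_000):
    """Iterate the cost-to-go J_k(x) = x' P_k x until P_k stops changing.

    Illustrative sketch only: discrete-time LQR value iteration, used here
    to show a cost-function iteration settling on the Riccati solution.
    """
    P = np.zeros_like(Q, dtype=float)
    for _ in range(max_iter):
        # Greedy feedback gain for the current cost-to-go estimate.
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        # Bellman backup: P_next = Q + A'PA - A'PB (R + B'PB)^{-1} B'PA.
        P_next = Q + A.T @ P @ (A - B @ K)
        if np.max(np.abs(P_next - P)) < tol:
            # Recompute the gain so that it matches the returned P.
            K = np.linalg.solve(R + B.T @ P_next @ B, B.T @ P_next @ A)
            return P_next, K
        P = P_next
    raise RuntimeError("value iteration did not converge")


# Hypothetical example system (a discretized double integrator).
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
Q = np.eye(2)
R = np.array([[1.0]])

P, K = riccati_value_iteration(A, B, Q, R)

# Residual of the discrete-time algebraic Riccati equation at the fixed point.
residual = P - (Q + A.T @ P @ A
                - A.T @ P @ B @ np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A))
print("max |Riccati residual|:", np.max(np.abs(residual)))
print("closed-loop eigenvalue magnitudes:", np.abs(np.linalg.eigvals(A - B @ K)))
```

The control law recovered from the converged cost is u = -K x. For a nonlinear system the quadratic form x'Px would be replaced by a critic network, which is where a least-squares fit of the network weights would enter.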