Asymptotic tracking by a reinforcement learning-based adaptive critic controller

Source: Journal of Control Theory and Applications
Adaptive critic (AC) based controllers are typically discrete and/or yield a uniformly ultimately bounded stability result because of the presence of disturbances and unknown approximation errors. A continuous-time AC controller is developed that yields asymptotic tracking of a class of uncertain nonlinear systems with bounded disturbances. The proposed AC-based controller consists of two neural networks (NNs): an action NN, also called the actor, which approximates the plant dynamics and generates appropriate control actions; and a critic NN, which evaluates the performance of the actor based on some performance index. The reinforcement signal from the critic is used to develop a composite weight tuning law for the action NN based on Lyapunov stability analysis. A recently developed robust feedback technique, robust integral of the sign of the error (RISE), is used in conjunction with the feedforward action neural network to yield a semiglobal asymptotic result. Experimental results are provided that illustrate the performance of the developed controller.
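
The abstract names the RISE feedback term but does not state it. As a rough illustration only, the sketch below implements one commonly cited form of RISE feedback, u(t) = (ks + 1)(e2(t) - e2(0)) + integral over [0, t] of [(ks + 1) * alpha * e2 + beta * sgn(e2)], using forward-Euler integration. This form, the gain names ks, alpha and beta, the filtered error e2, and the class name RiseFeedback are all assumptions for illustration and are not taken from the paper; the actor and critic networks are omitted.

```python
class RiseFeedback:
    """Hypothetical discrete-time sketch of a RISE feedback term.

    Assumed form (not taken from the paper):
        u(t) = (ks + 1) * (e2(t) - e2(0))
               + integral_0^t [(ks + 1) * alpha * e2 + beta * sgn(e2)] dtau
    """

    def __init__(self, ks, alpha, beta):
        self.ks = ks          # feedback gain (assumed name)
        self.alpha = alpha    # error-filter gain (assumed name)
        self.beta = beta      # gain on the sign term (assumed name)
        self.e2_initial = None
        self.integral = 0.0

    def update(self, e2, dt):
        """Advance the integral by one Euler step and return the feedback."""
        if self.e2_initial is None:
            self.e2_initial = e2  # latch e2(0) on the first call
        gain = self.ks + 1.0
        sign = (e2 > 0) - (e2 < 0)  # sgn(e2), with sgn(0) = 0
        self.integral += (gain * self.alpha * e2 + self.beta * sign) * dt
        return gain * (e2 - self.e2_initial) + self.integral


if __name__ == "__main__":
    rise = RiseFeedback(ks=1.0, alpha=2.0, beta=0.5)
    for e2 in (1.0, 0.5, 0.0):
        print(rise.update(e2, dt=0.1))
```

In a controller of the kind the abstract describes, this feedback output would be added to the feedforward estimate produced by the action NN. The sign term is integrated rather than applied directly, which is why the resulting control signal is continuous.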