Incorporation of Perception-based Information in Robot Learning Using Fuzzy Reinforcement Learning A

Source: Journal of Ocean University of Qingdao
Robot learning in unstructured environments has proved to be an extremely challenging problem, mainly because of the many uncertainties always present in the real world. Human beings, on the other hand, seem to cope very well with uncertain and unpredictable environments, often relying on perception-based information. Furthermore, human beings can also use perceptions to focus their learning on those parts of the perception-action space that are actually relevant to the task. We therefore conduct research aimed at improving robot learning through the incorporation of both perception-based and measurement-based information. To this end, a fuzzy reinforcement learning (FRL) agent is proposed in this paper. Based on a neural-fuzzy architecture, different kinds of information can be incorporated into the FRL agent to initialise its action network, critic network and evaluation feedback module so as to accelerate its learning. By making use of the global optimisation capability of genetic algorithms (GAs), a GA-based FRL (GAFRL) agent is presented to address the local minima problem in traditional actor-critic reinforcement learning. In turn, the prediction capability of the critic network allows the GAs to perform a more effective global search. Different GAFRL agents are constructed and verified using the simulation model of a physical biped robot. The simulation analysis shows that the biped learning rate for dynamic balance can be improved by incorporating perception-based information on biped balancing and walking evaluation. The biped robot can find application in ocean exploration, detection and sea rescue, as well as in military maritime activities.
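As a rough illustration of how perception-based information might seed an action network, the sketch below encodes three linguistic rules about body tilt as fuzzy sets and uses them as initial rule consequents. It is a minimal sketch under stated assumptions, not the architecture described in the paper: the function names, the tilt labels, the membership parameters and the consequent values are all hypothetical.

```python
# Hypothetical sketch: none of these names or numbers come from the paper.

def triangular(x, left, peak, right):
    """Degree of membership of x in a triangular fuzzy set."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)

# Assumed linguistic labels for a normalised body tilt in [-1, 1].
TILT_SETS = {
    "backward": (-2.0, -1.0, 0.0),
    "upright": (-1.0, 0.0, 1.0),
    "forward": (0.0, 1.0, 2.0),
}

# Perception-based rules such as "if tilt is forward, lean back",
# written as initial rule consequents (assumed corrective actions).
# A learning agent would adjust these values later.
INITIAL_CONSEQUENTS = {
    "backward": 0.5,
    "upright": 0.0,
    "forward": -0.5,
}

def fuzzy_action(tilt, consequents=INITIAL_CONSEQUENTS):
    """Weighted-average defuzzification over the fired rules."""
    strengths = {
        label: triangular(tilt, *params)
        for label, params in TILT_SETS.items()
    }
    total = sum(strengths.values())
    if total == 0.0:
        # No rule fires outside the covered range; take no action.
        return 0.0
    weighted = sum(strengths[label] * consequents[label] for label in strengths)
    return weighted / total
```

Calling `fuzzy_action(0.5)` blends the "upright" and "forward" rules in equal measure. Starting from such rule-based consequents rather than random values is one plausible reading of what initialising the action network with perception-based information means.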