Q学习中基于模糊规则的强化函数设计方法

来源 :模式识别与人工智能 | 被引量 : 0次 | 上传用户:mryangjinhui
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Q 学习算法是求解信息不完全马尔可夫决策问题的一种强化学习方法.Q 学习中强化信号的设计是影响学习效果的重要因素.本文提出一种基于模糊规则的 Q 学习强化信号的设计方法,提高强化学习的性能.并将该方法应用于单交叉口信号灯最优控制中,根据交通流的变化自适应调整交叉口信号灯的相位切换时间和相位次序.通过 Paramics 微观交通仿真软件验证,说明在解决交通控制问题中,使用基于模糊规则的 Q 学习的学习效果优于传统 Q 学习. Q learning algorithm is an intensive learning method for solving incomplete Markov decision problems.Q-learning is an important factor that affects the learning effect.This paper presents a design method of Q-learning enhanced signal based on fuzzy rules To improve the performance of reinforcement learning.And this method is applied to the optimal control of single-intersection traffic lights.According to the changes of traffic flow, the phase switching time and phase order of traffic lights can be adaptively adjusted.According to Paramics microscopic traffic simulation software, In solving traffic control problems, learning using Q learning based on fuzzy rules is better than traditional Q learning.
其他文献
Cyclic voltammetry and chronoamperometry were used to investigate the electrochemical behavior of Pr3+ ions electrochemical parameters were measured. Potentiost
Four different topics for high-temperature components, namely the development of the assessment codes for the structural integrity of high-temperature component
Blast vibration analysis constitutes the foundation for studying the control of blasting vibration damage and provides the precondition of controlling blasting
On the basis of analyzing the machine-workpiece-tool system, the main factors affecting diameter errors in bars turning are considered, and the mathematic model
Based on the advanced integrated technology of materials preparation and formation, a new pattern ZnAl-Mg-RE anti-corrosion coating for steel structure sustaina
Failure, especially induced by cracks, usually occurred in the service process of chemical equipment, which could cause the medium leakage, fire hazard and expl
目的掌握饮用高砷水地区居民砷中毒发病状况,为防治提供科学依据.方法以村为单位,采用整群拉网式调查.结果调查724人,发现可疑病人44人,轻度病人18例,砷中毒病人62例,患病率8
The present status and development trends of nano-composite coatings were briefly introduced. The nano-SiO2 was dispersed into crylic acid resin by ultrasonic w
A complete state variable current-mode biquadratic filter built by duo-output CCⅡ (DOCCⅡ) with variable current gain is presented. All the coefficients of the
A molecular dynamics simulation study has been performed for the formation and evolution characteristics of nano-clusters in a large-scale system consisting of