Parallel exploration via negatively correlated search

来源 :计算机科学前沿 | 被引量 : 0次 | 上传用户:baiqing001
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Effective exploration is key to a successful search process.The recently proposed negatively correlated search(NCS) tries to achieve this by coordinated parallel exploration,where a set of search processes are driven to be negatively cor-related so that different promising areas of the search space can be visited simultaneously.Despite successful applications of NCS,the negatively correlated search behaviors were mostly devised by intuition,while deeper (e.g.,mathematical) under-standing is missing.In this paper,a more principled NCS,namely NCNES,is presented,showing that the parallel explo-ration is equivalent to a process of seeking probabilistic models that both lead to solutions of high quality and are distant from previous obtained probabilistic models.Reinforcement learn-ing,for which exploration is of particular importance,are con-sidered for empirical assessment.The proposed NCNES is ap-plied to directly train a deep convolution network with 1.7 mil-lion connection weights for playing Atari games.Empirical re-sults show that the significant advantages of NCNES,especially on games with uncertain and delayed rewards,can be highly owed to the effective parallel exploration ability.
其他文献
针对新型航标装置研究了浮体在规则波作用下的随波性能及浮体形状对浮体结构运动的影响,结合流体力学、模型试验及数值模拟等理论知识,通过AQWA软件对新型航标装置浮体结构的水动力特性进行数值模拟,分析了在规则波作用下四种形状浮体结构的运动响应幅值算子、附加质量和辐射阻尼及一阶波浪激振力随入射波频率的变化规律,验证了新型航标装置圆柱形浮体运行可靠性.
1 IntroductionrnBy making the best of the information technology in smart grid,considerable power energy can be effectively saved[1,2].How-ever,frequently collecting user\'s power consumption data in-curs privacy disclosure issues.Meanwhile,data integri
期刊
Solving the optimization problem to approach a Nash Equilibrium point plays an important role in imperfect information games,e.g.,StarCraft and poker.Neural Fictitious Self-Play (NFSP) is an effective algorithm that learns approxi-mate Nash Equilibrium of
分析影响港口大型桥式起重机海上运输及现场安装方式的主要因素,对其运输、安装方式进行分类,阐述了不同运输、安装方式的技术特点及优缺点,对比分析其应用范围、适用工况,并结合工程实例进行说明.
Multi-label classification aims to assign a set of proper labels for each instance,where distance metric learning can help improve the generalization ability of instance-based multi-label classification models.Existing multi-label metric learning techniqu
在对矿石码头火车装车站缓冲仓方案论证过程中,根据装车能力的不同研究了缓冲仓的结构特点,提出缓冲仓设置配料口数量不同时料位的确定方法;并采用工艺技术研究了缓冲仓前后设备的工艺流程,对设备操作时间进行了数据统计和分析,进而确定了缓冲仓容积的理论计算公式和方法,对矿石码头装车站缓冲仓设计具有重要的指导意义.
1 IntroductionrnMobile cloud computing (MCC) can break the limitations of mobile devices by migrating applications to the Cloud with richer computing and storage resources[1].Consequently,mo-bile users can obtain better service experience,improved pro-ces
期刊
Software systems are present all around us and playing their vital roles in our daily life.The correct function-ing of these systems is of prime concern.In addition to clas-sical testing techniques,formal techniques like model check-ing are used to reinfo
摘要:水文地质既是岩土工程勘察组成的一部分,又直接影响了岩土工程的特性和质量,甚至还影响到建筑的安全性、稳定性和耐久性。本文笔者对岩土工程中水文地质勘察的作用进行了探讨,希望对相关从业人员具有借鉴意义。  关键词:岩土工程;水文地质勘察  中图分类号:S969文献标识码: A  1、岩土的水理性质  岩土的水理性质也就是指我们施工中的岩石土壤和地下水在一定程度上发生相互的作用而形成的一种固定的特征
期刊
Traditional recommendation algorithms predict the latent interest of an active user by collecting rating informa-tion from other similar users or items.Recently,more and more recommendation systems attempt to involve social relations to improve recommenda