reinforcementlearning 相关硕士博士期刊学术论文

reinforcementlearning相关论文

Reinforcement learning based energy efficient robot relay for unmanned aerial vehicles against smart

Unmanned aerial vehicles (UAVs) with limited energy resources,severe path loss,and shadowing to the ground base stations......

期刊

unmanned aerial vehicles relay jamming game theory reinforcement learning

Learning practically feasible policies for online 3D bin packing

We tackle the online 3D bin packing problem (3D-BPP),a challenging yet practically useful variant of the classical bin p......

期刊

bin packing problem online 3D-BPP reinforcement learning

Accelerated value iteration via Anderson mixing

In this paper,we introduce the Anderson acceleration technique developed to be applied to reinforcement learning tasks.W......

期刊

reinforcement learning Q-learning value iteration Anderson acceleration deep neu

Minimax Q-learning design for H∞ control of linear discrete-time systems

The H∞ control method is an effective approach for attenuating the effect of disturbances on practical systems, but it ......

期刊

H∞ control Zero-sum dynamic game Reinforcement learning Adaptive dynamic program

Underwater Target Detection Based on Reinforcement Learning and Ant Colony Optimization

Underwater optical imaging produces images with high resolution and abundant information and hence has outstanding advan......

期刊

ant colony optimization reinforcement learning underwater target edge detection

Sequencing of multi-robot behaviors using reinforcement learning

Given a collection of parameterized multi-robot controllers associated with individual behaviors designed for particular......

期刊

Multi-robot systems Reinforcement learning Distributed control

Quantum-enhanced reinforcement learning for control:a preliminary study

Reinforcement learning is one of the fastest growing areas in machine learning,and has obtained great achievements in bi......

期刊

Quantum theory Reinforcement learning Quantum computation State superposition Op

Derivative-free reinforcement learning:a review

Reinforcement learning is about learning agent models that make the best sequential decisions in unknown en-vironments.I......

期刊

reinforcement learning derivative-free optimiza-tion neuroevolution reinforcemen

Actor-Critic Reinforcement Learning and Application in Developing Computer-Vision-Based Interface Tr

This paper synchronizes control theory with computer vision by formalizing object tracking as a sequen-tial decision-mak......

期刊

Interface tracking Object tracking Occlusion Reinforcement learning Uniform mani

Parallel-Data-Based Social Evolution Modeling

Abnormal or drastic changes in the natural environment may lead to unexpected events,such as tsunamis and earthquakes,wh......

期刊

parallel data reinforcement learning decision-making

On-line learning algorithm for dynamic sensitivity control in IEEE 802.11 ax network

The popularity of IEEE 802.11 based wireless local area network (WLAN) increased significantly in recent years and resul......

期刊

IEEE 802.11ax reinforcement learning dynamic sensitivity control dense network

Self-driving Car Navigation with Obstacle Avoidance Using Reinforcement Learning

Self-driving car navigation with obstacle avoidance problem is a hot topic in academic world.The goal of this problem is......

会议

reinforcement learning car navigation obstacle avoidance adaptive dynamic progra

Data-driving Reinforcement Learning on the Path Planning for Autonomous Vehicles

The path planning for autonomous vehicles is a hot topic in academic world.The goal of this problem is to design a vehi-......

会议

autonomous vehicle path planning reinforcement learning adaptive dynamic program

Depression Detection on Social Media with Reinforcement Learning

Depression detection is a significant issue for human well-being.Conventional diagnosis of depression requires a face-to......

会议

Depression Social Media Reinforcement Learning

Decision Mechanisms for Interactive Character Animations in Virtual Environment

Recently,interactive character animations in computer games are mainly rely on motion-captured or carefully crafted moti......

会议

Realistic character Motion graph Reinforcement learning

Least-Squares Temporal Difference Learning with Eligibility Traces based on Regularized Extreme Lear

The task of learning the value function under a fixed policy in continuous Markov decision processes(MDPs)is considered.......

会议

Reinforcement learning Markov decision processes Function approximation Least-sq

Reinforcement Learning for Robot Navigation Using Hex-grid Maps

For some rodent mammals when they foraging or looking for a target, the positions and headings in their brain cells are ......

会议

Reinforcement Learning Hex-gird Map Robot Navigation Q-learning

Uncovering representations and algorithms of decision making by model-based analysis of striatal neu

The striatum is a major input site of the basal ganglia,which play an essential role in 337 decision making....

会议

reinforcement learning

Development of a Deep Learning Model for Binding Affinity Prediction and Fragment-based de novo Drug

The traditional drug design and discovery methods were time-consuming and expensive,which largely reduced the efficiency......

会议

scoring function Representation of binding pocket de novo drug design Reinforcem

De Novo Molecular Design Through Deep Reinforcement Learning

Over the past decade,deep learning(DL)has achieved remarkable success in various artificial intelligence(AI)research are......

会议

Deep learning Artificial intelligence Recurrent neural network Reinforcement lea

基于强化学习的准分子激光器能量控制算法研究

光刻用准分子激光器的能量特性在集成电路的光刻过程中至关重要,直接影响光刻机曝光线条的精度。为了实现对于衡量能量特性的能量......

期刊

激光器光刻准分子激光器强化学习能量稳定性剂量精度 lasers photolithography excimer laser reinforcemen

基于注意力和强化学习的遥感图像描述方法

针对当前遥感目标检测方法只能识别出遥感目标的类别及位置,无法生成与遥感图像内容相关文本描述的问题,提出了一种基于注意力和强......

期刊

遥感图像描述强化学习注意力机制编码-解码 remote sensing image caption reinforcement learning att

边缘计算使能的天地一体化信息网络中通信与存储资源联合调度

5G时代移动设备产生了海量数据，其中大多数是多媒体内容。通过无线网络传输如此规模的多媒体内容将会消耗大量无线频谱资源，进而导致......

学位

计算天地一体化信息网络通信资源存储资源点匹配算法缓存命中率研究内容 Reinforcement Learning Information Netw

Revisiting the ODE Method for Recursive Algorithms:Fast Convergence Using Quasi Stochastic Approxima

Several decades ago,Profs.Sean Meyn and Lei Guo were postdoctoral fellows at ANU,where they shared interest in recursive......

期刊

Learning and adaptive systems in artificial intelligence reinforcement learning

主动配电网运行优化的深度强化学习方法

随着分布式电源、柔性负荷等新型元素在配电网的渗透逐渐增加以及配电自动化系统、信息系统的建设,传统配电网正逐渐演变成可观可......

学位

电网运行优化系统运行方式 ADN 可中断负荷控制策略需求响应恢复控制故障恢复 Reinforcement Learning 实时环境分布式电源 Di

Single Exposure to Cocaine Impairs Reinforcement Learning by Potentiating the Activity of Neurons in

Plasticity in the glutamatergic synapses on striatal medium spiny neurons (MSNs) is not only essential for behavioral ad......

期刊

Cocaine Reinforcement learning Striatum Medium spiny neurons Long-term potentiat

Neural mechanisms of social learning and decision-making

One of the hallmarks of human society is the ubiquitous interactions among individuals.Indeed,a significant portion of h......

期刊

social cognition decision-making reinforcement learning value altruism

Achieving Safe Deep Reinforcement Learning via Environment Comprehension Mechanism

Deep reinforcement learning (DRL), which combines deep learning with reinforcement learning, has achieved great success ......

期刊

Reinforcement learning Deep rein-forcement learning Safe deep reinforcement lear

Multi-Agent Modeling and Simulation in the AI Age

With the rapid development of artificial intelligence(AI)technology and its successful application in various fields,mod......

期刊

artificial intelligence system dynamics reinforcement learning large-scale multi

A self-supervised method for treatment recommendation in sepsis

Sepsis treatment is a highly challenging effort to reduce mortality in hospital intensive care units since the treatment......

期刊

Treatment recommendation Sepsis Self-supervised learning Reinforcement learning

Decentralized multi-agent reinforcement learning with networked agents:recent advances

Multi-agent reinforcement learning(MARL) has long been a significant research topic in both machine learning and control......

期刊

Reinforcement learning Multi-agent systems Networked systems Consensus optimizat

The greedy crowd and smart leaders: a hierarchical strategy selection game with learning protocol

In this paper,a general resource distribution game with a hierarchical structure on the bipartite graph is proposed.In t......

期刊

multi-agent system reinforcement learning game theory complex network bipartite

Control of chaos in Frenkel-Kontorova model using reinforcement learning

It is shown that we can control spatiotemporal chaos in the Frenkel-Kontorova (FK) model by a model-free control method ......

期刊

chaos control Frenkel-Kontorova model reinforcement learning

Reinforcement Learning Based Obstacle Avoidance for Autonomous Underwater Vehicle

该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生......

期刊

Obstacleavoidance Autonomousunderwatervehicle Reinforcementlearning Q-learning F

提高强化学习速度的方法研究

强化学习一词出自行为心理学，这门学科把学习看作为反复试验的过程，以便把环境的状态映射为动作。强化学习的这种特性必须增加智能系......

期刊

强化学习机器学习 Q-学习自适应启发评价方法学习速度 Reinforcementlearning Machine Learning Q-learning

加强学习主要算法的比较研究

文章介绍了加强学习模型，分别给出了加强学习的四个主要算法：动态规划、蒙特卡罗算法、时序差分算法、Q－学习，并指出了它们之间的区别......

期刊

加强学习蒙特卡罗算法时序差分算法 Q-学习机器学习人工智能 Reinforcementlearning Dynamic programming mont

一类基于有效跟踪的广义平均奖赏激励学习算法

取消了平均奖赏激励学习的单链或互通MDPs假设，基于有效跟踪技术和折扣奖赏型SARSA(λ)算法，时传统的平均奖赏激励学习进行了推广，提......

期刊

激励学习 MARKOV决策过程平均奖赏有效跟踪 Reinforcementlearning Markov decision processes(MDPs )

在信息融合系统中引入多智能体技术

论文简要介绍了多智能体技术和信息融合系统，将多智能体技术运用到信息融合系统中，对信息融合系统中的模型和方法进行改进，提出了多智......

期刊

信息融合多智能体系统(MAS) 强化学习 information fusion multi-agentsystem reinforcementlearning

基于再励学习的多移动机器人协调避障路径规划方法

随着多移动机器人协调系统的应用向未知环境发展，一些依赖于环境模型的路径规划方法不再适用，而利用再励学习与环境直接交互，不需要先......

期刊

避障路径规划路径规划再励学习再励函数多机器人协调移动机器人 Path planning Reinforcementlearning Reinforce

基于增强学习的多agent自动协商研究

该文通过对协商协议的引入，对提议形式、协商流程的分析，结合多属性效用理论和连续决策过程，提出了一个开放的、动态的、支持学习机制......

期刊

增强学习自动协商 Q学习评估提议 reinforcementlearning automated negotiation Q-learning evalua

基于博弈策略强化学习的函数优化算法

该文提出了一种基于博弈论的函数优化算法。算法将优化问题的搜索空间映射为博弈的策略组合空间,优化目标函数映射为博弈的效用函......

期刊

博弈函数优化强化学习策略组合效用函数 game function optimization reinforcementlearning strateg

折扣与无折扣MDPs：一个基于SARSA（λ）算法的实例分析

分析了折扣激励学习存在的问题，对MDPs的SARSA（λ）算法进行了折扣的比较实验分析，讨论了平均奖赏常量对无折扣SARSA（（）算法的影响。......

期刊

机器学习激励学习 SARSA(λ)算法实例分析 MDPs Reinforcementlearning Markov decision processes D

看过本文同时还关注