Collaborative Pushing and Grasping of Tightly Stacked Objects via Deep Reinforcement Learning

来源 :自动化学报（英文版） | 被引量 : 0次 | 上传用户：ADAM129XU

【摘要】

：

【作者】

：

Yuxiang Yang Zhihao Ni Mingyu Gao Jing Zhang Dacheng Tao

【机构】

：

School of Electronics and Information,Hangzhou Dianzi University,Hangzhou;Zhejiang Provincial Key La

【出处】

：

自动化学报（英文版）

【发表日期】

：

2022年1期

【关键词】

：

Convolutional neural network deep Q-learning (DQN) reward function robotic grasp

下载到本地 , 更方便阅读

下载此文赞助VIP

声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架

论文部分内容阅读

Directly grasping the tightly stacked objects may cause collisions and result in failures, degenerating the functionality of robotic arms. Inspired by the observation that first pushing objects to a state of mutual separation and then grasping them individually can effectively increase the success rate, we devise a novel deep Q-learning framework to achieve collaborative pushing and grasping. Specifically, an efficient non-maximum suppression policy (PolicyNMS) is proposed to dynamically evaluate pushing and grasping actions by enforcing a suppression constraint on unreasonable actions. Moreover, a novel data-driven pushing reward network called PR-Net is designed to effectively assess the degree of separation or aggregation between objects. To benchmark the proposed method, we establish a dataset containing common household items dataset (CHID) in both simulation and real scenarios. Although trained using simulation data only, experiment results validate that our method generalizes well to real scenarios and achieves a 97％ grasp success rate at a fast speed for object separation in the real-world environment.

其他文献

叶片进口边位置对单叶片离心泵蜗壳内压力脉动的影响

为了研究叶片进口边位置对单叶片离心泵性能的影响,基于标准k-ε湍流模型,对不同叶片进口边位置的单叶片离心泵首先进行了定常数值计算和外特性分析,并通过试验验证了数值计算结果的可靠性.然后以定常计算结果为基础进行了非定常数值计算,分析了叶片进口边位置对泵压力脉动特性的影响.数值计算结果表明:随着叶片进口边沿后盖板或前盖板向泵入口延伸量的增加,除隔舌附近外蜗壳内其余区域的随机压力脉动减弱,压力脉动周期性特征增强,压力脉动系数的幅值和主频压力脉动幅值增大;而隔舌处的压力脉动系数的幅值和主频压力脉动幅值均是先减小后

期刊

单叶片离心泵叶片进口边位置压力脉动特性数值模拟

基于小波包频带稀疏编码的非完备信息条件下轴承状态识别

针对传统稀疏编码不够精细的问题,提出一种小波包频带稀疏编码算法.首先对原始信号进行小波包分解和最优频带选择,对每个最优频带分别训练一个过完备稀疏字典,并将待测试信号每个频带的压缩重构误差作为新的稀疏编码,利用灰色B型绝对关联度降维得到最终退化特征.考虑到轴承正常运行状态和严重摩擦状态容易识别,建立基于上述两种状态的非完备信息条件下轴承退化状态评估模型,根据设定好的门限值设置预警线.利用公开轴承全寿命数据进行仿真分析,发现新的稀疏编码特征其预警线临界点早于传统稀疏编码特征,从而能够更早发布故障报警.此外,对

期刊

稀疏编码灰色B型绝对关联度退化状态识别轴承压缩感知

Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforce

We address a state-of-the-art reinforcement learning (RL) control approach to automatically configure robotic pros-thesis impedance parameters to enable end-to-end, continuous locomotion intended for transfemoral amputee subjects. Specifically, our actor-

期刊

Automatic tracking of intact kneeconfiguration of robotic knee prosthesisdirec

Dynamic Event-Triggered Scheduling and Platooning Control Co-Design for Automated Vehicles Over Vehi

This paper deals with the co-design problem of event-triggered communication scheduling and platooning control over vehicular ad-hoc networks (VANETs) subject to finite communication resource. First, a unified model is presented to describe the coordinate

期刊

Automated vehiclesdynamic event-triggered communicationinformation flow topolo

Generative Adversarial Network Based Heuristics for Sampling-Based Path Planning

Sampling-based path planning is a popular methodology for robot path planning. With a uniform sampling strategy to explore the state space, a feasible path can be found without the complex geometric modeling of the configuration space. However, the qualit

期刊

Generative adversarial network (GAN)optimal path planningrobot path plannings

Integrating Variable Reduction Strategy With Evolutionary Algorithms for Solving Nonlinear Equations

Nonlinear equations systems (NESs) are widely used in real-world problems and they are difficult to solve due to their nonlinearity and multiple roots. Evolutionary algorithms (EAs) are one of the methods for solving NESs, given their global search capabi

期刊

Evolutionary algorithm (EA)nonlinear equations systems (ENSs)problem domain kn

PID Control of Planar Nonlinear Uncertain Systems in the Presence of Actuator Saturation

This paper investigates PID control design for a class of planar nonlinear uncertain systems in the presence of actuator saturation. Based on the bounds on the growth rates of the nonlinear uncertain function in the system model, the system is placed in a

期刊

Actuator saturationdomain of attractionnonlinear systemsPID controluncertain

Human-in-the-Loop Consensus Control for Nonlinear Multi-Agent Systems With Actuator Faults

This paper considers the human-in-the-loop leader-following consensus control problem of multi-agent systems (MASs) with unknown matched nonlinear functions and actuator faults. It is assumed that a human operator controls the MASs via sending the command

期刊

Actuator faultsdistributed controlhuman-in-the-loopneighborhood observernonl

Improving Dendritic Neuron Model With Dynamic Scale-Free Network-Based Differential Evolution

Some recent research reports that a dendritic neuron model (DNM) can achieve better performance than traditional artificial neuron networks (ANNs) on classification, prediction, and other problems when its parameters are well-tuned by a learning algorithm

期刊

Artificial neuron networks (ANNs)dendrite neuron networkdifferential evolution

基于GM(1,1)-ARIMA模型的设备故障时长分析

设备故障停机时长是影响企业生产运营和效率的关键技术指标.为了能一目了然地反映设备自身的运行状况,提出了一种设备故障时长评估新指标——设备故障工时损耗率,将因设备故障产生的时长转化为设备故障工时损耗率.以露天矿大型设备电铲车一年电气故障时长作为基础数据,建立均值差分GM(1,1)模型,掌握设备稳定性,为采矿优化调度、生产产能制定提供有效支撑;建立ARIMA模型,拟合时间序列并获取短期预测数据,将其转化为评估指标验证模型,有效缩短故障检测和维修准备的时间.

期刊

设备故障ARIMA模型GM(11)模型预测

Collaborative Pushing and Grasping of Tightly Stacked Objects via Deep Reinforcement Learning

与本文相关的学术论文