Collaborative Pushing and Grasping of Tightly Stacked Objects via Deep Reinforcement Learning

Source: Acta Automatica Sinica (English edition)
Directly grasping tightly stacked objects may cause collisions and result in failures, degrading the functionality of robotic arms. Inspired by the observation that first pushing objects to a state of mutual separation and then grasping them individually can effectively increase the success rate, we devise a novel deep Q-learning framework to achieve collaborative pushing and grasping. Specifically, an efficient non-maximum suppression policy (PolicyNMS) is proposed to dynamically evaluate pushing and grasping actions by enforcing a suppression constraint on unreasonable actions. Moreover, a novel data-driven pushing reward network called PR-Net is designed to effectively assess the degree of separation or aggregation between objects. To benchmark the proposed method, we establish a common household items dataset (CHID) in both simulation and real scenarios. Although trained using simulation data only, experimental results validate that our method generalizes well to real scenarios and achieves a 97% grasp success rate while separating objects quickly in the real-world environment.
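The abstract does not specify how PolicyNMS is implemented, so the following is only a minimal sketch of the general idea of suppressing unreasonable actions before choosing between pushing and grasping. It assumes per-pixel Q-value maps for each primitive and boolean validity masks; the function name `select_action`, the mask inputs, and the greedy selection rule are all hypothetical and are not taken from the paper.

```python
from math import inf
from typing import List, Tuple

QMap = List[List[float]]   # hypothetical per-pixel Q-values for one primitive
Mask = List[List[bool]]    # hypothetical mask: True where an action is reasonable


def select_action(
    q_push: QMap, q_grasp: QMap, valid_push: Mask, valid_grasp: Mask
) -> Tuple[str, int, int, float]:
    """Return (primitive, row, col, q) for the best non-suppressed action.

    Assumption: suppression is modeled by skipping masked-out pixels.
    This is a generic masked greedy choice, not the paper's PolicyNMS.
    """
    best = ("none", -1, -1, -inf)
    candidates = (("push", q_push, valid_push), ("grasp", q_grasp, valid_grasp))
    for name, q_map, mask in candidates:
        for r, row in enumerate(q_map):
            for c, q in enumerate(row):
                if not mask[r][c]:
                    continue  # suppressed: treated as an unreasonable action
                if q > best[3]:
                    best = (name, r, c, q)
    return best


if __name__ == "__main__":
    q_push = [[0.2, 0.9], [0.1, 0.3]]
    q_grasp = [[0.5, 0.4], [0.8, 0.1]]
    valid_push = [[True, False], [True, True]]   # the 0.9 push is suppressed
    valid_grasp = [[True, True], [True, True]]
    print(select_action(q_push, q_grasp, valid_push, valid_grasp))
```

In this toy input the highest raw Q-value belongs to a push that the mask suppresses, so the selection falls through to the best remaining grasp. If every action is masked out, the function returns the `"none"` placeholder.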