A learning-based flexible autonomous motion control method for UAV in dynamic unknown environments

来源 :系统工程与电子技术(英文版) | 被引量 : 0次 | 上传用户:wxa180395
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
This paper presents a deep reinforcement learning (DRL)-based motion control method to provide unmanned aerial vehicles (UAVs) with additional flexibility while flying across dy-namic unknown environments autonomously. This method is ap-plicable in both military and civilian fields such as penetration and rescue. The autonomous motion control problem is ad-dressed through motion planning, action interpretation, traject-ory tracking, and vehicle movement within the DRL framework. Novel DRL algorithms are presented by combining two differ-ence-amplifying approaches with traditional DRL methods and are used for solving the motion planning problem. An improved Lyapunov guidance vector field (LGVF) method is used to handle the trajectory-tracking problem and provide guidance control commands for the UAV. In contrast to conventional motion-con-trol approaches, the proposed methods directly map the sensor-based detections and measurements into control signals for the inner loop of the UAV, i.e., an end-to-end control. The training experiment results show that the novel DRL algorithms provide more than a 20% performance improvement over the state-of-the-art DRL algorithms. The testing experiment results demon-strate that the controller based on the novel DRL and LGVF, which is only trained once in a static environment, enables the UAV to fly autonomously in various dynamic unknown environ-ments. Thus, the proposed technique provides strong flexibility for the controller.
其他文献
肿瘤增长扩散界面模型是Cahn-Hilliard方程和一个反应扩散方程的耦合,用Cahn-Hilliard方程描述肿瘤相场,用反应扩散方程描述营养物质浓度的变化对肿瘤相场的影响.利用Aubin-Lions引理、Lebesgue控制收敛定理、Gronwall引理以及一些常用的基本不等式得到了该耦合方程组初边值问题解的存在唯一性.
Affected by the natural environmental and human activity factors, significant seasonal differences appear on the regional scattering characteristic and ground deformation of sa-line soil. Interferometric decorrelation due to season replace-ment limits the
When multiple ground-based radars (GB-rads) are utili-zed together to resolve three-dimensional (3-D) deformations, the resolving accuracy is related with the measurement geo-metry constructed by these radars. This paper focuses on con-strained geometry a
Synthetic aperture radar (SAR) is able to detect sur-face changes in urban areas with a short revisit time, showing its capability in disaster assessment and urbanization monitoring. Most presented change detection methods are conducted using couples of S
In the application of persistent scatterer interfero-metry (PSI), deformation information is extracted from persistent scatterer (PS) points. Thus, the density and position of PS points are critical for PSI. To increase the PS density, a time-series In-SA
For ship targets with complex motion, it is difficult for the traditional monostatic inverse synthetic aperture radar (ISAR) imaging to improve the cross-range resolution by increasing of accumulation time. In this paper, a distributed ISAR imaging al-gor
Radar detection of small targets in sea clutter is a particularly demanding task because of the nonstationary cha-racteristic of sea clutter. The track-before-detect (TBD) filter is an effective way to increase the signal-to-clutter ratio (SCR), thus im-p
Signal reconstruction is a significantly important the-oretical issue for compressed sensing. Considering the situation of signal reconstruction with unknown sparsity, the conventional signal reconstruction algorithms usually perform low accuracy. In this
Higher-order statistics based approaches and signal sparseness based approaches have emerged in recent decades to resolve the underdetermined direction-of-arrival (DOA) estima-tion problem. These model-based methods face great chal-lenges in practical app
Constellation reconfiguration is a critical issue to re-cover from the satellite failure, maintain the regular operation, and enhance the overall performance. The constellation recon-figuration problem faces the difficulties of high dimensionality of desi