Multi-Agent Few-Shot Meta Reinforcement Learning for Trajectory Design and Channel Selection in UAV-

来源 :中国通信(英文版) | 被引量 : 0次 | 上传用户:huiyigng
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Unmanned aerial vehicle(UAV)-assisted communications have been considered as a solution of aerial networking in future wireless networks due to its low-cost,high-mobility,and swift features.This pa-per considers a UAV-assisted downlink transmission,where UAVs are deployed as aerial base stations to serve ground users.To maximize the average trans-mission rate among the ground users,this paper for-mulates a joint optimization problem of UAV trajec-tory design and channel selection,which is NP-hard and non-convex.To solve the problem,we propose a multi-agent deep Q-network(MADQN)scheme.Specifically,the agents that the UAVs act as per-form actions from their observations distributively and share the same reward.To tackle the tasks where the experience is insufficient,we propose a multi-agent meta reinforcement learning algorithm to fast adapt to the new tasks.By pretraining the tasks with sim-ilar distribution,the learning model can acquire gen-eral knowledge.Simulation results have indicated the MADQN scheme can achieve higher throughput than fixed allocation.Furthermore,our proposed multi-agent meta reinforcement learning algorithm learns the new tasks much faster compared with the MADQN scheme.
其他文献
In this paper,a low-profile wideband di-electric resonator antenna(DRA)with a very compact planar size is investigated.The antenna consists of a high permittivity dielectric sheet on the top,a low per-mittivity substrate in the middle,and a probe feeding
This paper proposes a novel joint channel estimation and beamforming scheme for the massive multiple-input-multiple-output(MIMO)frequency-division duplexing(FDD)wireless legiti-mate surveillance system.With the proposed scheme,the monitor with the full du
With the rapid increasing of maritime activities,maritime wireless networks(MWNs)with high reliability,high energy efficiency,and low de-lay are required.However,the centralized network-ing with fixed resource scheduling is not suitable for MWNs due to th
Intelligent reflecting surface(IRS)is re-garded as a promising technology because it can achieve higher passive beamforming gain.In partic-ular,the IRS assisted simultaneous wireless informa-tion and power transfer(SWIPT)system can make the information de
With the research of the upcoming sixth generation(6G)systems,new technologies will re-quire wider bandwidth,larger scale antenna arrays and more diverse wireless communication scenarios on the future channel modeling.Considering chan-nel model is prerequ
With energy harvesting capability,the Internet of things(IoT)devices transmit data de-pending on their available energy,which leads to a more complicated coupling and brings new techni-cal challenges to delay optimization.In this paper,we study the delay-
In space-based Automatic Identification Systems(AIS),due to high satellite orbits,several Ad Hoc cells within the observation range of the satellite are vulnerable to interference by an external signal.To increase efficiency in target detection and improv
Low earth orbit(LEO)satellite network is an important development trend for future mobile communication systems,which can truly realize the“ubiquitous connection”of the whole world.In this paper,we present a cooperative computation offload-ing in the LEO
With the rapid growth of the demand for indoor location-based services(LBS),Wi-Fi re-ceived signal strength(RSS)fingerprints database has attracted significant attention because it is easy to ob-tain.The fingerprints algorithm based on convolution neural
Mobile edge computing(MEC)deploy-ment in a multi-robot cooperation(MRC)system is an effective way to accomplish the tasks in terms of en-ergy consumption and implementation latency.How-ever,the computation and communication resources need to be considered