Improved Q-learning algorithm for load balance in millimeter wave backhaul networks

Source: The Journal of China Universities of Posts and Telecommunications (English Edition)
With the dense deployment of users and the drastic increase in traffic load, millimeter wave (mmWave) backhaul networks have been widely investigated. A typical mmWave backhaul network consists of a macro base station (MBS) and small base stations (SBSs). How to efficiently associate users with the MBS and the SBSs for load balancing is a key issue in such a network. By adding a virtual power bias to the SBSs, more users can access the SBSs and share the load of the MBS. The bias values must be set reasonably to guarantee backhaul efficiency and quality of service (QoS). An improved Q-learning algorithm is proposed to effectively adjust the bias value of each SBS. In the proposed algorithm, each SBS acts as an independently learning agent and, through training, learns its best behavior, namely the optimal bias value. In addition, an improved behavior selection mechanism is adopted to improve learning efficiency and accelerate the convergence of the algorithm. Finally, simulations conducted in the 60 GHz band demonstrate the superior performance of the proposed algorithm in terms of backhaul efficiency and user outage probability.
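The idea of biased association and per-SBS learning can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract does not give the state, action, or reward definitions, nor the improved behavior selection mechanism, so everything below is an assumption made for illustration. The candidate bias levels (`BIAS_LEVELS_DB`), the three actions (lower, keep, raise), the `toy_reward` function, and the plain epsilon-greedy rule in `choose` are all hypothetical stand-ins. Only the Q-value update itself is the standard tabular Q-learning rule.

```python
import random

BIAS_LEVELS_DB = [0, 2, 4, 6, 8, 10]   # hypothetical candidate bias values (dB)
ACTIONS = [-1, 0, 1]                    # lower, keep, or raise the bias level


def associate(rx_power_dbm, bias_db):
    """Pick the station with the largest biased received power (dB scale)."""
    biased = [p + b for p, b in zip(rx_power_dbm, bias_db)]
    return max(range(len(biased)), key=lambda i: biased[i])


class BiasAgent:
    """One SBS as an independent tabular Q-learner; state = current bias index."""

    def __init__(self, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = [[0.0] * len(ACTIONS) for _ in BIAS_LEVELS_DB]
        self.rng = random.Random(seed)

    def choose(self, state):
        # Plain epsilon-greedy (assumption), standing in for the paper's
        # improved behavior selection mechanism, which is not described here.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(ACTIONS))
        row = self.q[state]
        return row.index(max(row))

    def step(self, state, action):
        # Apply the action and clamp the bias index to the allowed range.
        return min(max(state + ACTIONS[action], 0), len(BIAS_LEVELS_DB) - 1)

    def update(self, state, action, reward, next_state):
        # Standard Q-learning temporal-difference update.
        target = reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])


def toy_reward(state, best_index=3):
    """Hypothetical reward: highest when the bias sits at best_index."""
    return -abs(state - best_index)


def train(agent, episodes=200, steps=20):
    """Run the agent against the toy reward and return it."""
    for _ in range(episodes):
        state = agent.rng.randrange(len(BIAS_LEVELS_DB))
        for _ in range(steps):
            action = agent.choose(state)
            next_state = agent.step(state, action)
            agent.update(state, action, toy_reward(next_state), next_state)
            state = next_state
    return agent
```

In `associate`, a user that receives -60 dBm from the MBS and -65 dBm from an SBS stays with the MBS when both biases are 0 dB, but moves to the SBS once the SBS bias is 10 dB, which is how the bias offloads traffic from the MBS. In a real system the reward would have to reflect backhaul efficiency and user outage rather than the toy distance used here.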