Adaptive Mobile Video Delivery Based on Fountain Codes and DASH: A Survey

来源 :ZTE Communications | 被引量 : 0次 | 上传用户:bianmomo
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Recent years have witnessed an explosive growth in mobile video?based services and efficient and reliable video delivery draws more and more attention. As a type of rateless codes, fountain codes can automatically adapt to wireless channel conditions without any knowledge of channels. This paper provides an overview of several typical Forward Error Correction (FEC) codes, such as Reed?Solomon (RS) code, Tornado code, Luby?Transform (LT) code, and Raptor code. We focus on a novel delay?aware fountain coding (DAF) technique that maximizes the code word length under the constraint of a given delay. Based on DAF, this paper also presents Unequal Error Protection DAF (UEP?DAF) which improves the Peak Signal to Noise Ratio (PSNR) without additional coordination between the encoder and the decoder, as well as Model Predictive Control DAF (MPC?DAF) which reduces the computational complexity to an affordable level for real?time video communications. Moreover, we review video streaming technologies, then introduce Dynamic Adaptive Streaming over HTTP (DASH) and DASH over Multiple Content Distribution Servers (MCDS?DASH) in detail. Based on MCDS?DASH that adapts video bitrate at the block level to alleviate video fluctuation, we propose a novel approach to integrating fountain codes with MCDS?DASH, which is capable of achieving unprecedented high throughput.
  mobile video delivery; DAF; UEP; MPC; DASH
  1 Introduction
  uring the past few years, mobile video?based services have witnessed an explosive growth, such as YouTube, FaceTime, and WeChat, with the popularity of smart mobile devices as well as the evolution of 3G, Long Term Evolution (LTE), Wi?Fi and other communication technologies. It is also expected that traffic from virtual reality (VR), augmented reality (AR), 3D games, surveillance video, vehicle networking and drone video will increase significantly in the near future. However, the problems including packet loss and insufficient bandwidth easily deteriorate in even higher demanding video applications due to the stochastic nature of wireless channels. As a result, how to deliver low latency and high quality video over wireless networks poses an unprecedented challenges for both academia and industry.
  Erasure codes (Fig. 1), such as Reed?Solomon (RS) code proposed first by Reed and Solomon in 1960 and Tornado code that Luby et al. put forward in 1997, are comparatively powerful in error correction for data transmission over network. Nevertheless, they are not suitable for wireless transmission due to high co?decoding complexity. In addition, some fixed code rate must be chosen when the encoding begins, which could also lead to bandwidth waste if the erasure rate is overestimated, otherwise poor video quality.   Luby, Byers, et al. first presented the concept of digital fountain (DF) in 1998 for large?scale data distribution services and reliable broadcast/multicast services. In this concept, the number of encoded symbols that can be generated from the source data is potentially limitless and the code rate can automatically adapt to wireless environment without assuming any knowledge of channels.
  In 2002, Luby further proposed the first practical DF codes, Luby?Transform (LT) code [1]. Although the LT code exhibits excellent efficiency compared to traditional erasure codes, the shortcomings are obvious: decoding cannot be successful only until all the source packets are recovered; the decoding complexity of LT code is linear logarithmic order O (k log k), k is the number of source symbols.
  In 2006, Shokrollahi put forward the most effective DF code, Raptor code, which was obtained by concatenating a high rate low?density parity?check (LDPC) code with an LT code of constant average degree distribution [2]. The Raptor code exhibits linear co?decoding time while still keeping low coding overhead. More importantly, it has been employed in the Digital Video Broadcasting—Handheld (DVB?H) for IP Datacast (IPDC) and the Third Generation Partnership Project (3GPP) standard for multimedia broadcast multicast services (MBMS) [3].
  Internet video streaming services have also witnessed a tremendous growth with the evolution of Internet and the popularity of mobile intelligent devices. Traditional Real?Time Transport Protocol/Real Time Stream Protocol (RTP/RTSP) has increasingly become unable to answer the demand for the following reasons: high deployment cost, indispensable stream session management, exceedingly difficult traversing restrictive network address translators and firewalls, as well as not supported by prevalent content delivery networks (CDN).
  Advanced streaming media technologies are urgently needed to solve the aforementioned problems. HTTP?based dynamic adaptive streaming media technology emerges as the times require, and has already developed rapidly.
  The rest of this paper is organized as follows. Section 2 briefly describes block coding and sliding window coding, and introduces delay?aware fountain coding (DAF) in detail. Section 3 presents Unequal Error Protection DAF (UEP?DAF) and Model Predictive Control DAF (MPC?DAF) based on DAF. Section 4 reviews Dynamic Adaptive Streaming over HTTP (DASH) and proposes our novel scheme. Section 5 concludes the paper.   2 Delay?Aware Fountain Codes
  Fountain codes are widely used in network transmission, such as reliable multicast, multi?source parallel downloading and distributed storage. However, if a video streaming file is delivered with conventional fountain codes, it cannot be displayed until the entire file is successfully decoded. Unfortunately, video streaming is of timeliness that means the time interval of video generation and display must not exceed a certain threshold. In addition, due to limited memory capabilities, receiver devices may also impose some restrictions on the receiving time of frames.
  2.1 Block and Sliding Window Coding
  The most direct solution to transmit video streaming with fountain codes is to partition video streams into blocks (Fig. 2), and then encode and transmit them sequentially. In [3], Ahmad et al. present a block coding design of fountain codes for video transmission. Considering the playback delay, the smaller the block size is, the shorter it is, however, from the fountain code point of view, the larger the block size, the lower the coding overhead and the higher the coding efficiency.
  In [4], Bogino et al. proposed a sliding window approach to segment the source data, leaving some overlaps between successive steps, as shown in Fig. 3. Consequently, decoded packets in one window are valuable for decoding the coded packets of subsequent windows. The sliding window scheme virtually extends the size of source block with respect to the non?overlap block coding, thereby obtains a higher co?decoding efficiency with the overhead decrease. In addition, expanding window was presented in [5]. In the aforementioned three schemes, the block size is fixed in block coding, virtually extended in the sliding window, and kept growing in expanding window.
  2.2 Delay?Aware Fountain Codes
  In 1977, Elience first proposed the joint source channel coding scheme to enhance the overall performance of the communication system. Although numerous studies on joint fountain codes and video coding have also been done, such as [3], [4], [5], and [6], they have a common shortcoming that is to divide the video streaming into blocks with the fixed number of packets without considering the video content characteristics. Accordingly, the same frame is probably divided into different blocks, which has a negative impact on the decoding of compressed video stream. Moreover, due to the fluctuation of video bit rate (Fig. 4), each packet?based block may contain different number of frames and the resulting video jitter increases as a result of the varying latency.   As is known to all that the concepts relevant to time in video sequence can be measured with number of frames. Consequently, different from the existing sliding window schemes, Sun, Wu et al. [7] innovatively proposed to establish the sliding window with the fixed number of frames to provide the much desired delay awareness in video streaming, namely DAF. In this way, the code word length can be maximized under the condition of limited delay, so as to achieve higher coding gain. In addition, a low?complexity online version DAF?L was also proposed, adopting only one parameter-slope factor to quantify the sampling distributions.
  Design of degree distribution has a critical influence on the decoding performance of fountain codes, and optimization of degree distribution functions has been done in certain literatures, such as [1]. It is worth noting that in fountain codes the more uniform the sampling distribution of the source packets, the higher the coding efficiency [7]. As a result, we proposed the strategy of window?wise sampling which dynamically adjusts each window sampling mode according to the fluctuation of video bit rate.
  [ASP(t)=ω∈all windows covertppktω(t),] (1)
  where[ppktω(t)] denotes the average sampling probability of each packet in frame t within window ω. [ASP(t)] denotes the total probability of every packet in frame t accumulated through all the sliding windows covering that frame. The objective function of the major optimization process in DAF is to minimize the variance of the accumulated sampling possibility [ASP(t)] as in (1).
  Fig. 5 shows the experiments, where the code rate is defined using the ratio of total number of native packets to total number of coded packets, and IDR denotes in?time decoding ratio, prove that this scheme has higher video quality than existing schemes with low delay and constant data rate.
  3 UEP?DAF and MPC?DAF
  All the video data has been assumed of the same importance in DAF, in order to further enhance the practical applicability, we propose a method to integrate UEP into DAF.
  3.1 UEP?DAF
  The aforementioned schemes are all based on the assumption that all the data has the same importance. However, the importance of different levels for video stream could be reflected in the following aspects, such as picture types, data types, position of the frame in a group of pictures (GOP), layers in scalable video coding (SVC), and picture content [8]. Fountain codes with equal error protection (EEP) are obviously inefficient, especially in the case of video delivery over wireless channel, the bandwidth of which is more often insufficient and decoding all packets with equal chance sometimes could induce suboptimal quality.   In recent years, fountain codes with unequal error protection (UEP) characteristics have attracted extensive attentions. In 2005, Rahnavard and Fekri suggested for the first time UEP?based fountain codes [9] which principle was to encode data by a method of adopting different degree distribution according to unequal importance. In [5] and [6], Sejdinovic and Vukobratovic proposed an approach of expanding window fountain codes, in which all windows had the same starting position and packets in each window must be a subset of the subsequent window (Fig. 6), so packets in the innermost window had the highest sampling probability. In order to achieve high efficient transmission of data in multiple source relay channels, Talari et al. presented a distributed unequal error protection fountain codes [10] to meet the requirements of different terminals. Ahmad, Hamzaoui et al. divided video source data block into several segments, and then duplicated them according to the protection factors to obtain a new block [11], as shown as in Fig. 7, where Mib denotes most important bits, and Lib denotes least important bits.
  The aforementioned UEP?based schemes can improve remarkably the peak signal?to?noise ratio (PSNR) without the bit rate increasing. Nevertheless, they all have an unfavorable problem which is the encoder needs to send importance description information to the decoder. Whether coordinating between encoder and decoder beforehand or explicit transmission in the packet headers, the resulting overhead will increase the possibility of packet loss.
  In [8], Sun, Wu et al. proposed a novel scheme UEP?DAF to apply UEP to video communication applications based on delay?aware fountain codes. The proposed scheme does not need additional coordination between encoder and decoder; besides, the frame?level importance profile may be specified in advance. Simulation experiments show that compared to the result of DAF, UEP?DAF allocates higher sampling probability to the frames in the front of a GOP than that in the back (Fig. 8) where ASP denotes the accumulated sampling possibility. Consequently, the proposed system achieves higher decoding ratios and PSNR compared to EEP under the same network conditions.
  3.2 MPC?DAF
  DAF based on delay?aware sliding window and window?wise sampling, takes full advantage of channel?adaptive rateless feature, effective delay control and optimal sampling pattern, therefore it outperforms the other existing schemes. However, high computational complexity induced by the per?window optimization of sampling pattern and that the bit rate information of all frames needs to be obtained in advance, prevent its applications in live video streaming. The encoding computational complexity of DAF is [O(T?W)/Δt23], where T denotes video length, W denotes sliding window size, and[Δt]denotes step size. The detailed deduction can be found in [7]. Although the performance of low?complexity DAF?L proposed in [7] is higher than that of many other existing schemes, compared to DAF the gap is still significant, especially when packet loss is relatively serious.   DAF?M based on the Model Predictive Control (MPC) was proposed in [12], which is an online optimization algorithm with a lower computational complexity with respect to DAF, but higher performance compared to DAF?L. More details of MPC?DAF code can be found in [12]. As a result, the computational complexity is lowered to an affordable level for real?time video communications. Moreover, the results of simulation experiments show that the decoding ratio of DAF?M is close to the global optimum in DAF codes [12].
  4 MCDS?DASH Based on Fountain Codes
  HTTP?based incremental download technology, overcoming the aforementioned problems, has become widespread, including Adobe’s flash player, Microsoft Corp.’s Silverlight and windows media player, etc. However, it is inevitable that video playback interruption and video quality degradation occur as a result of the constantly changing network bandwidth, especially for the stochastic wireless networks.
  4.1 MPEG?DASH
  New generation streaming media technology, so?called the adaptive streaming media scheme, becomes more and more popular, because it can provide users with real?time, smooth and high quality streaming service [13]. Meanwhile, numerous HTTP adaptive streaming schemes were also proposed, including HTTP Live Streaming (HLS) from Apple Corp., Smooth Streaming (SS) from Microsoft Corp. and HTTP Dynamic Streaming (HDS) from Adobe Corp. However, the diversity of different streaming media service solutions brings about a large number of compatibility issues, which increases the difficulty of system maintenance.
  DASH, also known as MPEG?DASH, is the first international standard of HTTP?based adaptive streaming media solution. Moving Picture Experts Group (MPEG) issued a call for proposal for an HTTP streaming standard in April 2009, then started the evaluation of the submitted fifteen full proposals since July 2009. The DASH international standard draft was completed in January 2011 and became the international standard in November 2011. In April 2012, the international standard of DASH with ISO/IEC 23009?1 was officially promulgated. Since then DASH has been widely adopted for providing uninterrupted video streaming service to users with dynamic network conditions and heterogeneous devices [14], [15].
  Media presentation description (MPD) is a manifest file of encoding information defined by DASH server, including the descriptions of time?based periods, representations based on the bit rate, frame rates and resolutions, as well as video data segments. The DASH client is capable of choosing to download and display the most appropriate video segment according to the network conditions and the receiver buffer state.   Adaptive bit stream switching algorithm, as the core technology, has always been the most critical factor affecting the efficiency of DASH. An efficient rate adaptation algorithm should prevent frequent hopping between video bitrates, as well as frequent interruptions or non?optimal visual quality due to higher or lower than the available bandwidth.
  In [16], Ojanper et al. proposed an adaptive network aided method for adaptive HTTP video stream based on cognitive network management architecture and distributed control. Müller et al. proposed an improved DASH implementation employing scalable video coding (SVC), which was suitable for mobile applications with large bandwidth variations [17]. Besides, a new model predictive control algorithm to optimize the throughput and buffer occupancy information was also proposed in [18].
  4.2 DASH over Multiple Content Distribution Servers
  (MCDS?DASH)
  Generally traditional DASH is based on single server, so no matter how excellent adaptation algorithm is adopted, all efforts seem helpless once the server is unreachable or even down. In order to improve the bandwidth and stability of the transmission, a parallel download technology based on MCDS was proposed in [19].
  MCDS?DASH deploys DASH over multiple servers; thereby the same content can be available at multiple URLs concurrently and the DASH client can obtain video segments at the maximum bandwidth from the optimal server. Compared with the single server node, MCDS can obviously provide higher bandwidth, reliability and scalability.
  However, rate adaption control becomes a more challenging problem in MCDS?DASH. Because if it adapts video bitrate still at the segment level as traditional methods do, frequent video bitrate switching would occur due to diverse bandwidths over multiple heterogeneous servers. In that case, viewing experience of users will decline dramatically [20]. Moreover, disorder downloading of video segments from different servers and the compulsory requirements of playback according to the correct order would cause additional delay.
  In [19], Chao Zhou et al. presented to group multiple segments into a block and adapt video bitrate at the block level rather than at the segment level to alleviate the video fluctuation. Furthermore, downloading each segment according to the playback deadline was also proposed.
  4.3 MCDS?DASH Based on Fountain Codes
  Bitrate smoothness and bandwidth utilization are a couple of contradictions due to the inherent bandwidth variations in MCDS?DASH. Although the scheme proposed in [19] adapting video bitrate at the block level alleviates actually the video drastic fluctuation, this is at the expense of lower bandwidth utilization.   The order of video segments download completion may not be in accordance with the one you assume based on the current network status, especially in stochastic wireless networks, which would cause additional delay. In fact, the approaches mentioned in [19] are not completely parallel, because the servers having completed download will keep idle in one block unit, which actually is a waste of bandwidth resources.
  Nevertheless, the block?based scheme prompts us an innovative inspiration to employ fountain codes in MCDS?DASH. In this scheme, we code video streaming files by a method of adopting fountain codes, then send the coded packets to the MCDS. In that sense DASH clients need not care about how many packets downloaded from each specific DASH server, and as long as the number of received packet reaches the threshold, the video streaming file can be decoded successfully. In that sense our novel scheme can achieve actually parallel download.
  5 Conclusions
  In this paper we presented an overview of several typical FEC codes, and focused on the state?of?the?art fountain codes DAF as well as its extended UEP?DAF and MPC?DAF. We also did research on respective advantages, disadvantages, suitable application scenarios etc. Furthermore, we reviewed the development of video streaming technologies, and paid a special attention to both DASH and MCDS?DASH in this work. Based on grouping multiple segments into a block and adapting video bitrate at the block level in MCDS?DASH, we put forward an innovative scheme which is able to integrate fountain codes with MCDS?DASH. Our scheme is capable of achieving unprecedented high throughput when multiple servers exert best efforts to transmit the same video file to clients.
其他文献
荨麻疹是皮肤科常见的过敏性皮肤疾病。中医称之为“瘾疹”。中医认为荨麻疹多因禀赋不足,又食鱼虾等腥荤动风之物,或因饮食不节,胃肠积热,或因平素体虚卫表不固,风热、风寒之邪侵袭皮毛肌腠之间而发病;或因情志不遂,伤及五脏,五脏失调,肝郁不舒,气机壅滞,郁而化火,灼伤阴血,血虚化燥,肌肤失濡,复感风邪可诱发本病。  关键词:荨麻疹;中医药治疗;脱敏汤  中图分类号:R758.24 文献标识码:B 文章
期刊
按《中药新药临床研究指导原则》制定。临床痊愈:临床症状体征消失,胃镜复查粘膜慢性炎症明显好转,病理检查证实腺体萎缩,肠上皮化生和异型增生恢复正常或消失;显效:临床主要症状、体征消失胃镜复查粘膜慢性炎症好转,病理检查证实腺体萎缩、肠化生和异型增生恢复正常或减轻2度。  关键词:萎缩性胃炎;旋复代赭汤;疗效  中图分类号:R573.3+2 文献标识码:B 文章编号:1007-2349(2004
期刊
24例均为门诊就诊病人,其中青春期功能性子宫出血10例,年龄12~17岁,更年期功能性子宫出血14例,年龄40~50岁,更年期组中子宫内膜增生3例,有1例曾行清宫术止血未效,仍有子宫流血不止,再继续使用黄体酮片止血进行人工周期治疗也无效。  关键词:补中益气汤;功能性子宫出血;中医药疗法  中图分类号:R711.52 文献标识码:B 文章编号:1007-2349(2004)03-0016-01
期刊
泌尿系结石属中医石淋范畴。巢元方在《诸病源候论.诸淋病候》中提出“石淋者,肾主水,水结则化为石,故肾客砂石,肾虚为热所乘。”其病因病机为肾虚及下焦湿热,蕴积日久,煎熬成石,石阻尿路而见腰腹疼痛,小便艰涩,血尿等。  关键词:泌尿系结石;三金益肾汤;中医药疗法  中图分类号:R256.52 文献标识码:B 文章编号:1007-2349(2004)03-0020-01
期刊
诊断标准 WHO糖尿病诊断标准,确认并发周围神经病变32例糖尿病有下列表现之一者可以诊断为糖尿病周围神经病变。肢体感觉,运动神经病变表现为蚁走感、麻木、发凉、烧灼样、撕裂样或针刺样疼痛,肌无力等;深浅感觉明显减速,跟腱反射,膝反射明显减退或消失。  关键词:糖尿病神经病变;中医药疗法;灯盏细辛注射液  中图分类号:R587.2 文献标识码:B 文章编号:1007-2349(2004)03-00
期刊
功能性消化不良(functional dyspepsia FD)是指非器质性病变引起的以餐后胃胀满隐痛,暖气恶心、纳呆食少等为主要症状的临床综合征,在人群中发病率高达20~30%,占消化门诊的50%左右,由于其病因、病理机制涉及多种因素,现无特异性治疗,笔者运用逍遥散加减治疗FD患者75例,并与用吗丁啉治疗50例作对照,现将结果报告如下。  关键词:功能性消化不良;逍遥散;中医药疗法  中图分类号
期刊
Due to the vulnerability of fibers in optical networks, physical?layer attacks targeting photon splitting, such as eavesdropping, can potentially lead to large information and revenue loss. To enhance
期刊
小儿急性支气管炎可由细菌或/和病毒感染引起。常见细菌有肺炎球菌、链球菌、流感噬血杆菌、金黄色葡萄球菌等,病理上可引起支气管粘膜水肿和分泌物增多,甚至引起支气管痉挛,导致咳嗽、咯痰、喘息等症状,临床治疗通常是抗病毒、抗细菌同时进行。  关键词:双黄连注射液;小儿急性支气管炎;治效  中图分类号:R725.6 文献标识码:B 文章编号:1007-2349(2004)03-0021-01
期刊
六味地黄丸滋阴补肾,强体增精,改善机体神经内分泌功能,延缓衰老。自拟更年方中沙参甘平,益肾养肝,补五脏之阴;当归、熟地补血养心;太子参、茯苓益心气;远志宁心养神;枣仁、五味子收敛心气,以防耗散;黄芩、麦冬清心平肝;女贞子、五味子为滋肾强壮药;白芍敛肝和营。诸药共奏补肾养血,滋阴清热,协调阴阳之功。  关键词:更年期综合征;中医药疗法;疗效  中图分类号:R711.75 文献标识码:B 文章
期刊
面神经炎是多种原因所致面部表情肌瘫痪为主症的一种常见疾病。与称“面瘫”、“周围性面神经麻痹。”中医则称为“喎僻”或“口角喎斜”。本病常突然发作,病侧颜面麻木,不能皱眉,鼓颊,口角歪向健侧,闭目露睛,漱口漏水,进餐时食物常停滞于患侧齿颊之间,鼻唇沟平坦,耳后疼痛。给患者带来极大的痛苦。在临床上采取电针透穴配合牵正复原汤治疗面神经炎60例,有一定疗效,现报告如下。  关键词:面神经炎;电针透穴法;牵正
期刊