Variable Bit Rate Fuzzy Control for Low Delay Video Coding

来源 :ZTE Communications | 被引量 : 0次 | 上传用户:shena011
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Abstract
  Rate control plays a critical role in achieving perceivable video quality under a variable bit rate, limited buffer sizes and low delay applications. Since a rate control system exhibits non?linear and unpredictable characteristics, it is difficult to establish a very accurate rate?distortion (R?D) model and acquire effective rate control performance. Considering the excellent control ability and low computing complexity of the fuzzy logic in non?linear systems, this paper proposes a bit?rate control algorithm based on a fuzzy controller, named the Fuzzy Rate Control Algorithm (FRCA), for All?Intra (AI) and low?delay (LD) video source coding. Contributions of the proposed FRCA mainly consist of four aspects. First, fuzzy logic is adopted to minimize the deviation between the actual and the target buffer size in the hypothetical reference decoder (HRD). Second, a fast lookup table is employed in fuzzy rate control, which reduces computing cost of the control process. Third, an input domain determination scheme is proposed to improve the precision of the fuzzy controller. Fourth, a novel scene change detection is introduced and integrated in the FRCA to adaptively adjust the Group?of?Pictures (GOP) length when the source content fluctuates. The FRCA can be transplanted and implemented in various industry coders. Extensive experiments show that the FRCA has accurate variable bit?rate control ability and maintains a steady buffer size during the encoding processes. Compared with the default configuration encoding under AI and LD, the proposed FRCA can achieve the target bit rates more accurately in various classical encoders.
  Keywords
  rate control; video coding; fuzzy control; bit per pixel; rate?distortion model
  W1 Introduction
  ith the widespread development of various video streaming and multimedia networking applications, such as mobile TV, live video broadcasting and home cinema, a desire for high quality and low delay is increasing rapidly. In the video coding technology, rate control (RC) is an important tool to strengthen the coding efficiency, which can maximize visual quality under limited bandwidth and buffer capacity. RC can be classified into two types, the constant bit rate (CBR) and the variable bit rate (VBR). The CBR allocates an uniform bit to different coding units regardless of their characteristics. Correspondingly the CBR scheme leads to frequent fluctuation and degradation of the picture quality for consecutive pictures with fast motion or scene change. Compared with the CBR, the VBR scheme can dynamically adjust the target bit rate according to characteristics of video content and obtain the consistent picture quality, but it may cause extensive buffer delay.   RC regulates the encoder output bit rate by adjusting the quantization parameters (QP) to optimize the video quality under an available channel bandwidth. Thus, many RC algorithms concentrate on an accurate rate?distortion (R?D) model and an efficient bit allocation scheme. Along with the video coding standard developing, MPEG?2, H.263, MPEG?4, H.264/AVC and HEVC integrate RC algorithms into their encoders. Meanwhile, various improved RC algorithms [1], [2], [3] have been investigated.
  The R?D model is the kernel model for the great majority of RC algorithm since the QP value assignment depends on the transcendental R?D model. Based on discrete mathematics function, various R?D models were derived. T. Chiang, et al. [4] proposed a widely used quadratic R?D model by assuming the video source statistics are Laplacian distribution and expanding the rate distortion function into a Taylor series. Through observing the strong relationship between [Qstep] and the quantization parameter, Z. Li, et al. [5] improved the quadratic R?D model by using the predicted mean absolute difference (MAD) and [Qstep] instead of QP. To identify the impacts of the parameter Lambda in rate control, B. Li, et al. [6] proposed [R-λ] model and adopted it into High Efficiency Video Coding (HEVC) rate control. Besides, [ρ]?domain rate control [7] was proposed by Z. He, et al., where [ρ] is the zero ratios of transformed coefficients after quantization. In our previous work, a concise exponential R?Q model [8] was proposed for H.264/AVC, which takes picture complexity, gradient and histogram information into consideration.
  To provide a significant improvement in the coding efficiency, the study of RC has focused on the R?D [9] or the rate?quantization (R?Q) [10] model, and also dealt with the Intra only RC [11], the scalable video coding RC [12] and the bit allocation schemes [13]. These studies are important parts in RC and have helped develop various video compression standards, such as MPEG?2, H.263, MPEG?4, H.264/AVC and HEVC.
  Fuzzy logic has become a successful intelligent technology for control process [14]. Compared with traditional controllers such as proportional?integral?derivative (PID), proportional?integral (PI) and proportional?derivative (PD), fuzzy control does not require any internal mathematical model and can be applied to uncertainty and imprecision system [15]. Traditional controllers have been used to control over 90% industrial processes [14]. However, for more complex systems, such as time delay, time varying parameters, models mismatch and non?linearity, those controllers cannot generate satisfactory system performance since they depend on the mathematical model [16]. With the development of intelligent control, the fuzzy controller is adopted in industrial fields such as transportation devices, intelligent machines, power engineering, and chemical processes [17]. In the medical area, fuzzy logic effects on diagnosis, treatment of illness, patient pursuit, prediction of disease risk and other medical fields [18]. Besides, fuzzy controllers are also used for food control [19], network control [20] and so on. The experiments in literature show that fuzzy controllers adopted in related areas obtain effective performance, robustness and overall stability, and overcome the shortages of traditional controllers, especially in uncertain and complex systems.   RC systems also adopted the fuzzy logic into the coding process. D. H. Tsang, et al. [21] proposed a fuzzy logic?based control scheme for real?time MPEG video to avoid long delay or excessive loss at the user?network interface (UNI) in an asynchronous transfer mode (ATM) network. S. Sheu et al. [22] presented a fuzzy adaptive rate control scheme to select the transmission rate for frame transmissions in wireless LANs. M. Rezaei, et al. [23] introduced a semi?fuzzy rate control algorithm that utilized a fuzzy rate controller and a quality controller to adjust the QP value. However, existed fuzzy controllers based RC schemes are only used for certain video coding standards, or simple experience parameters are used into the fuzzy controller. To improve the efficiency and accuracy and to reduce the computational complexity of the rate control system, this paper proposes a novel adaptive RC approach, named the Fuzzy Rate Control Algorithm (FRCA). This is an input domain determination scheme to improve the precision of the fuzzy controller and adopts a fuzzy logic look?up table to derive increment QP values to minimize the deviation between the actual and the target buffer sizes in the hypothetical reference decoder (HRD).
  The proposed FRCA controls the bit rate by adjusting the QP value. It employs an improved fuzzy controller that utilizes an exponential R?D model to determine the input domain. Then, the proposed fuzzy controller is adopted to generate a fast lookup table. Through looking up the fast table, the FRCA uses the signals from the buffer to calculate the increment QP value, which is used for the RC process. The proposed FRCA also presents a novel scene change detection to adaptively adjust the Group?of?Pictures (GOP) length when the source content fluctuates. The proposed FRCA has high adaptability and can be transplanted and implemented in various industry coders. In this paper, FRCA is implemented on MPEG?2, H.263, MPEG?4, H.264/AVC and HEVC encoders. Simulations and analysis show that the proposed FRCA provides good performance on the peak signal?to? noise ratio (PSNR) gains and bit?rate savings.
  The remainder of this paper is organized as follows. Section 2 introduces a fuzzy logic controller for the low delay video coding rate control. Section 3 presents a new input domain determination for the fuzzy controller. Section 4 proposes a scene change detection scheme and Section 5 gives the description of the proposed RC algorithm step by step. Simulation results and performance analysis are described in section 6. Finally, Section 7 concludes the paper.   2 Fuzzy Controller for Rate Control
  Different from traditional controller systems with exact mathematical models, the fuzzy controller has an intelligent control process that is set according to the experience values and can be applied into unpredictable or uncertain system. In this paper, we adopt a fuzzy controller to regulate the suitable QP value for encoding.
  2.1 Structure of Fuzzy Controller
  Fig. 1 shows the structure of the proposed fuzzy logic controller used for RC. It is composed of the fuzzy interface, the knowledge base (rule base), the inference mechanism and the defuzzy interface. First, a fuzzy interface transforms an exact measured value into a fuzzy value to fit the fuzzy calculation. Second, this fuzzy value is utilized to calculate the fuzzy output value by the fuzzy control knowledge base and the inference mechanism. Third, the fuzzy system converts the fuzzy output value to a precise value to control the coding process.
  The fuzzy controller (Fig. 1) has two input variables and an output variable. The input variable in the video coding consists of two components, which are the buffer deviation and the buffer deviation rate of changes. Compared with the multiple?input multipleoutput fuzzy controller, the proposed controller has simple architecture that results in low computing complexity. Meanwhile, it can accurately reflect the dynamic characteristics of the output variable during the control process.
  At the encoding time[t], the variable target bit rate [TBRt] can be converted to the target bit per pixel [Tbppt] by (1).
  [Tbppt=TBRt/(W?H?Frate)] , (1)
  where [Frate] indicates the actual frame rate and [W]and [H] are frame’s width and height.
  After the t?th frame is encoded, let [Rt] denote the actual output bits, then the current buffer size [Bt] can be updated by (2).
  [Bt=Bt-1+Rt/(W?H)-Tbppt] . (2)
  2.2 Fuzzy Interface
  It is important to recognize input variables and output variables of the fuzzy control system. Generally, the fuzzy control system defines two variables as the input variables. One variable is the deviation between the actual value and system default value. Another is the change rate of this deviation. The control value is determined as the output of the system. Since QP value is the principal term that is employed to adjust the encoder output bit rate and avoid over?flow and under?flow of the buffers in video decoders, we take QP as the output of the system in this fuzzy controller.   Generally, the content and motion complexity of the successive pictures have a high correlation, so the QP at the coding time [t]?1 is very close to that of the coding time[t]. Without loss of generality, we define the incremental QP as output variable [u?] , and then calculate the current QP based on the incremental QP.
  For low delay video capture and transmission system, a relatively small buffer size is adequate. However, for random access applications such as network televisions, a large buffer size is obligatory. When feeding the bit stream into a HRD with suitable parameters, the HRD buffer should be neither overflow nor underflow. The overflow brings unexpected frame skipping in the encoder, which may result in visual quality degrades. On the other hand, the underflow can lead to under utilization of the available network bandwidth.
  We define the deviation between the target buffer level [BT] and current buffer size [Bt] as the input variable[et], define the change rate of the deviation as another input variable [eDt]. Generally, in order to avoid overflow and underflow and to reduce the deviation between the target buffer size and the current buffer level, [BT] is set to constant value 0. [et] and [eDt] can be calculated by (3) and (4).
  [et=Bt-BT], (3)
  [eDt=et-et-1]. (4)
  After the fuzzy controller inputs and outputs are selected, we define the fuzzy control system. Fuzzy control is a fuzzy logic based control method. In the fuzzy logic, fuzzy input variables are used for the fuzzy inference. Thus, precise inputs should be converted to the fuzzy subset by the fuzzy interface, which can be interpreted and compared to the rules in the rule?base. The actual input variables are [et] and[eDt], the scaled variables are [E] and [EC], we define the domain of the scaled variables with finite integer. The domain [S] can be expressed by (5).
  [S=-6,-5,-4,-3,-2,-1,0,+1,+2,+3,+4,+5,+6]. (5)
  The practical domains of input variables [et] and[eDt] are different from the domain[S], so the actual input variables [et] and[eDt] should be scaled into the scaled variables in domain [[-6,6]]. If the values of input variables [et] and[eDt] range in [[aE,bE]] and [[aEC,bEC]] respectively, the [et] and[eDt] can be scaled to[E] and [EC]by (6),
  [E=12?(et-(aE+bE)/2)bE-aEEC=12?(eDt-(aEC+bEC)/2)bEC-aEC]. (6)
  After scaling, scaled variables [E] and [EC] are then fuzzified by input fuzzy sets. Input fuzzy sets are defined on domains [S]. Since the number of fuzzy segmentation determines the fuzzy control accuracy, the fuzzy segmentation is significant in the fuzzy controller. Designing more fuzzy segmentation levels results in more control rules and increases the computing complexity. On the contrary, less fuzzy segmentation level degrades the control precision and has low calculating complexity. To maintain the trade?off between precision and complexity, the most fuzzy control literature [24] adopts the fuzzy controller, which divides the linguistic variable into 7 fuzzy sets, to control systems and simulations show this fuzzy controller can obtain effective control performance. In this paper, we also use 7 fuzzy sets for input and output variables. Each input fuzzy set is assigned a linguistic name: negative big (NB ), negative medium (NM ), negative small (NS ), zero (ZO ), positive small (PS ), positive medium (NM ), positive big (PB ). We define fuzzy sets of[E], [EC] and [U] as[NB,NM,MS,ZO,PS,][PM,PB].   2.3 Knowledge Base
  Fuzzification results are used in fuzzy rules to make combined membership values for fuzzy inference. Once the domain [S] and the fuzzy segmentation are selected, the membership function is required, which is a quantitative description of the fuzzy conception and is the basis of the fuzzy controller. It can be utilized to transform the quantized precise inputs into fuzzy sets. However, it is difficult to define the uniform membership function type. In order to decrease computational complexity and acquire low delay, we choose a simple triangle membership function to change the values of[E], [EC] and [U] into a membership value that is confined to [[0,1]].
  Table 1 gives the membership values of the linguistic variables[E], [EC] and [U] for fuzzy sets. The first row and column of Table 1 are 13 values of the domain [S] and the fuzzy sets.
  The fuzzy control rules are set up based on the expert knowledge or the manual operation. Control rules are critical to the fuzzy control system, since the quantity and accuracy of the rules have an effect on the performance of the control system. In this paper, we adopt the double?input singleoutput fuzzy controller, the double inputs are [E] and [EC], the single output is the control variables [U]. So the fuzzy control rules in this paper is IF[E]AND [EC] THEN[U]. Since the fuzzy spatial segmentation of the proposed fuzzy controller has seven levels, we can get [7×7=49] fuzzy control rules which represent the relationships between inputs ([E] and [EC]) and output ([U]).
  Table 2 gives the fuzzy control rules in this work. The first row and column of Table 2 are the fuzzy sets of [E] and the fuzzy sets of [EC] respectively. Other values are the elements of the fuzzy sets of [U].
  2.4 Inference Mechanism
  Inference mechanism emulates the expert’s decision in interpreting and applying knowledge to control the process. By inference mechanism, the membership values produced by fuzzification are transformed to the output fuzzy set.
  If a rule in Table 2 is IF[E]is [Ai] AND [EC] is [Bi] THEN [U]is[Ci], where[Ai], [Bi] and [Ci] are fuzzy sets, these fuzzy sets are defined by membership function in(7), (8) and (9).
  [Ai=EμAi(e)/E] , (7)
  [Bi=ECμBi(ec)/EC] , (8)
  [Ci=UμCi(u)/U] , (9)
  where [μAi(e)], [μBi(ec)] and [μCi(u)] are membership values. The rule deduces a fuzzy relation [Ri] by (10).
  [Ri=Ai?Bi?Ci] , (10)
  where [?] is the mini?operation rule of fuzzy implication. The ultimate fuzzy relation [Re] can be expressed by (11).   [Re=?Ri], (11)
  where [?] represents the union operation.
  When inputs [E] and [EC] are given as [A?] and [B?], the control variables [U] can be calculated by (12).
  [U=A??B?⊕Re] , (12)
  where [A?], [B?] and [U] are fuzzy sets, [⊕] is the sup?min compositional operator.
  2.5 Defuzzy Interface
  The output control value [U] generated by the inference mechanism is a fuzzy value, which cannot be directly used in the monitor. The defuzzification converts the result obtained by the inference mechanism into the exact value which can be applied to control the process. In this paper, the fuzzy set [U] expressed by membership values should be transformed to [u] by the center of gravity method in (13).
  [u=iμU(Ui)?UiiμU(Ui)]. (13)
  Fuzzy sets [A?] and [B?] can be acquired according to Tables 1 and 2 after two input variables are calculated. Then the control variable [u] can be calculated according to (10), (12) and (13).
  In order to achieve low computation work, we built a fuzzy logic querying table shown in Table 3 to represent the relationship between the inputs [e,ec]and output [u]. Through looking up this table, we can obtain the control variable based on the input variables [et] and [eDt].
  The output control value [u] in [U] cannot be directly used in the RC system since the domain of the [u] is different from the domain of adjustable parameter [ΔQP]. Thus, the scale factor [Ku] is employed to converts the value in Table 3 to the actual control value [u?], which can be expressed by (14).
  [u?=[u?Ku]]. (14)
  The maximum range of [u] value is [±6], while the QP value variation allowed in rate control amplitude is [±3], so the scale factor [Ku]can be derived from (15).
  [Ku=maxΔQPmaxu∈Su=3/5=0.6]. (15)
  After scaling, a fast lookup table used for encoding process is obtained (Table 4).
  2.6 QP Computation in Rate Control
  The incremental QP value [ΔQ] can be calculated based on the proposed fuzzy control scheme. This [ΔQ] and the previous frame QP value [Qt-1] are adopted to calculate the QP value, which can be expressed by (16), to encode the current frame.
  [Qt*=Qt-1+ΔQ=Qt-1+u?]. (16)
  As usual, the QP value is bounded in [[Qmin,Qmax]], which intends to avoid large quality fluctuation between two consecutive frames. Therefore, all the derived QP values are clipped by (17).
  [Qt=maxQmin,minQmax,Qt*]. (17)
  3 Input Domain Determination for Fuzzy   Controller
  In a fuzzy controller, the practical domain of input variables [et] and [eDt] can be converted to the internal domain [E] and [EC] based on the practical variation range of input variables. To quantize [et] and [eDt] , we pre?calculate the practical variation range of input variables [[aE,bE]] and [[aEC,bEC]] . This work inherits the exponential R?D model to pre?calculate the practical variation range, which is expressed in (18).
  [Rbpp=α?e-β?Q], (18)
  where [Rbpp] represents output encoded bits per pixel, [Q] denotes quantization parameter QP, and [α] and [β] are model parameters.
  We run tests on different representative video sequences. Table 5 lists the exponent fitting results including [α] , [β] and the Pearson correlation coefficient [P2]. [P2] is the correlation between the model and actual R?Q data. The correlation is stronger with [P2] being closer to 1, and vice versa. Table 5 shows that the R?Q curve intercept [α] drastically changes, but the curvature [β] changes steadily. For example, in the 720p sequence, the min and max value of [α] are 6.8911 and 27.178 respectively, however [β] is bounded in [[0.0756,0.0949]]. Thus, we adopt [β] to pre?calculate the practical variation range.
  The first?order and second?order differentials of R?Q model (18) can be expressed by
  [dRbppdQ=α?e-β?Q?(-β)=-β?Rbpp], (19)
  [d2RbppdQ2=-β?α?e-β?Q?(-β)=β2?Rbpp]. (20)
  When QP varies in a small range, we suppose [ΔRbpp≈dRbpp] , [ΔQ≈dQ], [ΔR2bpp≈d2Rbpp],[ΔQ2≈dQ2]. Therefore, if [ΔQ] bounds in [[-3,3]], the practical variation range of the two input domain can be calculated by (21).
  [[aE,bE]=[-ΔRbpp,ΔRbpp]=[-β?Rbpp?3,β?Rbpp?3]]. (21)
  Since [β] changes steadily, the mean of the [β] in Table 5 is adopted to calculate (21) and (22). The [Rbpp] can be approximately arrived by the average of output bits per pixel.
  [[aEC,bEC]=[-ΔRbpp2,ΔRbpp2]=[-β2?Rbpp?32,β2?Rbpp?32]], (22)
  [Rbpp=1n?1W?H?i=1nRt-i] , (23)
  where [n] is the window size that is empirically set to 15.
  4 Scene Change Detection
  When an unpredictable scene change happens, video content changes among consecutive frames. In this condition, information, such as QP, from previous frames cannot be directly employed to encode the current frame. Therefore, scene?change detection is significant to recognize changes and prevent unexpected inter prediction. There is much research work dealing with the scene change detection based on the video frame features, such as luminance component [25], YUV’s mean [26], image complexity MAD [27] and color histogram information [28]. The statistical properties of video sequences indicate that the histogram information between the two adjacent frames has a greater difference when the scene change occurs. Therefore, this paper proposes scene a unified scene change detection algorithm based on the histogram statistics information of the video frame, which integrates the Pearson correlation coefficient and cosine similarity to calculate the histogram correlation between the two adjacent frames.   4.1 Pearson Correlation Coefficient
  The Carl Pearson correlation coefficient [P2t-1,t] is used to represent the histogram relationship between two consecutive frames.
  [Pt-1,t=i=1nHt-1i-Ht-1Hti-Hti=1nHt-1i-Ht-12i=1nHti-Ht2] , (24)
  with
  [Ht=i=1nHti/n] , (25)
  where [H] is histogram, [Ht[i]] represents the total number of pixels at gray level [i] in coding the current frame, [n] denotes the total number of elements in [H] ,which is usually set to 256, and[Pt-1,t] ranges in [[-1,1]].
  The correlation between two consecutive frame histograms can be measured by [P2t-1,t]. Generally, if the [P2t-1,t] is greater than the predefined threshold 0.8, the correlation becomes obvious. Otherwise, the correlation is weak, which indicates the scene change happens.
  4.2 Cosine Similarity of Histograms
  Another algorithm for scene change detection between successive frames is the cosine similarity. It is expressed by (26).
  [cosθt-1,t=Ht-1?HtHt-1?Ht=i=1nHt-1i?Htii=1nHt-1i2i=1nHti2], (26)
  where the cosine similarity is bounded in [[0,1]]. If the [cos(θt-1,t)] is close to 1, it denotes the correlation is strong. Otherwise, the correlation is weak, which shows the scene change occurs.
  4.3 Unified Scene Change Detection and GOP Adjustment
  To improve the accuracy of scene change detection, we propose a novel scene change detection method that combines the Pearson correlation coefficient and the cosine similarity to detect the histogram correlation between the two adjacent frames. We define[Simt] to express the similarity between two successive frames, which is calculated by (27).
  [Simt=Pt-1,t?cos(θt-1,t)]. (27)
  We predefine a threshold [ξ], and it means a scene change occurs if the [Simt] is lower than the [ξ] . Otherwise, there is no scene change. [ξ] is an experience value showing the sensitivity of the scene detection, which is empirically set to 0.85.
  The adaptive GOP structure is formed according to the scene structure. When a scene change occurs, the current frame type is set to I?frame. Besides, we should adjust the GOP size and allocate a new GOP.
  5 Fuzzy Rate Control Algorithm
  This section exhibits a new rate control algorithm (Algorithm 1) that integrates the proposed fuzzy controller and the unified scene change detection.
  6 Simulation Results
  To assess the performance of the proposed rate control scheme, the experiments were conducted on MPEG?2, H.263, MPEG?4 and H.264/AVC, and Hx265 separately. There are two prediction structures over the coding tests. One is All?Intra frames coding (II..I) and the other is low?delay frames coding; only the first frame is I?frame and the remaining frames are all P?frames (IPP..P). Comprehensive experiences have been carried out to evaluate the performance of the proposed FRCA and the default configuration encoding. Typically, the test QP points are [{8,11,14,17,20}] for the QP range of [[0,31]] and [{17,22,27,32,37}] for the QP range of [[0,51]].   Table 6 gives seven groups of video sequences generated for the coding tests. All of the encoding tests in this paper are based on these video sequences.
  Four classical encoders are used to show the control accuracy of the FRCA in Table 7. We can find that the bit rate error (the difference between the target bit rate and actual bit rate) of the FRCA is relatively small. For AI case, the average bit rate error is less than 0.066%; for LD case, the control accuracy can achieve 0.03%.
  Fig. 2 shows the simulation results of the HRD buffer size during the coding process with All?Intra and low?delay coding structure. The X?axis and Y?axis denote the picture index and the actual buffer size respectively. It is obvious that, compared with the default configuration encoding, our algorithm significantly acquires a more stable buffer size during the entire coding process. According to the upper and lower limits of the Y?axis given in Fig. 2, FRCA can control the buffer curves in a very narrow interval. Although there are large fluctuations caused by scene change, the buffer curves can more quickly response and return to steady, which illustrates that the FRCA can effectively work in video coding.
  In Figs. 2b and 2d, it can be seen that buffer curves are on the brink of the target buffer level (constant set to 0), and fluctuate frequently. However, our rate controller quickly and effectively controls the buffer and makes the buffer close to target buffer levels, which indicates the FRCA has a strong control ability.
  Fig. 3 shows the actual output bit curves for each frame coding. The X?axis is the picture index and Y?axis denotes the actual output bit per pixel (bpp) during frame level encoding. It is evident that the curves of FRCA are consistent and very regularly fluctuates around the target bpp. The four classic encoders under default configuration value coding exhibit that frame level bpp are not controlled and present obvious volatility characteristics. In Fig. 3, FRCA variances σ are smaller than those of classic encoders, which indicates that FRCA can totally fulfil the purpose of rate control. The FRCA bpp variance σ in all of the sub?figures is significantly less than the σ value of default configuration encoding. Meanwhile, the σ value of FRCA with scene change ON is smaller than the σ value in the OFF condition. This means that the scene change detection with adaptive GOP length can obviously enhance FRCA performance.   To further demonstrate the FRCA algorithm performance in the latest encoders, comprehensive experiences have been conducted on Hx265. The simulations are performed over the sequences of Class A-Class F suggested by the common test conditions (CTC) for Hx265 [29], which include 20 sequences with resolutions ranging from 4K to WQVGA. Table 8 presents the control accuracy results for comparison with the default RC algorithm. From Table 8 we can find that the bit rate error of the FRCA is much smaller than the default RC algorithm. It shows that the proposed algorithm works better than the default RC algorithm for both AI structure and LB structure on Hx265. To clearly represent the FRCA control accuracy on Hx265, Fig. 4 shows the simulation results of the HRD buffer size during the coding process with All?Intra and low?delay coding structure. The HRD buffer state diagrams of the default RC algorithm are provided in (a), (b), and (c) in the figure and the HRD buffer state diagrams of the FRCA are presented in (d), (e), and (f). It is obvious that, compared with the default configuration encoding, our algorithm can strictly control the HDR buffer in a very narrow interval and acquire a more stable buffer size during the entire coding process.
  7 Conclusions
  This paper develops a novel RC algorithm base on a fuzzy controller regardless of R?D model dependence, named the Fuzzy Rate Control Algorithm (FRCA). The significant features of the FRCA are simple, efficiency and low computation complexity. Considering the efficiency and accuracy of the control system, an input domain determination scheme is adopted in the proposed fuzzy controller. The work employs a fast lookup table generated by fuzzy logic to calculate increment QP value, which reduces the computation complexity. In addition, we propose scene change detection for adaptive GOP length adjustment. FRCA achieves accurate rate control while maintains extremely low delay between the encoder and the decoder. Extensive simulations and analytical results demonstrate that the FRCA outperforms the default configuration encoding in bitrate accuracy.
  References
  [1] H. Choi, J. Nam, J. Yoo, and D. Sim, “Rate control based on unified RQ model for HEVC,” in JCT?VC of ITU?T SG16 WP3 and ISO/IEC JTC1/SC29/WG11, JCT?VC H0213, San José, CA, USA, Feb. 2012.
  [2] L. Xu, S. Kwong, Y. Zhang, and D. Zhao, “Low?complexity encoder framework for window?level rate control optimization,” IEEE Transactions on Industrial Electronics, vol. 60, no. 5, pp. 1850-1858, May 2013. doi: 10.1109/TIE.2012. 2190960.   [3] C. W. Seo, J. H. Moon, and J. K. Han, “Rate control for consistent objective quality in high efficiency video coding,” IEEE Transactions on Image Processing, vol. 22, no. 6, pp. 2442-2454, Jun. 2013. doi: 10.1109/TIP.2013.2251647.
  [4] T. Chiang and Y. Zhang, “A new rate control scheme using quadratic rate distortion model,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 7, no. 1, pp. 246-250, Feb. 1997. doi: 10.1109/76.554439.
  [5] Z. Li, F. Pan, K. Lim, et al., “Adaptive basic unit layer rate control for JVT,” in JVT?G012?r1, 7th Meeting, Pattaya, Thailand, 2003.
  [6] B. Li, H. Li, L. Li, and J. Zhang, “Rate control by R?lambda model for HEVC,” in JCTVC?K0103, 11th JCTVC Meeting, Shanghai, China, Oct. 2012.
  [7] Z. He and S. K. Mitra, “Optimum bit allocation and accurate rate control for video coding via ρ?domain source modeling,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 12, no. 10, pp. 840-849, Oct. 2002. doi: 10.1109/TCSVT.2002.804883.
  [8] Y. Zhou, Y. Sun, Z. Feng, and S. Sun, “New rate?distortion modeling and efficient rate control for H. 264/AVC video coding,” Signal Processing: Image Communication, vol. 24, no. 5, pp. 345-356, May 2009. doi: 10.1016/j.image.2009.02.014.
  [9] L. Xu, D. Zhao, X. Ji, et al., “Window?level rate control for smooth picture quality and smooth buffer occupancy,” IEEE Transactions on Image Processing, vol. 20, no. 3, pp. 723-734, Mar. 2011. doi: 10.1109/TIP.2010.2063708.
  [10] H. Choi, J. Yoo, J. Nam, D. Sim, and I. Bajic, “Pixel?wise unified rate?quantization model for multi?level rate control,” IEEE Journal of Selected Topics in Signal Processing, vol. 7, no. 6, pp. 1112-1123, Dec. 2013. doi: 10.1109/JSTSP.2013.2272241.
  [11] B. Yan and M. Wang, “Adaptive distortion?based intra?rate estimation for H.264/AVC rate control,” IEEE Signal Processing Letters, vol. 16, no. 3, pp. 145-148, Mar. 2009. doi: 10.1109/LSP.2008.2010813.
  [12] J. Yang, Y. Sun, Y. Zhou, and S. Sun, “Incremental rate control for H.264 AVC scalable extension,” Multimedia Tools and Applications, vol. 64, no. 3, pp. 581-598, Jun. 2013. doi: 10.1007/s11042?011?0967?y.
  [13] Y. Zhou, Y. Sun, Z. Feng, and S. Sun, “PID?based bit allocation strategy for H.264/AVC rate control,” IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 58, no. 3, pp. 184-188, Mar. 2011. doi: 10.1109/TCSII.2011.2106350.
  [14] H. Ying, Fuzzy Control and Modeling: Analytical Foundations and Applications. Hoboken, USA: Wiley IEEE Press, 2000. doi:10.1109/9780470544730   [15] Y. Zhang, “The application of the fuzzy control algorithm in the automatic track following robot,” in IEEE Second International Conference on Mechanic Automation and Control Engineering (MACE 2011), Hohhot, China, 2011, pp. 759-762. doi:10.1109/MACE.2011.5987038.
  [16] R. Lasri, I. Rojas, and H. Pomares, “Explaining how intelligent control has improved the way we live: a survey on the use of Fuzzy Logic Controllers in daily human life,” in IEEE International Conference on Multimedia Computing and Systems (ICMCS), Ouarzazate, Morocco, 2011, pp. 1-6. doi: 10.1109/ICMCS.2011.5945577.
  [17] T. Muthuramalingam, B. Mohan, and A. Rajadurai, “Monitoring and fuzzy control approach for efficient electrical discharge machining process,” Materials and Manufacturing Processes, vol. 29, no. 3, pp.281-286, Nov. 2013. doi: 10.1080/10426914.2013.864412.
  [18] S. Barro and R. Marín, Fuzzy Logic in Medicine, vol. 83. Heidelberg, Germany: Physica?Verlag Heidelberg, 2002. doi: 10.1007/978?3?7908?1804?8.
  [19] S. Birle, M. Hussein, and T. Becker, “Fuzzy logic control and soft sensing applications in food and beverage processes,” Food Control, vol. 29, no. 1, pp. 254-269, Jan. 2013. doi: 10.1016/j.foodcont.2012.06.011.
  [20] H. Kazemian and L. Meng, “An adaptive control for video transmission over bluetooth,” IEEE Transactions on Fuzzy Systems, vol. 14, no. 2, pp. 263-274, Apr. 2006. doi: 10.1109/TFUZZ.2005.864080.
  [21] D. Tsang, B. Bensaou, and S. Lam, “Fuzzy?based rate control for real?time MPEG video,” IEEE Transactions on Fuzzy Systems, vol. 6, no. 4, pp. 504-516, Nov. 1998. doi: 10.1109/91.728442.
  [22] S. Sheu, Y. Wang, H. Yin, and J. Chen, “Adaptive rate controller for mobile ad hoc networks,” International Journal of Mobile Communications, vol. 1, no.3, pp. 312-328, Sept. 2003. doi: 10.1504/IJMC.2003.003497.
  [23] M. Rezaei, M. Hannuksela, and M. Gabbouj, “Semi?fuzzy rate controller for variable bit rate video,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 18, no. 5, pp. 633-645, May 2008. doi: 10.1109/TCSVT.2008.919108.
  [24] J. Lin and R. Lian, “Intelligent control of active suspension systems,” IEEE Transactions on Industrial Electronics, vol. 58, no. 2, pp. 618-628, Feb. 2011. doi: 10.1109/TIE.2010.2046581.
  [25] J. Shi, L. Xu, and C. Zhang, “Scene adaptive frame?layer rate control algorithm for H.264,” Application Research of Computers, vol.27, no. 5, pp. 1968-1970, May 2010. doi: 10.3969/j.issn.1001?3695.2010.05.105.
  [26] Y. Wang and Y. Xue, “Rate control algorithm based on scene change,” Video Engineering, vol. 33, no. 12, pp. 17-20, Dec. 2009.
  [27] H. Duan and T. Wang, “Improved frame?layer rate control algorithm for H.264,” Journal of Computer Applications, vol. 29, no. 4, pp. 1008-1010, Apr. 2009. doi: 10.3724/SP.J.1087.2009.01008.
  [28] S. Kang, S. Cho, and S. Yoo, “Scene change detection using multiple histograms for motion?compensated frame rate up?conversion,” Journal of Display Technology, vol. 8, no. 3, pp. 121-126, Mar. 2012. doi: 10.1109/JDT.2011.2167740.
  [29] F. Bossen, “Common HM test conditions and software reference configurations,” in 12th Meeting of Joint Collaborative Team on Video Coding (JCT?VC) of ITU?T SG, Geneva, Switzerland, 2013, Document JCTVC?K1100.
  Manuscript received: 2015?12?28
其他文献
随着中国吸引外资的快速增长以及对外贸易的不断扩大,对中国吸收的直接投资和对外贸易之间关系的科学把握显得尤为重要。欧盟是中国主要的外资来源国和贸易伙伴国之一,研究欧盟
汽轮机是发电厂的主要机组设备,汽轮机运行过程中高压调节阀很容易出现故障,本文对600MW汽轮机高压调节气阀的典型故障及应对策略进行分析与探讨,旨在为电厂的汽轮机运行提供
以水资源短缺、水污染严重、水生态恶化为主要内容的“水危机”问题已成为制约社会经济可持续发展的瓶颈之一。有效的环境规制是提高资源利用效率、缓解环境污染、提高环境质
对男性饰品来说,粗犷、厚重、刚性这些元素似已陈旧,取而代之的是前卫与多元化风潮。古朴的中国剪纸一落在手腕上就变得酷劲十足;金属质感的音符别针点缀出些许小资情调:皮
期刊
Abstract Network emulation is significant because of its ability to study network architecture running on real operating system and the whole protocol stack. However, conservatively allocating a physi
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
本文以四川省内成都、 自贡、 绵阳、 遂宁、 乐山五大片区调研数据为样本,通过分析四川省旅游住宿业规模与结构、 平均房价、 平均住房出租率及RevPAR等基本情况,得出四川省
利用某加氢裂化装置,掺炼催化柴油进行加氢改质试验,考察混合劣质柴油对加氢改质产物分布及产品质量的影响 [1] .结果表明: 以焦化蜡油,掺炼催化柴油进行加氢改质, 操作参数
给出了“石头剪刀布”的一种“白富美”新玩法。所谓“白”,即思路清清楚楚、明明白白;所谓“富”,即理论内涵非常丰富;所谓“美”,即结论绝对数学美。安全通论的魅力也在这里得到
Abstract Distributed Denial of Service (DDoS) attacks have been one of the most destructive threats to Internet security. By decoupling the network control and data plane, software defined networking