SKIP Prediction for Fast Rate Distortion Optimization in H.264

Size: px
Start display at page:

Download "SKIP Prediction for Fast Rate Distortion Optimization in H.264"

Transcription

1 SKIP Prediction for Fast Rate Distortion Optimization in H.264 Avishek Saha, Kallol Mallick, Jayanta Mukherjee, Senior Member, IEEE and Shamik Sural, Senior Member, IEEE Abstract In H.264, the optimal coding mode for each macroblock (MB) is selected by exhaustively searching all MB modes in the multiple reference frames. The exhaustive mode search has a very high computational complexity. This paper proposes three different approaches along with their combination to reduce the complexity of the rate-distortion optimized mode decision process in H.264. These approaches are based on transform-domain properties and spatio-temporal correlation in video sequences. Experimental results demonstrate that the proposed methods achieve around 5-7 times improvement in average speedup with good prediction quality and comparable bitrate 1. Index Terms SKIP Prediction, ρ-domain, restricted reference frame, coding efficiency. I. INTRODUCTION The H.264/AVC is the state-of-the-art video compression standard recently developed by the ITU-T/ISO/IEC Joint Video Team [1]. Compared to previous standard this new video coding standard can deliver significantly improved compression efficiency, which makes it possible to transmit high quality video over lower bit rate channels. In addition, the increased flexibility of H.264 encoding and transmission caters to a broad spectrum of video applications enabling new video services over cable, satellite and mobile networks. However, these performance gains of H.264 come at a cost of increased computational complexity [2]. The decoding complexity increases by a factor of four, whereas the encoding complexity may be as high as nine times over MPEG-2. This huge increase in encoder complexity is mainly due to Rate-Distortion Optimization (RDO) of the Motion Estimation (ME) and Mode decision (MD) processes in H.264. An H.264/AVC video encoder typically consists of the encoding modules of motion estimation, motion compensation, integer transform, quantization and entropy coding. In H.264/AVC, an MB can be encoded using intra prediction from neighboring samples in the same frame or using inter prediction from samples in a previously coded frame/frames. In addition, H.264 supports the use of variable Macro Block (MB) partition sizes encoded in the form of MB modes [1]. The MBs can be of 1 This work has been supported by a research grant from the Department of Science and Technology (DST), Govt. of India, under Research Grant No. SR/S3/EECE/024/2003. Avishek Saha, Kallol Mallick and Jayanta Mukherjee are with the Department of Computer Science and Engineering, Indian Institute of Technology, Kharagpur, WB India ( avishek.saha@gmail.com, kallol@cse.iitkgp.ernet.in, jay@cse.iitkgp.ernet.in ). Shamik Sural is the School of Information Technology, Indian Institute of Technology, Kharagpur, WB India ( shamik@sit.iitkgp.ernet.in ) N N 0 N/2 0 N/2 SKIP 16 x 8 N 16 x x 16 N/ P 8 x 8 8 x 8 8 x 4 4 x 8 4 x 4 Fig. 1 Partition sizes of macroblocks (N=16) and submacroblocks (N=8) in H.264/AVC. size 16x16, 8x16, 16x8, 8x8, 8x4, 4x8 and 4x4. Fig. 1 shows the different MB modes used in H.264. An MB with large partition size requires a single motion vector to represent its motion information. However, a single motion vector may not be able to accurately represent the motion information of the entire MB resulting in a large residual error and hence a large number of bits for encoding the transformed residual error. Again, an MB mode with small MB partitions may require more bits to represent the motion information and fewer bits to encode the residual error. Thu selection of the proper encoding mode has considerable impact on the compression efficiency of the H.264 encoder. In H.264, the MB mode with the best Rate-Distortion (RD) performance is selected as the optimal encoding mode. The optimized RD cost is obtained by using Lagrangian minimization [3]. The minimization cost function as follow J = D + λr (1) where, J is the cost, D is the distortion, λ is the Lagrangian multiplier and R is the rate. H.264 uses rate distortion optimization for both motion estimation and mode decision. To reduce the computational complexity of RD optimized H.264, several fast approaches have been proposed [4] [12]. These algorithms either speed up the motion estimation or the mode decision process of the encoder. In [4], the macroblock content complexity was employed to reduce the number of inter-modes check for each MB. The SATD-based mode selection method was used by Tanizawa et al. [5] to reduce the number of candidate modes. Subsequently, RDO was performed on the reduced number of candidates. A novel technique to limit the number of candidate MB modes to a small subset by pre-encoding a down-sampled version of the original image was proposed in [6]. The edge direction

2 map [7] and amplitude of the edge vector [8], which can be obtained from the Sobel operator, have been successfully used to predict the Inter and Intra MB mode respectively. The MB SKIP decision has also been taken based on the difference between the average boundary error and the average bit-rate cost of the best Inter mode [9]. The weighted cost obtained from quantized transform coefficients has been used [10] to save substantial amount of computation. In [11] and [12], motion search information has also been used to skip the checking of unlikely block sizes. This work presents three approaches to expedite the RDO process in H.264. The first two methods are based on SKIP prediction and the last method is based on reducing the number of reference frames to a single best matching frame. The mode decision process will be performed on this selected best reference frame. The first approach toward SKIP prediction uses the zero-count in transform-domain to decide on whether to SKIP or not. The transform-domain model used is the ρ- domain model [13], which has so far been used for bit estimation purposes [14]. But, to the best of our knowledge, the ρ-domain model has never been used for SKIP prediction. In the second approach, the spatio-temporal correlation properties of video sequences have been utilized to further improve the ρ- domain SKIP prediction results. The third approach reduces the number of reference frames on which the mode decision is to be performed. Only a single best reference frame is selected by performing an initial low cost matching on the multiple reference frames. The selected reference frame is then used in the mode decision process. The rest of this paper is organized as follows. Section II reviews fast rate distortion optimization in H.264. Fast RDO based on skip prediction and reduced reference frame based fast RDO have also been discussed. The proposed approaches based on SKIP prediction and reduced reference frame have been presented in Section III. Section IV compares and analyses the performance of the individual approaches as well as a combination of all the three approaches. Finally, Section V concludes this paper. II. FAST RATE DISTORTION OPTIMIZATION IN H.264 Rate-distortion optimization techniques have been widely applied to video encoders [3]. The RDO techniques result in substantial improvement in compression efficiency. In RDO, the Lagrangian minimization method is used to find the best MB mode among the modes of {INTRA4X4, INTRA16X16, INTER16X16, INTER16X8, INTER8X16, INTER8X8, INTER8X4, INTER4X8, INTER4X4, SKIP, DIRECT}. For each MB in an inter-frame, the RDO is first used to obtain the optimal motion vector by minimizing the cost function, J ( mv, λ MOTION ) = D( c( mv)) + λmotion R( mv pmv) (2) where, mv is the motion vector obtained by motion estimation, pmv is the predicted motion vector and λ MOTION is the Lagrange multiplier. R(mv-pmv) represents the motion information and D(c(mv)) is the sum of absolute differences (SAD) between the original video signal s and the coded video signal c. For multiple reference frame the final reference frame is selected by minimizing (3), J ( REF λmotion ) = D( c( REF, mv( REF))) + (3) λmotion ( R( mv( REF) pmv( REF)) + R( REF)) where, D(c(REF,mv(REF))) is the SAD between original signal s and the coded signal c from reference frame REF. Rate-distortion optimized mode selection is performed by minimizing J( c, MODE QP, λmode) = D( c, MODE QP) (4) + λmoder( c, MODE QP) where the distortion D(c,MODE QP) is measured as the sum of squared errors between the original block s and the coded block c, and QP is the quantization parameter. The relation between the Lagrangian multipliers λ MOTION, λ MODE and the quantization parameter QP, is given by: 6 λ 2 QP / MOTION = λmode = m (5) where m is a constant. The optimal motion vector and mode selection is dependant on the quantization parameter QP. The optimal mode for inter prediction is obtained by computing all the Lagrangian costs (1) for all possible subblock partitions. Moreover, the H.264/AVC encoder uses Intra_16 16 and Intra_4 4 prediction modes for intraframes. In these mode the reconstructed pixels in the adjacent blocks coded previously are used to predict the content of the macroblock. The Intra_16 16 has four mode whereas the Intra_4 4 prediction has nine modes for each 4 4 subblock. This results in a total of 144 cost checkings for each intra MB mode decision. H.264 inter-prediction uses Tree Structured Motion Compensation (TSMC) [15]. TSMC supports block sizes from 16x16 to 4x4 with the facility of dividing each block into sub-blocks. The lowest sub-block size can be 4x4. In inter-prediction, all the eight possible MB partitions and the SKIP mode are checked to select the optimal encoding mode. Usually, small partition sizes are selected for detailed regions whereas large partition sizes perform reasonably well for homogeneous regions. The actual distortion and bit consumption calculations for all candidate modes greatly increase the encoding time. Thu using RDO to select the best coding mode for each MB is the most computationally intensive process in H.264. A. Skip Prediction based Fast RDO The H.264/AVC JM encoder [1] identifies certain MBs as skipped during encoding. The encoding of these MBs are skipped by the encoder. In the decoder, the skipped MB is reconstructed by motion-compensated prediction from the current reference picture using a motion vector predicted from previously decoded motion vectors. The SKIP mode has the lowest RD cost (J) among all MB encoding modes. Moreover, checking of SKIP mode involves the lowest complexity. Thu a SKIP decision at the start of the mode decision process can substantially lower the entire encoder complexity. However, incorrect skip decisions may increase the bit rate and may also result in a loss of picture quality.

3 B. Restricted Reference Frame based Fast RDO In H.264, multiple reference frames are used for inter motion estimation purposes. The Lagrangian minimization based RD cost function is calculated for all reference frames. An increase in the number of reference frames also increases the computational cost of RDO. A number of reasons have been cited [16] to justify the use of more number of reference frames to obtain better predictions. However, strong correlations among the motion vectors of successive frames lead to the intuition that, the full RD-optimized mode decision search for all reference frames need not be performed and a decision can be taken based on an initial estimate of the best reference frame. MB A Frame (n 1) MB B Frame (n 1) Current MB X Frame n MB C Frame (n 1) III. PROPOSED APPROACHES TOWARD FAST RDO The RD optimized motion estimation process in H.264/AVC encoder calculates the distortion costs and the bit costs associated with each of the possible MB modes. This mode decision step consumes considerable portion of the entire encoder execution time in encoding process. The proposed schemes delineated in the following subsections intend to improve the computational complexity of the RDO process without incurring noticeable drops in reconstructed video quality. The first two methods employ SKIP prediction, whereas the last one expedites motion estimation by reducing the number of reference frames. A. RHO-domain Skip Prediction Rate-distortion optimized fast ME requires computation of the RD cost J for all modes of an MB. In order to calculate the rate and distortion of an MB mode, the DCT, quantization, inverse transform and inverse quantization operations need to be performed on the MB. The rate information is derived from the quantized transform coefficients. These operations contribute to the high computational complexity of the RDO process. However, low complexity DCT and quantization calculation may substantially bring down the RDO execution time. Reduced complexity DCT and quantization can be performed using the rate and distortion models. Information-theoretic analysis shows [22] that, a generalized Gaussian can best approximate the distribution of transformed AC coefficients of a natural image. Based on this assumption, [13] proposed a rate model called the ρ-domain model. In ρ- domain, the rate is expressed as a function of the number of zeroes among the transformed coefficients. If ρ is the percentage of zeroes among the quantized transform coefficient then it can be shown [17] that, ρ monotonically increases with the quantization step size q. This implies that there is a one-to-one mapping between ρ and q. Hence, mathematically, the coding bit rate R, which is a function [18] of the quantization step-size q, must also be a function of ρ, denoted by R(ρ). In encoder, the transform coding coefficients are quantized and then entropy encoded. So, higher the number of zeros in quantized transform coefficient the lower is the bit rate. Thu the rate calculation in ρ-domain can be expressed a R ( ρ ) = (1 ρ) θ (6) Fig. 2 Prediction from neighboring macroblocks in the previous frame It is to be observed that, when ρ tends to 1, R tends to zero. This essentially implies the SKIP mode, since in this mode no motion vector or distortion information is encoded. The decoder predicts the motion vector from the MVs of the previously decoded neighboring macroblocks. This observation provides the motivation to predict the SKIP mode from the Sum-of-Absolute-Transformed-Difference (SATD). If the number of zeroes in the SATD of a block is above a pre-defined threshold, a SKIP mode is predicted for the MB. The threshold value is determined empirically. Proper choice of the threshold parameter keeps the PSNR drop within acceptable limits. The proposed method predicts the SKIP mode based on a single SATD calculation and avoids the mode decision calculations for the remaining modes. Thu the SKIP prediction saves a large amount of computation. Especially for slow moving video where the percentage of SKIP mode MB is particularly high, the proposed method results in substantial improvement of the motion estimation complexity. B. Spatio-Temporal Skip Prediction In video frame the collocated macroblocks are highly correlated due to spatial homogeneity in the neighboring regions. Fig. 2 shows the neighboring MBs 'A', 'B' and 'C' in the (n-1) th frame, of the current macroblock 'X' in the n th frame. It has been shown [19] that, modes of macroblocks A, B and C in previous frame, have a high correlation with the best mode for the current macroblock. Moreover, [20] shows that the rate-distortion cost function (J) between neighboring blocks is highly correlated. These observations provide the necessary motivation to predict the SKIP mode of an MB from the spatio-temporal characteristics of neighboring MBs in the current and the previous frames. First, the proposed algorithm checks the mode selected for the MBs A, B and C, in the previous frame. If majority of them (can be parameterized as at least 2 or all 3) have SKIP as the best mode, the algorithm moves to the next step. It then finds out the SATD cost for SKIP mode of all the three MBs - A, B and C and also finds out the minimum SKIP cost among these three MBs. If the minimum SKIP cost is less than an empirically determined threshold value, then the

4 mode for current MB in the current frame is predicted as SKIP. The SATD costs of all MBs in the current frame are calculated and stored as reference for processing of the next frame. Let, SKIP(MB i,n ) = 1, if the best mode from the i th MB in the n th frame is SKIP = 0, otherwise Using the above notation, the spatio-temporal SKIP prediction algorithm can be briefly described as follows: SKIP(MB i,n ) = 1 if (i) Σ SKIP(MB m,n-1 ) = 3 (or >= 2, as per parameter), where the sum is over m; and m Є N i,n-1 and (ii) min {SATD(MB m,n-1 )} < T, where the min is over m and m Є N i,n-1 and T is a threshold. SATD(MB i,n ) is the SATD cost value calculated for SKIP mode for i th MB in frame n. Moreover, N i,n = {m: m denotes the MB number for neighboring MB A, B or C for i th MB in frame n.} Since the MBs in the first column do not have Neighboring MB A, the algorithm is disabled for all the MBs in first column of the frame. Similarly, MBs in the first row do not have Neighboring MB B and C ; the algorithm is disabled for all the MBs in the first row of the frame. This enables the algorithm to start its prediction from optimally found best mode, rather than predicted best mode. The algorithm is also not used in case of last column (MB C is absent). The only drawback of this approach is its positive error accumulation. An incorrectly predicted SKIP mode for an MB in the n th frame has a positive bias on the SKIP prediction of the (n+1) th frame. Hence, the error gets accumulated over frames and the PSNR drops. To overcome this drawback, the algorithm is switched off for every n th frame. Experimentally, a value of n=5 is found to perform well for fast moving video streams. C. Restricted Reference Frame In H.264 motion estimation, the encoder searches for MVs in multiple reference frames (max 16, as per JM10.2 reference implementation) to obtain the best mode decision. For each reference frame, the motion estimation algorithm is executed for all possible modes of an MB. This essentially increases the motion estimation time in multiples of the number of reference frames used. Our proposed algorithm selects a single reference frame, based on an initial estimate of the Rate-Distortion (RD) cost for the reference frame. Then the selected reference frame is used to find the best mode for the MB. By the principle of spatial correlation, it can be said that, neighboring MBs tend to have similar motion vector and mode decision. Hence, for similar reason it can be posited that the neighboring MBs should tend to use the same reference frame. Based on this spatio-temporal correlation, a predicted motion vector (which is also used for SKIP cost calculation) is used as the center of the motion vector search region. This idea has been extended to predict the minimum RD cost for the best mode of the MB, for a particular reference frame. SKIP RD Cost of an MB is defined as the RD Cost, assuming SKIP mode for the MB, for a particular reference frame. During SKIP mode, a motion vector predictor is used, which predicts the Motion Vector for the MB based on the MVs of the neighboring macroblocks. The simplest predictor used is the median of motion vectors of the neighboring macroblocks. For a moving object in a video frame, if the particular MB is part of the same object as its neighboring MB the motion vector and reference frame for the current MB will be similar to its neighboring ones. Since in SKIP mode, the motion vector predicted from neighboring MBs is used, this SKIP RD cost for this MB will be minimum among all reference frames. However, if MB is not a part of the same object in motion, the SKIP RD cost will be high. So, the reference frame, which gives minimum SKIP RD Cost, will also give the minimum RD cost among all modes. In the proposed approach, the SKIP RD cost of the MB is calculated first for all the reference frames. The reference frame having the minimal SKIP RD Cost is predicted as the best reference frame and the full motion estimation and mode decision is carried out only on this reference frame. Experimental results show a substantial improvement in motion estimation time, with PSNR drop within acceptable limits. This is particularly effective for fast moving video where the speed-up due to previously proposed SKIP prediction based algorithm is low, since fast moving videos have a less percentage of SKIP modes. IV. RESULTS Our experiments were performed on the JM 10.2 reference implementation [21] of H [1]. The code was compiled using MS Visual C++.Net on Windows XP (SP2) platform. The results for performance analysis were collected for different bitrates by varying the Quantization Parameter (QP) from 10 to 30, in steps of 2. The simulations have been performed on the luminance component of the popular video sequences listed in Table 1. These sequences consist of different degrees and types of motion and are in QCIF ( ), CIF ( ), SIF ( ) and CCIR601 ( ) formats. The first two sequence namely, Container and Foreman, are in QCIF format. The next two sequences are Stefan in CIF format and Football in SIF format. Tennis and Garden is in CCIR601 format. Among these sequence Container has gentle, smooth and low motion change and consists mainly of stationary and quasistationary blocks. Foreman has moderately complex motion and hence is categorized as "medium" motion content sequences. Rigorous motion based on camera panning with translation and complex motion content can be found in the sequences Stefan, Football, Tennis and Garden. Image sequences are always IPPPPP and no B frames were used. 2 Encoder parameter configuration: Profile 100, Level 40, Period of I- Frames = 10, Quantization parameter for I and P Slices (0-51) = 10 to 30 in steps of 2, No frames skipped, Subpixel motion estimation disabled, Hadamard enabled, CABAC entropy coding enabled, ±16 search range (no search range restriction), Number of previous references frame = 5, All MB size InterSearch enabled, No B-frame used, SP-Picture Periodicity disabled, Entropy coding method = CABAC, RD-optimized mode decision enabled

5 Table 1 Test sequences used in analysis Name Format Frames Motion Container QCIF (176x144) 300 Low Foreman CIF (352x288) 300 Medium Stefan CIF (352x288) 89 High Football SIF (352x240) 124 High Tennis ITU CCIR High (720x480) Garden ITU CCIR601 (720x486) 199 High The ρ-domain (RHO), the spatio-temporal (SPT5.3), the reduced reference frame (RRF) approaches and their combination (COMB) have been compared with the baseline (ORG) JM 10.2 reference encoder. The comparisons have been made in terms of three metric namely, (a) PSNR, (b) Bitrate, and (c) SpeedUp Factor (SUF). The PSNR drop and the bitrate give an idea of the loss in prediction quality and the loss in compression efficiency. The speed up factor denotes the reduction in computational complexity. Table 2 presents the results of the experiments. A. ρ-domain SKIP Prediction Results The ρ-domain results have been presented in Table 2. As can be seen, the maximum PSNR drop obtained by the ρ-domain is only about 0.05 db. This shows that the ρ-domain zero count is a good approximation for SKIP prediction. The ρ-domain results are particularly encouraging at low bitrate where the prediction quality obtained by ρ-domain SKIP prediction is even better than the reference encoder implementation of H.264, for the sequence Container. For the fast moving sequence Stefan, ρ-domain SKIP prediction results in no loss in prediction quality. In addition, it is to be noted that, for both Foreman and Stefan, the ρ-domain prediction results in increased compression. Thu at low bitrate Stefan has no loss in prediction quality, better compression and increased speedup. Similar improvements can be noticed for other test cases as well. The highest ρ-domain PSNR drop can be observed for Tennis and Garden. However, even the highest ρ-domain PSNR loss is less than 0.1 db. The average of the ρ-domain results taken over all test sequences at both low and high bitrates show a loss in prediction quality of only about db. This quality loss is accompanied by an improvement in compression efficiency. B. Spatio-temporal SKIP Prediction Results Table 2 also shows the performance comparison of SPT5.3 with ORG, RHO, RRF and COMB schemes. The main advantage of SPT5.3 over other approaches is its increased compression efficiency. In most case the coding efficiency of SPT5.3 is very close to that of the RHO approach. Although the loss in prediction quality is more than RHO, this loss is well within the MPEG limit of 0.5dB. SPT5.3 intends to enhance the performance of RHO and hence it predicts SKIP modes from spatially and temporally neighboring MBs. However, this prediction is made over and above the SKIP predictions of the ρ-domain approach. This accounts for the increased PSNR drop of SPT5.3 as compared to the RHO approach. However, reducing the number of frames in the resetting interval brings down this PSNR drop. The average loss in prediction quality for the SPT5.3 scheme is the highest among all the three proposed strategies. However, it is to be noted that the average compression efficiency of SPT5.3 is also highest among the proposed approaches. This increased compression is accompanied by higher speedups as compared to the averaged ρ-domain results. C. Restricted Reference Frame Results The Restricted Reference Frame (RRF) approach has been compared with ORG, RHO and SPT5.3 in Table 2. From the tabulated result it can be concluded that the RRF approach is extremely advantageous in terms of speedup. For most test sequence the RRF approach reduces the computational complexity by 5-8 times with marginal loss in prediction quality. The only disadvantage is its increased bitrate. This is understandable since the major motivation [16] behind the use of more than one reference frames is the strong correlation between the motion vectors in multiple reference frame which results in better prediction quality and hence higher compression efficiency. Hence, low number of reference frames accounts for the increase in bitcount. In terms of prediction quality, the RRF performs better at high bitrates. The average RRF results have the highest speedup with acceptable quality loss. The drawback of reducing the number of reference frames is an increase in the number of bits. An accurate estimate of the reference frame reduces this increase in bit information. D. Combined Results All the three aforementioned SKIP prediction schemes perform better in one aspect or the other. The ρ-domain SKIP prediction has the best prediction quality. Spatio-temporal SKIP prediction results demonstrate the highest compression efficiency. And finally, the restricted reference frame based SKIP prediction leads to the best coding efficiency. This observation provides the motivation to combine these schemes and observe the combined results. For all the test sequence the combined results have the highest speedup, the highest PSNR drop and the lowest compression efficiency. However, it is to be noted that the increase in speedup is substantial, as compared to the baseline reference (JM10.2) implementation. Moreover, this increased coding efficiency comes at an expense of negligible loss in quality and marginal increase in bitrate. As already mentioned, the maximum quality loss is well within the MPEG tolerance limit of 0.5 db. Moreover, at low bitrate the increase in bit count is very small. Thu the combination of the three proposed approaches has much better performance than the reference JM implementation, particularly at low bitrates.

6 Table 2 PSNR, Bit rate and SUF comparison of the proposed approaches Input Parameters Method PSNR Δ PSNR Bitrate Δ Bitrate SUF (in db) (in db) (in kbps) (in kbps) ORG ±16 RHO fps SPT QP RRF Container COMB QCIF ORG ±16 RHO fps SPT QP RRF COMB ORG ±16 RHO fps SPT QP RRF Foreman COMB CIF ORG ±16 RHO fps SPT QP RRF COMB ORG ±16 RHO fps SPT QP RRF Stefan COMB CIF ORG ±16 RHO fps SPT QP RRF COMB ORG ±16 RHO fps SPT QP RRF Football COMB SIF ORG ±16 RHO fps SPT QP RRF COMB ORG ±16 RHO fps SPT QP RRF Tennis COMB ITU CCIR601 ORG ±16 RHO fps SPT QP RRF COMB ORG ±16 RHO fps SPT QP RRF Garden COMB ITU CCIR601 ORG ±16 RHO fps SPT QP RRF COMB Avg. RHO Avg. SPT Avg. RRF Avg. COMB

7 As can be seen in Table 2, the average of the COMB results has the highest drop in prediction quality. The combined effects of SPT5.3 and RRF have resulted in an overall PSNR loss of 0.26 db, which is well within the MPEG limit of 0.5 db. The average speedup achieved in about 6 times the baseline implementation. Moreover, the average increase in bit information for COMB is less as compared to the average RRF results. Figs. 3-6 show the Rate-Distortion curves of the proposed scheme tested on the aforementioned test sequences. In most case the RD-curve of ρ-domain almost coincides with the ORG RD-curve. This shows that the ρ- domain zero count is a good approximation for SKIP prediction. For the sequence Container, the RD-curves at low bitrates are extremely close to one another. Hence, for Container, the proposed approaches perform extremely well at low bitrates. Similar conclusions can be drawn from the RD-curves of Foreman. Except for the SPT5.3 scheme, which has a particularly high PSNR drop at low bitrate the other RD-curves faithfully follow the baseline (ORG) RD curve. As can be seen, the RD-curves of both Container and Foreman converge at low bitrates. However, in case of Stefan, the RD-curves demonstrate more or less uniform performance over the entire bit-range. The PSNR drops are identical at both low and high bitrate. In Football, the RD-curves of ORG, RHO and SPT5.3 are super-posed onto a single curve, whereas the RD-curves of RRF and COMB super-pose onto another curve. Similar to Stefan, the RD-curves for Foreman also exhibit identical PSNR drop for both low and high bit-rates. V. CONCLUSION This paper has presented new schemes for SKIP prediction based fast motion estimation with applications in H.264. Three different approaches were proposed. The first approach utilizes the ρ-domain rate control model for counting the number of zeroes in the sum-of-absolute transformed-differences (SATD). SKIP MB mode was predicted based on this zero count. Spatial and temporal correlation among the collocated MBs in the same frame as well as the previous frame forms the basis of the next approach. Finally, the multiple numbers of reference frame on which the mode search is performed, were reduced based on an initial low-complexity matching cost. It was observed that, each of the proposed approaches individually exhibited the best performance in terms of either prediction quality or compression efficiency or coding efficiency. Subsequently, all three approaches were merged to generate the combined results. The experimental results on standard test sequences demonstrate substantially high speedup with marginal loss in bitrate and prediction quality. REFERENCES [1] T.Wiegand, G. Sullivan, G. Bjontegaard, and A. Luthra, Overview of the H.264/AVC video coding standard, IEEE Trans. Circuits Syst. Video Technol., July [2] J. Ostermann, J. Borman P. List, D. Marpe, M. Narroschke, F. Pereira, T. Stockhammer, and T. Wedi, Video coding with H.264/AVC: tool performance and complexity, IEEE Circuits Syst. Mag., vol. 4, no. 1, pp. 7 28, Jan [3] A. Ortega and K. Ramchandran, Rate-distortion methods for image and video compression, IEEE Sig. Pro. Mag., no. 1, pp , Jan [4] M.Yang and W. Wang, Fast macroblock mode selection based on motion content classification in H.264/AVC, in Proc. IEEE Int. Conf. Img. Proc. (ICIP04), no. II, pp , [5] A. Tanizawa, S. Koto, T. Chujoh and Y. Kikuchi, A study on fast rate-distortion optimized coding mode decision for H.264, in Proc. of IEEE Int. Conf. Img. Proc. (ICIP04), pp , [6] Q. Dui, D. Zhu and R. Ding, Fast Mode Decision For Inter Prediction in H.264, in Proc. of IEEE Int. Conf. Img. Proc. (ICIP04), pp , [7] F. Pan, X. Lin, R. Susanto, K.P. Lim, Z.G. Li, G.N. Feng, D.J. Wu, and S. Wu, Fast mode decision for intra prediction, in Joint Video Team (JVT) JVT-G013, Mar [8] K.P. Lim, S. Wu, D.J. Wu, S. Rahardja, X. Lin, F. Pan and Z.G. Li, Fast inter mode selection, in Joint Video Team JVT-I020, Sep [9] T.Y. Kuo and C.H. Chan, Fast Macroblock Partition Prediction for H.264/AVC, in Proc. IEEE Int. Conf. Multimedia and Expo (ICME04), no. I, pp , June [10] Y.H. Kim, J.W. Yoo, S.W. Lee, J. Shin, J. Paik and H.K. Jung, Adaptive mode decision for H.264 encoder, Electron. Lett., vol. 40, no. 19, Sep [11] P. Yin, H.Y. Cheong, A.M. Tourapis and J. Boyce, Fast mode decision and motion estimation for JVT/H.264, in Proc. Int. Conf. Imag. Proc. (ICIP03), vol. 3, pp , [12] Y.K. Tu, J.F. Yang, M.T. Sun and Y. Tsai, Fast variable-size block motion estimation for efficient H.264/AVC encoding, Signal Processing: Image Comm., vol. 20, no. 7, pp , Aug [13] Z. He and S.K. Mitra, A linear source model and a unified rate control algorithm for DCT video coding, IEEE Trans. Circuits Syst. Video Technol., vol. 12, no. 11, pp , [14] H. Kim and Y. Altunbasak, Low-complexity macroblock mode selection for H.264-AVC encoder in Proc. of IEEE Intl. Conf. Img. Proc. (ICIP04), pp , [15] T. Wiegand, M. Flierl and B. Girod, Entropy-constrained design of quadrate video coding scheme in Proc. of the 6th Intl. Conf. on Img. Proc. and its Apps., vol. 1, pp , [16] Y. Su and M.T. Sun, Fast multiple reference frame motion estimation for H.264, in Proc. of IEEE Int. Conf. Mult. Expo. (ICME03), pp , [17] Z. He and S.K. Mitra, ρ-domain source modeling and rate control for video coding and transmission, in Proc of the IEEE Intl. Conf. on Acous. Spch. Sig. Pro (ICASSP01), vol. 3, pp , [18] J.R. Corbera and S. Lei, Rate control in DCT video coding for lowdelay communication IEEE Trans. Circuits Syst. Video Tech., vol. 9, pp , [19] R. Arminato, R. Schafer, F. Kitson and V. Bhaskaran, Linear predictive coding of motion vector in Proc. IS&T SPIE EI [20] Y.V. Ivanov and C.J. Bleakley, Skip Prediction and Early Termination for Fast Mode Decision in H.264/AVC, in Proc. of International Conference on Digital Telecomm. (ICDT06), [21] H.264 JVTModel JM10.2, [22] E.Y. Lam and J.W. Goodman, A mathematical analysis of the DCT coefficient distributions for image IEEE Trans. Circuits Syst. Video Technol., vol. 9, no. 10, pp , ACKNOWLEDGMENT We would like to thank Dr. L. M. Po of Dept. of Electronic Engineering, City University of Hong Kong, for supplying us with the fast moving sequences in ITU CCIR601 format.

8 52 50 ORG RHO SPT5.3 RRF COMB PSNR (in db) Bitrate (in kbps) Fig. 3 RD Curve for QCIF Container ORG RHO SPT5.3 RRF COMB PSNR (in db) Bitrate (in kbps) Fig. 4 RD Curve for CIF Foreman

9 50 48 ORG RHO SPT5.3 RRF COMB PSNR (in db) Bitrate (in kbps) Fig. 5 RD Curve for CIF Stefan ORG RHO SPT5.3 RRF COMB 44 PSNR (in db) Bitrate (in kbps) Fig. 6 RD Curve for SIF Football

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264 Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264 Ju-Heon Seo, Sang-Mi Kim, Jong-Ki Han, Nonmember Abstract-- In the H.264, MBAFF (Macroblock adaptive frame/field) and PAFF (Picture

More information

Selective Intra Prediction Mode Decision for H.264/AVC Encoders

Selective Intra Prediction Mode Decision for H.264/AVC Encoders Selective Intra Prediction Mode Decision for H.264/AVC Encoders Jun Sung Park, and Hyo Jung Song Abstract H.264/AVC offers a considerably higher improvement in coding efficiency compared to other compression

More information

FAST SPATIAL AND TEMPORAL CORRELATION-BASED REFERENCE PICTURE SELECTION

FAST SPATIAL AND TEMPORAL CORRELATION-BASED REFERENCE PICTURE SELECTION FAST SPATIAL AND TEMPORAL CORRELATION-BASED REFERENCE PICTURE SELECTION 1 YONGTAE KIM, 2 JAE-GON KIM, and 3 HAECHUL CHOI 1, 3 Hanbat National University, Department of Multimedia Engineering 2 Korea Aerospace

More information

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions 1128 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 11, NO. 10, OCTOBER 2001 An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions Kwok-Wai Wong, Kin-Man Lam,

More information

Fast Mode Decision Algorithm for Intra prediction in H.264/AVC Video Coding

Fast Mode Decision Algorithm for Intra prediction in H.264/AVC Video Coding 356 IJCSNS International Journal of Computer Science and Network Security, VOL.7 No.1, January 27 Fast Mode Decision Algorithm for Intra prediction in H.264/AVC Video Coding Abderrahmane Elyousfi 12, Ahmed

More information

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005.

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005. Wang, D., Canagarajah, CN., & Bull, DR. (2005). S frame design for multiple description video coding. In IEEE International Symposium on Circuits and Systems (ISCAS) Kobe, Japan (Vol. 3, pp. 19 - ). Institute

More information

SCALABLE video coding (SVC) is currently being developed

SCALABLE video coding (SVC) is currently being developed IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 7, JULY 2006 889 Fast Mode Decision Algorithm for Inter-Frame Coding in Fully Scalable Video Coding He Li, Z. G. Li, Senior

More information

The H.26L Video Coding Project

The H.26L Video Coding Project The H.26L Video Coding Project New ITU-T Q.6/SG16 (VCEG - Video Coding Experts Group) standardization activity for video compression August 1999: 1 st test model (TML-1) December 2001: 10 th test model

More information

WITH the rapid development of high-fidelity video services

WITH the rapid development of high-fidelity video services 896 IEEE SIGNAL PROCESSING LETTERS, VOL. 22, NO. 7, JULY 2015 An Efficient Frame-Content Based Intra Frame Rate Control for High Efficiency Video Coding Miaohui Wang, Student Member, IEEE, KingNgiNgan,

More information

Reduced complexity MPEG2 video post-processing for HD display

Reduced complexity MPEG2 video post-processing for HD display Downloaded from orbit.dtu.dk on: Dec 17, 2017 Reduced complexity MPEG2 video post-processing for HD display Virk, Kamran; Li, Huiying; Forchhammer, Søren Published in: IEEE International Conference on

More information

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks July 22 nd 2008 Vineeth Shetty Kolkeri EE Graduate,UTA 1 Outline 2. Introduction 3. Error control

More information

Project Proposal: Sub pixel motion estimation for side information generation in Wyner- Ziv decoder.

Project Proposal: Sub pixel motion estimation for side information generation in Wyner- Ziv decoder. EE 5359 MULTIMEDIA PROCESSING Subrahmanya Maira Venkatrav 1000615952 Project Proposal: Sub pixel motion estimation for side information generation in Wyner- Ziv decoder. Wyner-Ziv(WZ) encoder is a low

More information

Chapter 2 Introduction to

Chapter 2 Introduction to Chapter 2 Introduction to H.264/AVC H.264/AVC [1] is the newest video coding standard of the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture Experts Group (MPEG). The main improvements

More information

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard Ram Narayan Dubey Masters in Communication Systems Dept of ECE, IIT-R, India Varun Gunnala Masters in Communication Systems Dept

More information

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Mohamed Hassan, Taha Landolsi, Husameldin Mukhtar, and Tamer Shanableh College of Engineering American

More information

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS Susanna Spinsante, Ennio Gambi, Franco Chiaraluce Dipartimento di Elettronica, Intelligenza artificiale e

More information

Dual Frame Video Encoding with Feedback

Dual Frame Video Encoding with Feedback Video Encoding with Feedback Athanasios Leontaris and Pamela C. Cosman Department of Electrical and Computer Engineering University of California, San Diego, La Jolla, CA 92093-0407 Email: pcosman,aleontar

More information

Adaptive Key Frame Selection for Efficient Video Coding

Adaptive Key Frame Selection for Efficient Video Coding Adaptive Key Frame Selection for Efficient Video Coding Jaebum Jun, Sunyoung Lee, Zanming He, Myungjung Lee, and Euee S. Jang Digital Media Lab., Hanyang University 17 Haengdang-dong, Seongdong-gu, Seoul,

More information

A Fast Intra Skip Detection Algorithm for H.264/AVC Video Encoding

A Fast Intra Skip Detection Algorithm for H.264/AVC Video Encoding A Fast ntra Skip Detection Algorithm for H264/AVC Video Encoding Byung-Gyu im, ong-ho im, and Chang-Sik Cho A fast intra skip detection algorithm based on the ratedistortion (RD) cost for an inter frame

More information

WITH the demand of higher video quality, lower bit

WITH the demand of higher video quality, lower bit IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 16, NO. 8, AUGUST 2006 917 A High-Definition H.264/AVC Intra-Frame Codec IP for Digital Video and Still Camera Applications Chun-Wei

More information

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur Module 8 VIDEO CODING STANDARDS Lesson 27 H.264 standard Lesson Objectives At the end of this lesson, the students should be able to: 1. State the broad objectives of the H.264 standard. 2. List the improved

More information

Video coding standards

Video coding standards Video coding standards Video signals represent sequences of images or frames which can be transmitted with a rate from 5 to 60 frames per second (fps), that provides the illusion of motion in the displayed

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Recommendation ITU-T H.261 Fernando Pereira The objective of this lab session about Recommendation ITU-T H.261 is to get the students familiar with many aspects

More information

Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding. Abstract. I. Introduction

Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding. Abstract. I. Introduction Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding Jun Xin, Ming-Ting Sun*, and Kangwook Chun** *Department of Electrical Engineering, University of Washington **Samsung Electronics Co.

More information

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences Comparative Study of and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences Pankaj Topiwala 1 FastVDO, LLC, Columbia, MD 210 ABSTRACT This paper reports the rate-distortion performance comparison

More information

Robust 3-D Video System Based on Modified Prediction Coding and Adaptive Selection Mode Error Concealment Algorithm

Robust 3-D Video System Based on Modified Prediction Coding and Adaptive Selection Mode Error Concealment Algorithm International Journal of Signal Processing Systems Vol. 2, No. 2, December 2014 Robust 3-D Video System Based on Modified Prediction Coding and Adaptive Selection Mode Error Concealment Algorithm Walid

More information

Visual Communication at Limited Colour Display Capability

Visual Communication at Limited Colour Display Capability Visual Communication at Limited Colour Display Capability Yan Lu, Wen Gao and Feng Wu Abstract: A novel scheme for visual communication by means of mobile devices with limited colour display capability

More information

Error Concealment for SNR Scalable Video Coding

Error Concealment for SNR Scalable Video Coding Error Concealment for SNR Scalable Video Coding M. M. Ghandi and M. Ghanbari University of Essex, Wivenhoe Park, Colchester, UK, CO4 3SQ. Emails: (mahdi,ghan)@essex.ac.uk Abstract This paper proposes an

More information

Rate-Distortion Analysis for H.264/AVC Video Coding and its Application to Rate Control

Rate-Distortion Analysis for H.264/AVC Video Coding and its Application to Rate Control IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 12, DECEMBER 2005 1533 Rate-Distortion Analysis for H.264/AVC Video Coding and its Application to Rate Control Siwei Ma, Student

More information

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes Digital Signal and Image Processing Lab Simone Milani Ph.D. student simone.milani@dei.unipd.it, Summer School

More information

Principles of Video Compression

Principles of Video Compression Principles of Video Compression Topics today Introduction Temporal Redundancy Reduction Coding for Video Conferencing (H.261, H.263) (CSIT 410) 2 Introduction Reduce video bit rates while maintaining an

More information

ARTICLE IN PRESS. Signal Processing: Image Communication

ARTICLE IN PRESS. Signal Processing: Image Communication Signal Processing: Image Communication 23 (2008) 677 691 Contents lists available at ScienceDirect Signal Processing: Image Communication journal homepage: www.elsevier.com/locate/image H.264/AVC-based

More information

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards COMP 9 Advanced Distributed Systems Multimedia Networking Video Compression Standards Kevin Jeffay Department of Computer Science University of North Carolina at Chapel Hill jeffay@cs.unc.edu September,

More information

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder.

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder. Video Streaming Based on Frame Skipping and Interpolation Techniques Fadlallah Ali Fadlallah Department of Computer Science Sudan University of Science and Technology Khartoum-SUDAN fadali@sustech.edu

More information

Key Techniques of Bit Rate Reduction for H.264 Streams

Key Techniques of Bit Rate Reduction for H.264 Streams Key Techniques of Bit Rate Reduction for H.264 Streams Peng Zhang, Qing-Ming Huang, and Wen Gao Institute of Computing Technology, Chinese Academy of Science, Beijing, 100080, China {peng.zhang, qmhuang,

More information

Highly Efficient Video Codec for Entertainment-Quality

Highly Efficient Video Codec for Entertainment-Quality Highly Efficient Video Codec for Entertainment-Quality Seyoon Jeong, Sung-Chang Lim, Hahyun Lee, Jongho Kim, Jin Soo Choi, and Haechul Choi We present a novel video codec for supporting entertainment-quality

More information

Analysis of Packet Loss for Compressed Video: Does Burst-Length Matter?

Analysis of Packet Loss for Compressed Video: Does Burst-Length Matter? Analysis of Packet Loss for Compressed Video: Does Burst-Length Matter? Yi J. Liang 1, John G. Apostolopoulos, Bernd Girod 1 Mobile and Media Systems Laboratory HP Laboratories Palo Alto HPL-22-331 November

More information

Overview: Video Coding Standards

Overview: Video Coding Standards Overview: Video Coding Standards Video coding standards: applications and common structure ITU-T Rec. H.261 ISO/IEC MPEG-1 ISO/IEC MPEG-2 State-of-the-art: H.264/AVC Video Coding Standards no. 1 Applications

More information

WE CONSIDER an enhancement technique for degraded

WE CONSIDER an enhancement technique for degraded 1140 IEEE SIGNAL PROCESSING LETTERS, VOL. 21, NO. 9, SEPTEMBER 2014 Example-based Enhancement of Degraded Video Edson M. Hung, Member, IEEE, Diogo C. Garcia, Member, IEEE, and Ricardo L. de Queiroz, Senior

More information

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding Min Wu, Anthony Vetro, Jonathan Yedidia, Huifang Sun, Chang Wen

More information

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Motion Compensation Techniques Adopted In HEVC

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Motion Compensation Techniques Adopted In HEVC Motion Compensation Techniques Adopted In HEVC S.Mahesh 1, K.Balavani 2 M.Tech student in Bapatla Engineering College, Bapatla, Andahra Pradesh Assistant professor in Bapatla Engineering College, Bapatla,

More information

Bit Rate Control for Video Transmission Over Wireless Networks

Bit Rate Control for Video Transmission Over Wireless Networks Indian Journal of Science and Technology, Vol 9(S), DOI: 0.75/ijst/06/v9iS/05, December 06 ISSN (Print) : 097-686 ISSN (Online) : 097-5 Bit Rate Control for Video Transmission Over Wireless Networks K.

More information

ROBUST REGION-OF-INTEREST SCALABLE CODING WITH LEAKY PREDICTION IN H.264/AVC. Qian Chen, Li Song, Xiaokang Yang, Wenjun Zhang

ROBUST REGION-OF-INTEREST SCALABLE CODING WITH LEAKY PREDICTION IN H.264/AVC. Qian Chen, Li Song, Xiaokang Yang, Wenjun Zhang ROBUST REGION-OF-INTEREST SCALABLE CODING WITH LEAKY PREDICTION IN H.264/AVC Qian Chen, Li Song, Xiaokang Yang, Wenjun Zhang Institute of Image Communication & Information Processing Shanghai Jiao Tong

More information

Constant Bit Rate for Video Streaming Over Packet Switching Networks

Constant Bit Rate for Video Streaming Over Packet Switching Networks International OPEN ACCESS Journal Of Modern Engineering Research (IJMER) Constant Bit Rate for Video Streaming Over Packet Switching Networks Mr. S. P.V Subba rao 1, Y. Renuka Devi 2 Associate professor

More information

Systematic Lossy Error Protection of Video based on H.264/AVC Redundant Slices

Systematic Lossy Error Protection of Video based on H.264/AVC Redundant Slices Systematic Lossy Error Protection of based on H.264/AVC Redundant Slices Shantanu Rane and Bernd Girod Information Systems Laboratory Stanford University, Stanford, CA 94305. {srane,bgirod}@stanford.edu

More information

Error concealment techniques in H.264 video transmission over wireless networks

Error concealment techniques in H.264 video transmission over wireless networks Error concealment techniques in H.264 video transmission over wireless networks M U L T I M E D I A P R O C E S S I N G ( E E 5 3 5 9 ) S P R I N G 2 0 1 1 D R. K. R. R A O F I N A L R E P O R T Murtaza

More information

Variable Block-Size Transforms for H.264/AVC

Variable Block-Size Transforms for H.264/AVC 604 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 13, NO. 7, JULY 2003 Variable Block-Size Transforms for H.264/AVC Mathias Wien, Member, IEEE Abstract A concept for variable block-size

More information

THE NEWEST international video coding standard is

THE NEWEST international video coding standard is IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 15, NO. 7, JULY 2005 813 Fast Mode Decision Algorithm for Intraprediction in H.264/AVC Video Coding Feng Pan, Xiao Lin, Susanto Rahardja,

More information

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora MULTI-STATE VIDEO CODING WITH SIDE INFORMATION Sila Ekmekci Flierl, Thomas Sikora Technical University Berlin Institute for Telecommunications D-10587 Berlin / Germany ABSTRACT Multi-State Video Coding

More information

PACKET-SWITCHED networks have become ubiquitous

PACKET-SWITCHED networks have become ubiquitous IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 13, NO. 7, JULY 2004 885 Video Compression for Lossy Packet Networks With Mode Switching and a Dual-Frame Buffer Athanasios Leontaris, Student Member, IEEE,

More information

1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010

1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010 1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010 Delay Constrained Multiplexing of Video Streams Using Dual-Frame Video Coding Mayank Tiwari, Student Member, IEEE, Theodore Groves,

More information

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER PERCEPTUAL QUALITY OF H./AVC DEBLOCKING FILTER Y. Zhong, I. Richardson, A. Miller and Y. Zhao School of Enginnering, The Robert Gordon University, Schoolhill, Aberdeen, AB1 1FR, UK Phone: + 1, Fax: + 1,

More information

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY (Invited Paper) Anne Aaron and Bernd Girod Information Systems Laboratory Stanford University, Stanford, CA 94305 {amaaron,bgirod}@stanford.edu Abstract

More information

Error Resilient Video Coding Using Unequally Protected Key Pictures

Error Resilient Video Coding Using Unequally Protected Key Pictures Error Resilient Video Coding Using Unequally Protected Key Pictures Ye-Kui Wang 1, Miska M. Hannuksela 2, and Moncef Gabbouj 3 1 Nokia Mobile Software, Tampere, Finland 2 Nokia Research Center, Tampere,

More information

Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding

Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding Free Viewpoint Switching in Multi-view Video Streaming Using Wyner-Ziv Video Coding Xun Guo 1,, Yan Lu 2, Feng Wu 2, Wen Gao 1, 3, Shipeng Li 2 1 School of Computer Sciences, Harbin Institute of Technology,

More information

Drift Compensation for Reduced Spatial Resolution Transcoding

Drift Compensation for Reduced Spatial Resolution Transcoding MERL A MITSUBISHI ELECTRIC RESEARCH LABORATORY http://www.merl.com Drift Compensation for Reduced Spatial Resolution Transcoding Peng Yin Anthony Vetro Bede Liu Huifang Sun TR-2002-47 August 2002 Abstract

More information

Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices

Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices Shantanu Rane, Pierpaolo Baccichet and Bernd Girod Information Systems Laboratory, Department

More information

Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling

Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling International Conference on Electronic Design and Signal Processing (ICEDSP) 0 Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling Aditya Acharya Dept. of

More information

Multimedia Communications. Video compression

Multimedia Communications. Video compression Multimedia Communications Video compression Video compression Of all the different sources of data, video produces the largest amount of data There are some differences in our perception with regard to

More information

CONSTRAINING delay is critical for real-time communication

CONSTRAINING delay is critical for real-time communication 1726 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 16, NO. 7, JULY 2007 Compression Efficiency and Delay Tradeoffs for Hierarchical B-Pictures and Pulsed-Quality Frames Athanasios Leontaris, Member, IEEE,

More information

ERROR CONCEALMENT TECHNIQUES IN H.264 VIDEO TRANSMISSION OVER WIRELESS NETWORKS

ERROR CONCEALMENT TECHNIQUES IN H.264 VIDEO TRANSMISSION OVER WIRELESS NETWORKS Multimedia Processing Term project on ERROR CONCEALMENT TECHNIQUES IN H.264 VIDEO TRANSMISSION OVER WIRELESS NETWORKS Interim Report Spring 2016 Under Dr. K. R. Rao by Moiz Mustafa Zaveri (1001115920)

More information

Express Letters. A Novel Four-Step Search Algorithm for Fast Block Motion Estimation

Express Letters. A Novel Four-Step Search Algorithm for Fast Block Motion Estimation IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 6, NO. 3, JUNE 1996 313 Express Letters A Novel Four-Step Search Algorithm for Fast Block Motion Estimation Lai-Man Po and Wing-Chung

More information

Concealment of Whole-Picture Loss in Hierarchical B-Picture Scalable Video Coding Xiangyang Ji, Debin Zhao, and Wen Gao, Senior Member, IEEE

Concealment of Whole-Picture Loss in Hierarchical B-Picture Scalable Video Coding Xiangyang Ji, Debin Zhao, and Wen Gao, Senior Member, IEEE IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 11, NO. 1, JANUARY 2009 11 Concealment of Whole-Picture Loss in Hierarchical B-Picture Scalable Video Coding Xiangyang Ji, Debin Zhao, and Wen Gao, Senior Member,

More information

Color Quantization of Compressed Video Sequences. Wan-Fung Cheung, and Yuk-Hee Chan, Member, IEEE 1 CSVT

Color Quantization of Compressed Video Sequences. Wan-Fung Cheung, and Yuk-Hee Chan, Member, IEEE 1 CSVT CSVT -02-05-09 1 Color Quantization of Compressed Video Sequences Wan-Fung Cheung, and Yuk-Hee Chan, Member, IEEE 1 Abstract This paper presents a novel color quantization algorithm for compressed video

More information

OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0. General Description. Applications. Features

OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0. General Description. Applications. Features OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0 General Description Applications Features The OL_H264e core is a hardware implementation of the H.264 baseline video compression algorithm. The core

More information

Performance Comparison of JPEG2000 and H.264/AVC High Profile Intra Frame Coding on HD Video Sequences

Performance Comparison of JPEG2000 and H.264/AVC High Profile Intra Frame Coding on HD Video Sequences Performance Comparison of and H.264/AVC High Profile Intra Frame Coding on HD Video Sequences Pankaj Topiwala, Trac Tran, Wei Dai {pankaj, trac, daisy} @ fastvdo.com FastVDO, LLC, Columbia, MD 210 ABSTRACT

More information

Chapter 10 Basic Video Compression Techniques

Chapter 10 Basic Video Compression Techniques Chapter 10 Basic Video Compression Techniques 10.1 Introduction to Video compression 10.2 Video Compression with Motion Compensation 10.3 Video compression standard H.261 10.4 Video compression standard

More information

Study of AVS China Part 7 for Mobile Applications. By Jay Mehta EE 5359 Multimedia Processing Spring 2010

Study of AVS China Part 7 for Mobile Applications. By Jay Mehta EE 5359 Multimedia Processing Spring 2010 Study of AVS China Part 7 for Mobile Applications By Jay Mehta EE 5359 Multimedia Processing Spring 2010 1 Contents Parts and profiles of AVS Standard Introduction to Audio Video Standard for Mobile Applications

More information

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and Video compression principles Video: moving pictures and the terms frame and picture. one approach to compressing a video source is to apply the JPEG algorithm to each frame independently. This approach

More information

RATE-DISTORTION OPTIMISED QUANTISATION FOR HEVC USING SPATIAL JUST NOTICEABLE DISTORTION

RATE-DISTORTION OPTIMISED QUANTISATION FOR HEVC USING SPATIAL JUST NOTICEABLE DISTORTION RATE-DISTORTION OPTIMISED QUANTISATION FOR HEVC USING SPATIAL JUST NOTICEABLE DISTORTION André S. Dias 1, Mischa Siekmann 2, Sebastian Bosse 2, Heiko Schwarz 2, Detlev Marpe 2, Marta Mrak 1 1 British Broadcasting

More information

Error-Resilience Video Transcoding for Wireless Communications

Error-Resilience Video Transcoding for Wireless Communications MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Error-Resilience Video Transcoding for Wireless Communications Anthony Vetro, Jun Xin, Huifang Sun TR2005-102 August 2005 Abstract Video communication

More information

An Overview of Video Coding Algorithms

An Overview of Video Coding Algorithms An Overview of Video Coding Algorithms Prof. Ja-Ling Wu Department of Computer Science and Information Engineering National Taiwan University Video coding can be viewed as image compression with a temporal

More information

A Study on AVS-M video standard

A Study on AVS-M video standard 1 A Study on AVS-M video standard EE 5359 Sahana Devaraju University of Texas at Arlington Email:sahana.devaraju@mavs.uta.edu 2 Outline Introduction Data Structure of AVS-M AVS-M CODEC Profiles & Levels

More information

Dual frame motion compensation for a rate switching network

Dual frame motion compensation for a rate switching network Dual frame motion compensation for a rate switching network Vijay Chellappa, Pamela C. Cosman and Geoffrey M. Voelker Dept. of Electrical and Computer Engineering, Dept. of Computer Science and Engineering

More information

H.264/AVC Baseline Profile Decoder Complexity Analysis

H.264/AVC Baseline Profile Decoder Complexity Analysis 704 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 13, NO. 7, JULY 2003 H.264/AVC Baseline Profile Decoder Complexity Analysis Michael Horowitz, Anthony Joch, Faouzi Kossentini, Senior

More information

Multimedia Communications. Image and Video compression

Multimedia Communications. Image and Video compression Multimedia Communications Image and Video compression JPEG2000 JPEG2000: is based on wavelet decomposition two types of wavelet filters one similar to what discussed in Chapter 14 and the other one generates

More information

ROBUST ADAPTIVE INTRA REFRESH FOR MULTIVIEW VIDEO

ROBUST ADAPTIVE INTRA REFRESH FOR MULTIVIEW VIDEO ROBUST ADAPTIVE INTRA REFRESH FOR MULTIVIEW VIDEO Sagir Lawan1 and Abdul H. Sadka2 1and 2 Department of Electronic and Computer Engineering, Brunel University, London, UK ABSTRACT Transmission error propagation

More information

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work Introduction to Video Compression Techniques Slides courtesy of Tay Vaughan Making Multimedia Work Agenda Video Compression Overview Motivation for creating standards What do the standards specify Brief

More information

THE new video coding standard H.264/AVC [1] significantly

THE new video coding standard H.264/AVC [1] significantly 832 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, VOL. 53, NO. 9, SEPTEMBER 2006 Architecture Design of Context-Based Adaptive Variable-Length Coding for H.264/AVC Tung-Chien Chen, Yu-Wen

More information

Improved Error Concealment Using Scene Information

Improved Error Concealment Using Scene Information Improved Error Concealment Using Scene Information Ye-Kui Wang 1, Miska M. Hannuksela 2, Kerem Caglar 1, and Moncef Gabbouj 3 1 Nokia Mobile Software, Tampere, Finland 2 Nokia Research Center, Tampere,

More information

Speeding up Dirac s Entropy Coder

Speeding up Dirac s Entropy Coder Speeding up Dirac s Entropy Coder HENDRIK EECKHAUT BENJAMIN SCHRAUWEN MARK CHRISTIAENS JAN VAN CAMPENHOUT Parallel Information Systems (PARIS) Electronics and Information Systems (ELIS) Ghent University

More information

Analysis of Video Transmission over Lossy Channels

Analysis of Video Transmission over Lossy Channels 1012 IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 18, NO. 6, JUNE 2000 Analysis of Video Transmission over Lossy Channels Klaus Stuhlmüller, Niko Färber, Member, IEEE, Michael Link, and Bernd

More information

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4 Contents List of figures List of tables Preface Acknowledgements xv xxi xxiii xxiv 1 Introduction 1 References 4 2 Digital video 5 2.1 Introduction 5 2.2 Analogue television 5 2.3 Interlace 7 2.4 Picture

More information

MPEG has been established as an international standard

MPEG has been established as an international standard 1100 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 7, OCTOBER 1999 Fast Extraction of Spatially Reduced Image Sequences from MPEG-2 Compressed Video Junehwa Song, Member,

More information

Error Concealment for Dual Frame Video Coding with Uneven Quality

Error Concealment for Dual Frame Video Coding with Uneven Quality Error Concealment for Dual Frame Video Coding with Uneven Quality Vijay Chellappa, Pamela C. Cosman and Geoffrey M. Voelker University of California, San Diego, vchellap@ucsd.edu,pcosman@ucsd.edu Abstract

More information

ABSTRACT ERROR CONCEALMENT TECHNIQUES IN H.264/AVC, FOR VIDEO TRANSMISSION OVER WIRELESS NETWORK. Vineeth Shetty Kolkeri, M.S.

ABSTRACT ERROR CONCEALMENT TECHNIQUES IN H.264/AVC, FOR VIDEO TRANSMISSION OVER WIRELESS NETWORK. Vineeth Shetty Kolkeri, M.S. ABSTRACT ERROR CONCEALMENT TECHNIQUES IN H.264/AVC, FOR VIDEO TRANSMISSION OVER WIRELESS NETWORK Vineeth Shetty Kolkeri, M.S. The University of Texas at Arlington, 2008 Supervising Professor: Dr. K. R.

More information

Rate-distortion optimized mode selection method for multiple description video coding

Rate-distortion optimized mode selection method for multiple description video coding Multimed Tools Appl (2014) 72:1411 14 DOI 10.1007/s11042-013-14-8 Rate-distortion optimized mode selection method for multiple description video coding Yu-Chen Sun & Wen-Jiin Tsai Published online: 19

More information

Temporal Error Concealment Algorithm Using Adaptive Multi- Side Boundary Matching Principle

Temporal Error Concealment Algorithm Using Adaptive Multi- Side Boundary Matching Principle 184 IJCSNS International Journal of Computer Science and Network Security, VOL.8 No.12, December 2008 Temporal Error Concealment Algorithm Using Adaptive Multi- Side Boundary Matching Principle Seung-Soo

More information

The H.263+ Video Coding Standard: Complexity and Performance

The H.263+ Video Coding Standard: Complexity and Performance The H.263+ Video Coding Standard: Complexity and Performance Berna Erol (bernae@ee.ubc.ca), Michael Gallant (mikeg@ee.ubc.ca), Guy C t (guyc@ee.ubc.ca), and Faouzi Kossentini (faouzi@ee.ubc.ca) Department

More information

OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0. General Description. Applications. Features

OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0. General Description. Applications. Features OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0 General Description Applications Features The OL_H264MCLD core is a hardware implementation of the H.264 baseline video compression

More information

SCENE CHANGE ADAPTATION FOR SCALABLE VIDEO CODING

SCENE CHANGE ADAPTATION FOR SCALABLE VIDEO CODING 17th European Signal Processing Conference (EUSIPCO 2009) Glasgow, Scotland, August 24-28, 2009 SCENE CHANGE ADAPTATION FOR SCALABLE VIDEO CODING Tea Anselmo, Daniele Alfonso Advanced System Technology

More information

Video Over Mobile Networks

Video Over Mobile Networks Video Over Mobile Networks Professor Mohammed Ghanbari Department of Electronic systems Engineering University of Essex United Kingdom June 2005, Zadar, Croatia (Slides prepared by M. Mahdi Ghandi) INTRODUCTION

More information

New Architecture for Dynamic Frame-Skipping Transcoder

New Architecture for Dynamic Frame-Skipping Transcoder 886 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 11, NO. 8, AUGUST 2002 New Architecture for Dynamic Frame-Skipping Transcoder Kai-Tat Fung, Yui-Lam Chan, and Wan-Chi Siu, Senior Member, IEEE Abstract Transcoding

More information

HIGH Efficiency Video Coding (HEVC) version 1 was

HIGH Efficiency Video Coding (HEVC) version 1 was 1 An HEVC-based Screen Content Coding Scheme Bin Li and Jizheng Xu Abstract This document presents an efficient screen content coding scheme based on HEVC framework. The major techniques in the scheme

More information

THE TRANSMISSION and storage of video are important

THE TRANSMISSION and storage of video are important 206 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 21, NO. 2, FEBRUARY 2011 Novel RD-Optimized VBSME with Matching Highly Data Re-Usable Hardware Architecture Xing Wen, Student Member,

More information

A parallel HEVC encoder scheme based on Multi-core platform Shu Jun1,2,3,a, Hu Dong1,2,3,b

A parallel HEVC encoder scheme based on Multi-core platform Shu Jun1,2,3,a, Hu Dong1,2,3,b 4th National Conference on Electrical, Electronics and Computer Engineering (NCEECE 2015) A parallel HEVC encoder scheme based on Multi-core platform Shu Jun1,2,3,a, Hu Dong1,2,3,b 1 Education Ministry

More information

A High Performance VLSI Architecture with Half Pel and Quarter Pel Interpolation for A Single Frame

A High Performance VLSI Architecture with Half Pel and Quarter Pel Interpolation for A Single Frame I J C T A, 9(34) 2016, pp. 673-680 International Science Press A High Performance VLSI Architecture with Half Pel and Quarter Pel Interpolation for A Single Frame K. Priyadarshini 1 and D. Jackuline Moni

More information

Scalable multiple description coding of video sequences

Scalable multiple description coding of video sequences Scalable multiple description coding of video sequences Marco Folli, and Lorenzo Favalli Electronics Department University of Pavia, Via Ferrata 1, 100 Pavia, Italy Email: marco.folli@unipv.it, lorenzo.favalli@unipv.it

More information

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

Research Article. ISSN (Print) *Corresponding author Shireen Fathima Scholars Journal of Engineering and Technology (SJET) Sch. J. Eng. Tech., 2014; 2(4C):613-620 Scholars Academic and Scientific Publisher (An International Publisher for Academic and Scientific Resources)

More information

CODING EFFICIENCY IMPROVEMENT FOR SVC BROADCAST IN THE CONTEXT OF THE EMERGING DVB STANDARDIZATION

CODING EFFICIENCY IMPROVEMENT FOR SVC BROADCAST IN THE CONTEXT OF THE EMERGING DVB STANDARDIZATION 17th European Signal Processing Conference (EUSIPCO 2009) Glasgow, Scotland, August 24-28, 2009 CODING EFFICIENCY IMPROVEMENT FOR SVC BROADCAST IN THE CONTEXT OF THE EMERGING DVB STANDARDIZATION Heiko

More information