Project Proposal: Sub pixel motion estimation for side information generation in Wyner- Ziv decoder.

Similar documents
Error concealment techniques in H.264 video transmission over wireless networks

CHROMA CODING IN DISTRIBUTED VIDEO CODING

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding

Selective Intra Prediction Mode Decision for H.264/AVC Encoders

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005.

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

UC San Diego UC San Diego Previously Published Works

Fast Mode Decision Algorithm for Intra prediction in H.264/AVC Video Coding

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

Systematic Lossy Error Protection of Video based on H.264/AVC Redundant Slices

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

FAST SPATIAL AND TEMPORAL CORRELATION-BASED REFERENCE PICTURE SELECTION

A Study on AVS-M video standard

ERROR CONCEALMENT TECHNIQUES IN H.264 VIDEO TRANSMISSION OVER WIRELESS NETWORKS

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder.

Analysis of Packet Loss for Compressed Video: Does Burst-Length Matter?

Systematic Lossy Forward Error Protection for Error-Resilient Digital Video Broadcasting

ERROR CONCEALMENT TECHNIQUES IN H.264

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding

WE CONSIDER an enhancement technique for degraded

Video Quality Monitoring for Mobile Multicast Peers Using Distributed Source Coding

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Reduced complexity MPEG2 video post-processing for HD display

Study of AVS China Part 7 for Mobile Applications. By Jay Mehta EE 5359 Multimedia Processing Spring 2010

Wyner-Ziv Coding of Motion Video

Reduced Decoder Complexity and Latency in Pixel-Domain Wyner-Ziv Video Coders

ABSTRACT ERROR CONCEALMENT TECHNIQUES IN H.264/AVC, FOR VIDEO TRANSMISSION OVER WIRELESS NETWORK. Vineeth Shetty Kolkeri, M.S.

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Chapter 2 Introduction to

MULTIVIEW DISTRIBUTED VIDEO CODING WITH ENCODER DRIVEN FUSION

Performance Comparison of JPEG2000 and H.264/AVC High Profile Intra Frame Coding on HD Video Sequences

Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding. Abstract. I. Introduction

Quarter-Pixel Accuracy Motion Estimation (ME) - A Novel ME Technique in HEVC

Exploring the Distributed Video Coding in a Quality Assessment Context

Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices

Dual Frame Video Encoding with Feedback

SCALABLE video coding (SVC) is currently being developed

INFORMATION THEORY INSPIRED VIDEO CODING METHODS : TRUTH IS SOMETIMES BETTER THAN FICTION

ROBUST REGION-OF-INTEREST SCALABLE CODING WITH LEAKY PREDICTION IN H.264/AVC. Qian Chen, Li Song, Xiaokang Yang, Wenjun Zhang

Error Concealment for SNR Scalable Video Coding

Video coding standards

COMPLEXITY REDUCTION FOR HEVC INTRAFRAME LUMA MODE DECISION USING IMAGE STATISTICS AND NEURAL NETWORKS.

Distributed Video Coding Using LDPC Codes for Wireless Video

Overview: Video Coding Standards

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

STUDY OF AVS CHINA PART 7 JIBEN PROFILE FOR MOBILE APPLICATIONS

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

Distributed video coding supporting hierarchical GOP structures with transmitted motion vectors

Principles of Video Compression

Parameters optimization for a scalable multiple description coding scheme based on spatial subsampling

ROBUST ADAPTIVE INTRA REFRESH FOR MULTIVIEW VIDEO

OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS

Adaptive mode decision with residual motion compensation for distributed video coding

Energy Efficient Video Compression for Wireless Sensor Networks *

Video Over Mobile Networks

WITH the rapid development of high-fidelity video services

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS

The H.263+ Video Coding Standard: Complexity and Performance

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

Compressed Domain H.264 Baseline Encoder in Video Transcoding Architecture

Robust 3-D Video System Based on Modified Prediction Coding and Adaptive Selection Mode Error Concealment Algorithm

Multiple Description H.264 Video Coding with Redundant Pictures

An Overview of Video Coding Algorithms

PACKET-SWITCHED networks have become ubiquitous

Marie Ramon, François-XavierCoudoux, andmarcgazalet. 1. Introduction

Speeding up Dirac s Entropy Coder

SKIP Prediction for Fast Rate Distortion Optimization in H.264

SCENE CHANGE ADAPTATION FOR SCALABLE VIDEO CODING

The H.26L Video Coding Project

AUDIOVISUAL COMMUNICATION

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Systematic Lossy Error Protection based on H.264/AVC Redundant Slices and Flexible Macroblock Ordering

Adaptive Key Frame Selection for Efficient Video Coding

Interim Report Time Optimization of HEVC Encoder over X86 Processors using SIMD. Spring 2013 Multimedia Processing EE5359

Systematic Lossy Error Protection of Video Signals Shantanu Rane, Member, IEEE, Pierpaolo Baccichet, Member, IEEE, and Bernd Girod, Fellow, IEEE

Power Reduction via Macroblock Prioritization for Power Aware H.264 Video Applications

Camera Motion-constraint Video Codec Selection

A High Performance VLSI Architecture with Half Pel and Quarter Pel Interpolation for A Single Frame

Visual Communication at Limited Colour Display Capability

ARTICLE IN PRESS. Signal Processing: Image Communication

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

Scalable multiple description coding of video sequences

Robust Transmission of H.264/AVC Video using 64-QAM and unequal error protection

Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling

1934 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 21, NO. 4, APRIL 2012

an organization for standardization in the

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Motion Compensation Techniques Adopted In HEVC

Decoder-driven mode decision in a block-based distributed video codec

Adaptive Distributed Compressed Video Sensing

Low Complexity Hybrid Rate Control Schemes for Distributed Video Coding

HEVC Subjective Video Quality Test Results

A parallel HEVC encoder scheme based on Multi-core platform Shu Jun1,2,3,a, Hu Dong1,2,3,b

Conference object, Postprint version This version is available at

Drift Compensation for Reduced Spatial Resolution Transcoding

Real-Time Distributed Video Coding for 1K-pixel Visual Sensor Networks

Using enhancement data to deinterlace 1080i HDTV

Highly Efficient Video Codec for Entertainment-Quality

Transcription:

EE 5359 MULTIMEDIA PROCESSING Subrahmanya Maira Venkatrav 1000615952 Project Proposal: Sub pixel motion estimation for side information generation in Wyner- Ziv decoder. Wyner-Ziv(WZ) encoder is a low -complexity encoder and can be made to achieve compression comparable to traditional high complexity encoders but at the expense of a high-complexity decoder. The high complexity of the decoder is mainly attributed to the generation of side information which involves motion estimation. The quality of the frame reconstructed at the decoder mainly depends on quality of the motion estimation. Hence in this proposal sub pixel motion estimation is considered for side information generation. 1. Overview of WZ Encoder The WZ encoding process involves encoding of key frames and WZ frames. The overall encoding process is illustrated in Figure 1. Some of the input frames are marked as key frames and encoded using H.264 Intra (I) frame [6][18] encoding. The WZ frames are encoded as follows: The difference between previous reconstructed key frame and WZ frame is quantized using a uniform scalar quantizer and the output is encoded using low-density-parity-check accumulated (LDPCA) code. Figure 1. Block diagram of WZ encoder

2. Overview of WZ Decoder The first step in decoding a WZ frame is generation of side information (SI) using key frames. The SI frame generated is used by the low density parity check accumulate (LDPCA) decoder to decode the WZ bitstream and generate the WZ frame. The previous key frame is subtracted from the SI frame generated to produce error frame which is subsequently quantized. This is used by LDPCA decoder to correct bit errors in WZ encoded error frame. The error frame obtained in this way is added to the key frame and de-quantized to reconstruct the WZ frame. At low bitrates some of the macroblocks in the WZ frame cannot be reconstructed. These macroblocks are replaced with the corresponding macroblocks from WZ estimated frame. Figure 2. Block diagram of WZ decoder [1]

3. Generation of Side Information (SI) The SI frame generation is key aspect of WZ decoding process. The quality of the decoding is dependent on the SI frame and in terms of complexity this is a major component of WZ decoder. Figure 3.Side information generation using key frames [1] The generation of SI is illustrated in Figure 3 and it involves, Motion estimation (ME) between two key frames to obtain motion vectors (MV). The estimation is done in both forward and backward directions to obtain MV F and MV B respectively as shown in Figure 3. The block sizes are used for ME are 16x16, 8x8 and 4x4. The derivation of motion vectors for WZ frames. This is done by scaling MVs obtained in the previous step by the ratio calculated as distance between WZ frame to previous key frame to the distance between key frames themselves. In Figure 3, the scaling factor is ½ since the ratio of distance between key frame and WZ frame to distance between two key frames is ½. Obtaining the estimation for macroblock of a WZ frame by interpolation of macroblocks from the previous and next key frames. The motion vectors calculated in the previous step are used here to obtain mapping of macroblocks in WZ frame to key frame macroblocks. The forward predicted frame (P F ) is obtained using forward motion vector MV F and backward predicted frame (P B ) is obtained using backward motion vector MV B. Then the side information frame Y is obtained as (P F + P B )/2 as shown in Figure 3.

4. Sub-pixel motion estimation for SI generation The side information generated can be improved by using sub-pixel motion vectors for both forward and backward predictions. In order to derive these sub-pixel positions interpolation between pixels needs to be performed. For half pixel motion estimation there are three pixel positions that need to be evaluated. For quarter pixel motion estimation there are twelve pixel positions that need to be evaluated. The generation of sub-pixel positions is done as per H.264 standard [8] and is briefly described below: 1. Half-pixel positions: In Figure 4 the pixel positions numbered H33, G33 and D33 are half pixel positions and need to be derived. Figure 4. Full and half pixel positions These are generated by interpolating full pixel or half pixel values using a six tap filter [1-5 20 20-5 1]/32. Following equations can be used, H33 = [F13 + -5 * F23 + 20 * F33 + 20 * F43 + -5 * F53 + F63 + 15] >> 5 G33 = [F31 + -5 * F32 + 20 * F33 + 20 * F34 + -5 * F35 + F36 + 15] >> 5 D33 = [H31 + -5 * H32 + 20 * H33 + 20 * H34 + -5 * H35 + H36 + 15] >> 5 2. Quarter-pixel positions: The quarter pixels are obtained by averaging nearest full pixel or half pixel positions.

Figure 5. Full, half and quarter pixel positions The following equations are used for obtaining quarter pixel positions, q1 = ( F33 + G33 + 1 ) >> 1 q2 = ( G33 + F34 + 1 ) >> 1 q3 = ( F33 + H33 + 1 ) >> 1 q4 = ( H33 + G33 + 1 ) >> 1 q5 = (G33 + D33 + 1 ) >> 1 q6 = ( G33 + H34 + 1 ) >> 1 q7 = ( H33 + D33 + 1 ) >> 1 q8 = ( D33 + H34 + 1 ) >> 1 q9 = ( H33 + F43 + 1 ) >> 1 q10 = ( H33 + G43 + 1 ) >> 1 q11 = ( D33 + G43 + 1 ) >> 1 q12 = ( G43 + H34 + 1 ) >> 1 The forward and backward predicted data obtained for each partition block is averaged to obtain the final prediction block. In case for a block if there is no motion vector, then intra prediction can be used to predict the block from neighboring pixels. The improvement in the quality of SI generated with the sub pixel motion estimation over full pixel motion estimation can be measured both visually and quantitatively [17]. The quantitative measurement can be done by PSNR of the predicted frame with reference to the original frame. The objective is to get a good improvement in the quality of SI frame. 3. Results: The half-pel motion estimation is implemented for WZ frame generation using JM reference software. The even frames are encoded as I frames and odd frames are encoded using WZ encoder. For quality comparison between WZ encoder and H.264 encoder a separate encoding is done with even frame being I frame and odd frame being P frame. The WZ frame obtained using SI prediction is analyzed for PSNR with reference to corresponding

H.264 P frame. The average PSNR plot for a QCIF (176x144) test sequence for a ME search range of 64 is shown in Table 1 and the plot is shown in Figure 6. Coastguard_qcif.yuv (Search Range- 64) SI prediction scheme PSNR(dB) Frame Average 23.594586 Full pel-me 4x4 21.586695 Full pel-me 8x8 26.413514 Full pel-me 16x16 28.0283 Half pel-me 4x4 28.767624 Half pel-me 8x8 30.579922 Half pel-me 16x16 30.350176 Table 1. PSNR for WZ frame. Figure 6. PSNR for WZ frame The PSNR plot in the Figure 6 indicates that the half-pel ME with 8x8 block size performs better than any other schemes. The gain in PSNR using ME for prediction is approximately 7dB compared that of simple frame averaging scheme. Further performance gain can be achieved using quarter-pel motion estimation.

References: 1. E. Peixoto, R. L. de Queiroz, and D. Mukherjee, Mobile video communications using a Wyner- Ziv transcoder, Proc. SPIE 6822, VCIP, 68220R Jan. 2008. 2. A. Aaron, E. Setton and B. Girod, "Towards practical Wyner-Ziv coding of video," Proceedings. 2003 International Conference on Image Processing, 2003. ICIP 2003., vol.3, pp. III-869-872, 14-17 Sept. 2003. 3. K. R. Rao and J. J. Hwang, Techniques and standards for image, video, and audio coding, Prentice Hall PTR, 1996. 4. Jin-Soo Kim, "Brief overview of Wyner-Ziv CODEC" (Private Communication) 5. A. Aaron, D. Varodayan, and B. Girod, Wyner-Ziv residual coding of video, Proc. International Picture Coding Symposium, Beijing, P. R. China, April 2006. 6. T. Wiegand and G.J Sullivan, The H.264/AVC video coding standard, IEEE SP Magazine, vol. 24, pp. 148-153, March 2007. 7. G. J. Sullivan, P. Topiwala, and A. Luthra, "The H.264/AVC advanced video coding standard: Overview and introduction to the fidelity range extensions", SPIE Conf. on applications of digital image processing XXVII, vol. 5558, pp. 53-74, Aug. 2004. 8. S.K. Kwon, A. Tamhankar, and K.R. Rao Overview of H.264/MPEG-4 Part 10 J. VCIR, Vol. 17, pp. 186-216, April 2006, Special Issue on Emerging H.264/AVC Video Coding Standard,. 9. A. Wyner and J. Ziv, "The rate-distortion function for source coding with side information at the decoder," IEEE Trans., Information Theory, vol.22, pp. 1-10, Jan 1976. 10. D. Varodayan, A. Aaron and B. Girod, "Rate-adaptive distributed source coding using lowdensity parity-check codes," Conference Record of the Thirty-Ninth Asilomar Conference on Signals, Systems and Computers, 2005, pp. 1203-1207, Oct. 28 Nov. 1, 2005. 11. Z. Li and E.J. Delp, "Wyner-Ziv video side estimator: conventional motion search methods revisited," IEEE International Conference on Image Processing, 2005. ICIP 2005, vol.1, pp. I- 825-8, 11-14 Sept. 2005. 12. L. Liu and E. J. Delp, "Wyner-Ziv video coding using LDPC codes," Proceedings of the 7th Nordic Signal Processing Symposium, 2006. NORSIG 2006. 13. D. Kubasov, K. Lajnef and C. Guillemot, "A hybrid encoder/decoder rate control for Wyner-Ziv video coding with a feedback channel," IEEE 9th Workshop on Multimedia Signal Processing, 2007. MMSP 2007., pp.251-254, 1-3 Oct. 2007. 14. C. Brites and F. Pereira, "Encoder rate control for transform domain Wyner-Ziv video coding," IEEE International Conference on Image Processing, 2007. ICIP 2007., vol.2, pp.ii -5-II -8, 16-19 Sept. 2007 15. A. Roca, et al, "Rate control algorithm for pixel-domain Wyner-Ziv video coding," Proc. SPIE, vol. 6822, 68221T (2008). 16. D. Mukherjee, Optimal parameter choice for Wyner-Ziv coding of Laplacian sources with decoder side information, HP Labs Technical Report HPL-2007-34, 2007. 17. Z. Wang, et al, "Image quality assessment: From error visibility to structural similarity," IEEE Trans., Image Processing, vol.13, pp. 600-612, April 2004 18. I. Richardson, The H.264 advanced video compression standard, Hoboken, NJ: Wiley, 2010. List of acronyms in alphabetical order: CIF: Common intermediate format. Video resolution of 352x288. JM: Joint Model. The joint model (JM) implementation of the H.264 encoder and decoder. LDPCA: Low-Density-Parity-Check Accumulated. MV: Motion Vector ME: Motion Estimation PSNR: Peak Signal to Noise Ratio QCIF: Quarter CIF. 176 by 144 resolution. SI: Side Information WZ: Wyner-Ziv