CERIAS Tech Report Preprocessing and Postprocessing Techniques for Encoding Predictive Error Frames in Rate Scalable Video Codecs by E

Similar documents
ENCODING OF PREDICTIVE ERROR FRAMES IN RATE SCALABLE VIDEO CODECS USING WAVELET SHRINKAGE. Eduardo Asbun, Paul Salama, and Edward J.

CERIAS Tech Report Wavelet Based Rate Scalable Video Compression by K Shen, E Delp Center for Education and Research Information Assurance

MANY applications require that digital video be delivered

IMPROVEMENTS IN WAVELET-BASED RATE SCALABLE VIDEO COMPRESSION. AThesis. Submitted to the Faculty. Purdue University. Eduardo Asbun

OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

A STUDY OF REAL-TIME AND RATE SCALABLE IMAGE AND VIDEO COMPRESSION. AThesis Submitted to the Faculty. Purdue University. Ke Shen

Video coding standards

VERY low bit-rate video coding has triggered intensive. Significance-Linked Connected Component Analysis for Very Low Bit-Rate Wavelet Video Coding

Chapter 10 Basic Video Compression Techniques

INFORMATION THEORY INSPIRED VIDEO CODING METHODS : TRUTH IS SOMETIMES BETTER THAN FICTION

Overview: Video Coding Standards

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

INTRA-FRAME WAVELET VIDEO CODING

Scalable Foveated Visual Information Coding and Communications

Multimedia Communications. Image and Video compression

Visual Communication at Limited Colour Display Capability

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

Chapter 2 Introduction to

DWT Based-Video Compression Using (4SS) Matching Algorithm

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding

MPEG has been established as an international standard

Unequal Error Protection Codes for Wavelet Image Transmission over W-CDMA, AWGN and Rayleigh Fading Channels

Reduced complexity MPEG2 video post-processing for HD display

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005.

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

1 Overview of MPEG-2 multi-view profile (MVP)

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Digital Video Telemetry System

Highly Scalable Wavelet-Based Video Codec for Very Low Bit-Rate Environment. Jo Yew Tham, Surendra Ranganath, and Ashraf A. Kassim

EMBEDDED ZEROTREE WAVELET CODING WITH JOINT HUFFMAN AND ARITHMETIC CODING

Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding

Principles of Video Compression

The H.263+ Video Coding Standard: Complexity and Performance

Distributed Video Coding Using LDPC Codes for Wireless Video

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

Copyright 2005 IEEE. Reprinted from IEEE Transactions on Circuits and Systems for Video Technology, 2005; 15 (6):

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

THE popularity of multimedia applications demands support

The H.26L Video Coding Project

Multimedia Communications. Video compression

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Selective Intra Prediction Mode Decision for H.264/AVC Encoders

Visual Communications and Image Processing 2002, C.-C. Jay Kuo, Editor, Proceedings of SPIE Vol (2002) 2002 SPIE X/02/$15.

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

H.261: A Standard for VideoConferencing Applications. Nimrod Peleg Update: Nov. 2003

A Linear Source Model and a Unified Rate Control Algorithm for DCT Video Coding

AUDIOVISUAL COMMUNICATION

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

ABSTRACT ERROR CONCEALMENT TECHNIQUES IN H.264/AVC, FOR VIDEO TRANSMISSION OVER WIRELESS NETWORK. Vineeth Shetty Kolkeri, M.S.

Dr. Ashutosh Datar. Keywords Video Compression, EZW, 3D-SPIHT, WDR, ASWDR, PSNR, MSE.

Error Concealment for SNR Scalable Video Coding

COMPRESSION OF DICOM IMAGES BASED ON WAVELETS AND SPIHT FOR TELEMEDICINE APPLICATIONS

Enhanced Frame Buffer Management for HEVC Encoders and Decoders

Implementation of an MPEG Codec on the Tilera TM 64 Processor

MULTI WAVELETS WITH INTEGER MULTI WAVELETS TRANSFORM ALGORITHM FOR IMAGE COMPRESSION. Pondicherry Engineering College, Puducherry.

H.264/AVC Baseline Profile Decoder Complexity Analysis

Dual frame motion compensation for a rate switching network

Wyner-Ziv Coding of Motion Video

Colour Reproduction Performance of JPEG and JPEG2000 Codecs

SPIHT-NC: Network-Conscious Zerotree Encoding

Motion Compensated Video Compression with 3D Wavelet Transform and SPIHT

Adaptive Key Frame Selection for Efficient Video Coding

Impact of scan conversion methods on the performance of scalable. video coding. E. Dubois, N. Baaziz and M. Matta. INRS-Telecommunications

A Spatial Scalable Video Coding with Selective Data Transmission using Wavelet Decomposition

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

VERY low bit-rate video compression aimed at video

CHROMA CODING IN DISTRIBUTED VIDEO CODING

A New Compression Scheme for Color-Quantized Images

A video signal consists of a time sequence of images. Typical frame rates are 24, 25, 30, 50 and 60 images per seconds.

Variable Block-Size Transforms for H.264/AVC

Video 1 Video October 16, 2001

COMP 9519: Tutorial 1

FAST SPATIAL AND TEMPORAL CORRELATION-BASED REFERENCE PICTURE SELECTION

MPEG + Compression of Moving Pictures for Digital Cinema Using the MPEG-2 Toolkit. A Digital Cinema Accelerator

The Development of a Synthetic Colour Test Image for Subjective and Objective Quality Assessment of Digital Codecs

Error concealment techniques in H.264 video transmission over wireless networks

JPEG2000: An Introduction Part II

Video Over Mobile Networks

Speeding up Dirac s Entropy Coder

MSB LSB MSB LSB DC AC 1 DC AC 1 AC 63 AC 63 DC AC 1 AC 63

Color Quantization of Compressed Video Sequences. Wan-Fung Cheung, and Yuk-Hee Chan, Member, IEEE 1 CSVT

Improvement of MPEG-2 Compression by Position-Dependent Encoding

SCALABLE video coding (SVC) is currently being developed

DICOM medical image watermarking of ECG signals using EZW algorithm. A. Kannammal* and S. Subha Rani

Line-Adaptive Color Transforms for Lossless Frame Memory Compression

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS

MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1

176 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 13, NO. 2, FEBRUARY 2003

An Overview of Video Coding Algorithms

Analysis of a Two Step MPEG Video System

Applications of Digital Image Processing XXIV, Andrew G. Tescher, Editor, Proceedings of SPIE Vol (2001) 2001 SPIE X/01/$15.

Digital Image Processing

yintroduction to video compression ytypes of frames ysome video compression standards yinvolves sending:

Chapter 2 Video Coding Standards and Video Formats

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Motion Compensation Techniques Adopted In HEVC

Transcription:

CERIAS Tech Report 2001-118 Preprocessing and Postprocessing Techniques for Encoding Predictive Error Frames in Rate Scalable Video Codecs by E Asbun, P Salama, E Delp Center for Education and Research Information Assurance and Security Purdue University, West Lafayette, IN 47907-2086

PREPROCESSING AND POSTPROCESSING TECHNIQUES FOR ENCODING PREDICTIVE ERROR FRAMES IN RATE SCALABLE VIDEO CODECS Eduardo Asbun, Paul Salama, and Edward J. Delp Video and Image Processing Laboratory (VIPER) Department of Electrical Engineering School of Electrical and Computer Engineering Purdue School of Engineering and Technology Purdue University IUPUI West Lafayette, Indiana 47907-1285 Indianapolis, Indiana 46202-5132 U.S.A. U.S.A. ABSTRACT The characteristics of non natural images, such as predictive error frames used in video compression, present a challenge for traditional compression techniques. Particularly difficult are small images, such as QCIF, where compression artifacts at low data rates are more noticeable. In this paper, we investigate techniques to improve the performance of a wavelet-based, rate scalable video codec at low data rates. These techniques include preprocessing and postprocessing stages to enhance the quality and reduce the compression artifacts of decoded images. 1. INTRODUCTION Many techniques have been developed for compression of natural images [1, 2, 3, 4, 5, 6, 7, 8, 9]. When these techniques are used on other types of images, such as synthetic (computer generated) images or predictive error frames (PEF) used in video compression, their performance is poor. Coding artifacts may be introduced, especially at low data rates. An additional challenge is present when coding small images, such as QCIF (176x144 pixels), because artifacts are more noticeable. Predictive error frames (PEF) are used by video compression algorithms that use motion estimation to reduce the temporal redundancy of video sequences. A PEF, along with a set of motion vectors, is used to reconstruct a frame based on a reference frame. The PEF is usually encoded using transform-based codecs to reduce the spatial redundancy in the image. PEFs typically have low energy content. This work was supported by a grant from Texas Instruments and an equipment grant from Intel. Address all correspondence to E. J. Delp, ace@ecn.purdue.edu, +1 765 494 1740, or http://www.ece.purdue.edu/ ace. Several techniques have been proposed to reduce the coding artifacts of transform-based image compression schemes [10, 11]. These techniques make use of postprocessing to reduce the artifacts introduced by the decoder. In [12], we investigated the use of wavelet shrinkage to improve the performance of a video compression algorithm known as SAMCoW [1, 3]. The SAMCoW, Scalable Adaptive Motion Compensated Wavelet, video compression technique uses a wavelet decomposition of both intracoded and predictive error frames to remove spatial redundancy, and block-based motion compensation to remove temporal redundancy. SAMCoW uses the Color Embedded Zerotree Wavelet (CEZW) still image coder [2, 4] on its intracoded and predictive error frames. CEZW is an embedded technique that uses a combination of an unique spatial orientation tree and color transform to exploit redundancy across color components. A variation of SAMCoW, known as SAMCoW+, was described in [12, 13, 14]. Figure 1: Block diagram of the proposed approach. In this paper, we investigate further preprocessing and postprocessing techniques to reduce the coding artifacts of CEZW at low data rates. A block diagram of the proposed approach is shown in Figure 1. We are interested in the performance of these techniques when used with QCIF images, because this is the frame size commonly used in low data rate video applications.

The complexity of these techniques is an issue, because CEZW is used as part of the SAMCoW video compression algorithm, and hence will impact the overall complexity of the codec. In block-based video compression techniques, such as MPEG-2 and H.263+, PEFs are efficiently encoded using the Discrete Cosine Transform (DCT) on 16x16 blocks of the image. Blocks can be skipped if their energy is below a certain threshold. Wavelet-based video compression algorithms, such as SAMCoW, require that the transform be performed on the entire PEF and, hence, must contend with the global nature of the decomposition and the low pass effect inherent to wavelet filtering. (a) 2. CEZW: EMBEDDED CODING OF COLOR IMAGES CEZW uses a unique spatial orientation tree (SOT) in the YUV color space [2]. It exploits the interdependence between color components to achieve a higher degree of compression by using the concept that at spatial locations where chrominance components have large transitions, the luminance component also has large transitions [2, 4]. Therefore, each node in the SOT of the luminance component also has descendants in the chrominance components at the same spatial location. The luminance component is scanned first. When a luminance coefficient and all its descendants in both the luminance and chrominance components are insignificant, a zerotree symbol is assigned. Otherwise, a positive significant, negative significant, or isolated zero symbol is assigned. The chrominance components are scanned after the luminance component. SAMCoW+ uses CEZW for coding intracoded (I) frames. A variation of CEZW, described in [12], is used for coding PEFs in SAMCoW+. 3. CODING ARTIFACTS IN SAMCOW Maintaining acceptable quality in color images coded at rates less than 0.5 bits per pixel (bpp) is a challenge, especially in small images (QCIF or smaller). Ringing artifacts and areas of discoloration are commonly noticeable when using wavelet-based image compression algorithms. In Figure 2, a 512x512 YUV 4:1:1 image is encoded using CEZW, and decoded at 0.25 bpp. The same image, cropped and scaled to 176x144 pixels, is encoded using CEZW, and decoded at the same data rate. The subjective quality of the smaller image is lower. The same effect is observed in other wavelet-based video encoders, such as SPIHT [8]. The reason for this effect is Figure 2: Effect of image size of CEZW. (a) Original (512x512) Encoded using CEZW, decoded at 0.25 bpp. Cropped and resized (176x144) Encoded using CEZW, decoded at 0.25 bpp. that in small images, one pixel represents more area than in larger images. Therefore, decoding artifacts are more noticeable. In Figure 3, two PEFs from the foreman sequence, extracted from SAMCoW+, are shown. Both are encoded using CEZW and decoded at 0.25 bpp. These are 176x144 YUV 4:1:1 images. Blotchiness and ringing artifacts are evident in the decoded frames. In the decoder, the decoded frame is obtained by adding the decoded PEF to the predicted frame produced by motion compensation. The errors propagate to future frames when the decoded frame is used as a reference frame. In [12], we introduced a variation of CEZW that improves its performance when coding PEFs at low data rates. This technique is based on preprocessing the PEFs before encoding, and coding only certain significant trees in the wavelet decomposition. We found that the performance of SAMCoW+ is improved when this technique is used with CEZW. The preprocessing stage used in this paper and in [12], is an adaptive gain (AG) function followed by wavelet shrinkage, as shown in Figure 4. This stage is used to enhance the most important features of a PEF. The parameters of the AG function are dynamically changed, therefore adapting to the varying content of

(a) Figure 5: Adaptive gain (AG) function used as part of the preprocessing stage. Figure 3: (a) and are original PEFs from frames 35 and 293, respectively, of the foreman sequence. and are the same PEFs encoded using CEZW and decoded at 0.25 bpp. is done to avoid having sharp differences of magnitude between adjacent coefficients. These large differences would have been produced during the encoding process, by not allocating enough bits to encode a frame, effectively ignoring nonzero coefficients. Finally, a high pass filter is used on the frame produced by the CEZW decoder. PEFs in a sequence. The AG function is defined as H AG (p) = 0, if 0 p <t 1, p,ift 1 p <t 2, (1) p + K (t 3 p), if t 2 p <t 3, p,ift 3 p <max, where t 1, t 2,andt 3 are thresholds that depend on the content of the PEF, K is a constant that controls the feature enhancement, and max is the largest pixel magnitude in the PEF. This function is shown in Figure 5. Postprocessing stage in the proposed ap- Figure 6: proach. In Figure 7 (a) and, PEFs from frames 35 and 293, respectively, of the foreman sequence are shown after encoding and decoding using CEZW. Figure 7 and show the same PEFs after preprocessing and postprocessing, as described in this paper. More detail can be seen in the PEFs obtained using our approach. 4. SUMMARY Preprocessing stage in the proposed ap- Figure 4: proach. In this paper, the postprocessing stage consists of an enhancement stage, as shown in Figure 6. The goal is to sharpen the features of the decoded image. Before obtaining the inverse wavelet transform, the wavelet coefficients are modified to compensate for the preprocessing stage. The neighborhood of those coefficients whose magnitude is larger than a threshold, is examined. If their absolute magnitude is relatively close to zero, the coefficient is multiplied by a scale factor. This In this paper, we presented the use of preprocessing and postprocessing techniques to improve the performance of the SAMCoW algorithm by exploiting the characteristics of PEFs (e.g. low energy content). We used image enhancement techniques, including high pass filtering, to postprocess the decoded images. Preprocessing and postprocessing techniques are attractive, because they do not add overhead to the encoded bitstream. The computational requirements of these techniques are low, and do not increase the overall complexity of the video compression algorithm. Future research includes investigating other filter

(a) [5] Z. Xiong, K. Ramchandran, and M. T. Orchard, Space-frequency quantization for wavelet image coding, IEEE Transactions on Image Processing, vol. 6, no. 5, pp. 677 693, May 1997. [6] C. Chrysafis and A. Ortega, Efficient contextbased entropy coding for lossy wavelet image compression, Data Compression Conference, pp. 241 250, Snowbird, UT, March 1998. [7] J. M. Shapiro, Embedded image coding using zerotrees of wavelets coefficients, IEEE Transactions on Signal Processing, vol. 41, no. 12, pp. 3445 3462, December 1993. Figure 7: Experimental results. (a) and are PEFs from frames 35 and 293, respectively, of the foreman sequence, encoded using CEZW and decoded at 0.25 bpp. and are the same PEFs after preprocessing and postprocessing, as described in this paper. pairs. PostScript and PDF versions of this paper, and the images produced by our algorithm, are available via anonymous FTP to skynet.ecn.purdue.edu in the directory /pub/dist/delp/vlbv99/. 5. REFERENCES [1] K. Shen and E. J. Delp, Wavelet based rate scalable video compression, IEEE Transactions on Circuits and Systems for Video Technology, vol.9, no. 1, pp. 109 122, February 1999. [2] K. Shen and E. J. Delp, Color image compression using an embedded rate scalable approach, Proceedings of the IEEE International Conference on Image Processing, vol. III, pp. 34 37, Santa Barbara, California, October 1997. [3] K. Shen, A Study of Real Time and Rate Scalable Image and Video Compression. Ph.D. thesis, School of Electrical and Computer Engineering, Purdue University, December 1997. [4] M. Saenz, P. Salama, K. Shen, and E. J. Delp, An evaluation of color embedded wavelet image compression techniques, SPIE Conference on Visual Communications and Image Processing 99, pp. 282 293, San Jose, California, January 1999. [8] A. Said and W. A. Pearlman, New, fast, and efficient image codec based on set partitioning in hierarchical trees, IEEE Transactions on Circuits and Systems for Video Technology, vol. 6, no. 3, pp. 243 250, June 1996. [9] International Organization for Standardization, ISO/IEC JTC 1/SC 29/WG 1 (ITU-T SG8), Coding of Still Pictures, April 1999. (JPEG2000 Verification Model Version 4.0). [10] T. O Rourke and R. Stevenson, Improved image decompression for reduced transform coding artifacts, IEEE Transactions on Circuits and Systems for Video Technology, vol. 5, no. 6, pp. 490 499, December 1995. [11] M.-Y. Shen and C.-C. Kuo, Artifact reduction in low bit rate wavelet coding with robust nonlinear filtering, Proceedings of the 1998 IEEE Second Workshop on Multimedia Signal Processing, pp. 480 485, Redondo Beach, CA, December 1998. [12] E. Asbun, P. Salama, and E. J. Delp, Encoding of predictive error frames in rate scalable video codecs using wavelet shrinkage, Proceedings of the IEEE International Conference on Image Processing, Kobe, Japan, October 1999. [13] E. J. Delp, P. Salama, E. Asbun, M. Saenz, and K. Shen, Rate scalable image and video compression techniques, Proceedings of 42nd Midwest Symposium on Circuits and Systems, Las Cruces, New Mexico, August 1999. [14] E. Asbun, P. Salama, K. Shen, and E. J. Delp, Very low bit rate wavelet-based scalable video compression, Proceedings of the IEEE International Conference on Image Processing, pp. 948 952, Chicago, Illinois, October 1998.