ENCODING OF PREDICTIVE ERROR FRAMES IN RATE SCALABLE VIDEO CODECS USING WAVELET SHRINKAGE. Eduardo Asbun, Paul Salama, and Edward J.

Similar documents
CERIAS Tech Report Preprocessing and Postprocessing Techniques for Encoding Predictive Error Frames in Rate Scalable Video Codecs by E

CERIAS Tech Report Wavelet Based Rate Scalable Video Compression by K Shen, E Delp Center for Education and Research Information Assurance

MANY applications require that digital video be delivered

IMPROVEMENTS IN WAVELET-BASED RATE SCALABLE VIDEO COMPRESSION. AThesis. Submitted to the Faculty. Purdue University. Eduardo Asbun

OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

The H.263+ Video Coding Standard: Complexity and Performance

Overview: Video Coding Standards

Visual Communication at Limited Colour Display Capability

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

VERY low bit-rate video coding has triggered intensive. Significance-Linked Connected Component Analysis for Very Low Bit-Rate Wavelet Video Coding

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Chapter 10 Basic Video Compression Techniques

AUDIOVISUAL COMMUNICATION

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

INFORMATION THEORY INSPIRED VIDEO CODING METHODS : TRUTH IS SOMETIMES BETTER THAN FICTION

Scalable Foveated Visual Information Coding and Communications

Video coding standards

Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding

Principles of Video Compression

Highly Scalable Wavelet-Based Video Codec for Very Low Bit-Rate Environment. Jo Yew Tham, Surendra Ranganath, and Ashraf A. Kassim

EMBEDDED ZEROTREE WAVELET CODING WITH JOINT HUFFMAN AND ARITHMETIC CODING

A STUDY OF REAL-TIME AND RATE SCALABLE IMAGE AND VIDEO COMPRESSION. AThesis Submitted to the Faculty. Purdue University. Ke Shen

Video Transmission. Thomas Wiegand: Digital Image Communication Video Transmission 1. Transmission of Hybrid Coded Video. Channel Encoder.

Multimedia Communications. Image and Video compression

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

MPEG has been established as an international standard

Chapter 2 Introduction to

Adaptive Key Frame Selection for Efficient Video Coding

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding

Analysis of a Two Step MPEG Video System

Chapter 2. Advanced Telecommunications and Signal Processing Program. E. Galarza, Raynard O. Hinds, Eric C. Reed, Lon E. Sun-

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

CHROMA CODING IN DISTRIBUTED VIDEO CODING

Unequal Error Protection Codes for Wavelet Image Transmission over W-CDMA, AWGN and Rayleigh Fading Channels

A Linear Source Model and a Unified Rate Control Algorithm for DCT Video Coding

Motion Video Compression

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005.

Constant Bit Rate for Video Streaming Over Packet Switching Networks

Video Over Mobile Networks

H.261: A Standard for VideoConferencing Applications. Nimrod Peleg Update: Nov. 2003

Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding. Abstract. I. Introduction

THE popularity of multimedia applications demands support

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder.

MPEG-2. ISO/IEC (or ITU-T H.262)

SCALABLE video coding (SVC) is currently being developed

Reduced complexity MPEG2 video post-processing for HD display

ABSTRACT ERROR CONCEALMENT TECHNIQUES IN H.264/AVC, FOR VIDEO TRANSMISSION OVER WIRELESS NETWORK. Vineeth Shetty Kolkeri, M.S.

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

Error concealment techniques in H.264 video transmission over wireless networks

Bit Rate Control for Video Transmission Over Wireless Networks

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

COMPRESSION OF DICOM IMAGES BASED ON WAVELETS AND SPIHT FOR TELEMEDICINE APPLICATIONS

Impact of scan conversion methods on the performance of scalable. video coding. E. Dubois, N. Baaziz and M. Matta. INRS-Telecommunications

Selective Intra Prediction Mode Decision for H.264/AVC Encoders

Dual frame motion compensation for a rate switching network

The H.26L Video Coding Project

Multimedia Communications. Video compression

MSB LSB MSB LSB DC AC 1 DC AC 1 AC 63 AC 63 DC AC 1 AC 63

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

Color Quantization of Compressed Video Sequences. Wan-Fung Cheung, and Yuk-Hee Chan, Member, IEEE 1 CSVT

WITH the rapid development of high-fidelity video services

INTRA-FRAME WAVELET VIDEO CODING

H.264/AVC Baseline Profile Decoder Complexity Analysis

1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010

Error Concealment for SNR Scalable Video Coding

Error-Resilience Video Transcoding for Wireless Communications

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

DWT Based-Video Compression Using (4SS) Matching Algorithm

Key Techniques of Bit Rate Reduction for H.264 Streams

IN OBJECT-BASED video coding, such as MPEG-4 [1], an. A Robust and Adaptive Rate Control Algorithm for Object-Based Video Coding

Drift Compensation for Reduced Spatial Resolution Transcoding

MULTI WAVELETS WITH INTEGER MULTI WAVELETS TRANSFORM ALGORITHM FOR IMAGE COMPRESSION. Pondicherry Engineering College, Puducherry.

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Scalable multiple description coding of video sequences

Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices

Advanced Computer Networks

Digital Image Processing

Research Article Design and Analysis of a High Secure Video Encryption Algorithm with Integrated Compression and Denoising Block

Multiview Video Coding

Improvement of MPEG-2 Compression by Position-Dependent Encoding

SPIHT-NC: Network-Conscious Zerotree Encoding

MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1

Hierarchical SNR Scalable Video Coding with Adaptive Quantization for Reduced Drift Error

MPEG-1 and MPEG-2 Digital Video Coding Standards

SCENE CHANGE ADAPTATION FOR SCALABLE VIDEO CODING

Comparative Analysis of Wavelet Transform and Wavelet Packet Transform for Image Compression at Decomposition Level 2

Systematic Lossy Forward Error Protection for Error-Resilient Digital Video Broadcasting

Implementation of an MPEG Codec on the Tilera TM 64 Processor

Video 1 Video October 16, 2001

A look at the MPEG video coding standard for variable bit rate video transmission 1

Analysis of Video Transmission over Lossy Channels

Design Approach of Colour Image Denoising Using Adaptive Wavelet

Transcription:

ENCODING OF PREDICTIVE ERROR FRAMES IN RATE SCALABLE VIDEO CODECS USING WAVELET SHRINKAGE Eduardo Asbun, Paul Salama, and Edward J. Delp Video and Image Processing Laboratory (VIPER) School of Electrical and Computer Engineering Purdue University West Lafayette, Indiana 47907-1285 U.S.A. ABSTRACT Rate scalable video compression is appealing for low bit rate applications, such as video telephony and wireless communication, where bandwidth available to an application cannot be guaranteed. In this paper, we investigate a set of strategies to increase the performance of SAMCo W, a rate scalable encoder [l, 21. These techniques are based on based on wavelet decomposition, spatial orientation trees, and motion compensation. 1. INTRODUCTION Most of the research in wavelet-based image and video compression has been directed towards optimizing performance for encoding of natural scenes [3, 4, 51. Predictive error frames (PEFs), used in many video compression techniques, present a challenge for many codecs in that they are not (natural. In [6], an algorithm for space-frequency adaptive coding of PEFs is presented. A study of the optimal bit allocation between PEFs and motion vector fields is presented in [7]. In this paper we investigate new techniques for the coding of PEFs. Our approach is based on preprocessing a PEF before encoding it. This preprocessing step uses wavelet shrinkage [8, 91 to reduce the number of relatively insignificant wavelet coefficients before zerotree encoding. An approach to encoding the wavelet coefficients in predictive error frames based on Color Embedded Zerotree Wavelet (CEZW) [l, 10, 111 is described in Section 3. The techniques described above are integrated into a rate scalable video codec, using a dynamic bit allocation strategy for predictivecoded (P) frames. This codec is an extension of the This work was supported by a grant from Texas Instruments. Address all correspondence to E. J. Delp, ace@ecn.purdue.edu, http://wwu.ece.purdue.edu/-ace, or +1 765 494 1740. Scalable Adaptive Motion Compensated Wavelet (SAM- COW) video compression technique presented in [l, 21. In this paper we shall refer to this extension as SAM- COW+. Experimental results are shown in Section 4. 2. SAMCOW Rate scalable video codecs have received considerable attention due to the growing importance of video delivery over heterogeneous data networks; Current video coding standards such as MPEG-2 [12], MPEG-4 [13], and H.263+ [14] provide layered temporal, spatial, and SNR scalability. SAMCo W [l, 21 uses embedded coding such that the data rate can be dynamically changed on a frame-by-frame basis, and does not require the use of separate layers for scalability. The main features of SAMCo W are: i) a modified zerotree wavelet image compression scheme known as CEZW (1, 10, 111 used for coding intracoded and predictive error frames; and ii) adaptive block-based motion compensation [15, 161 used in the spatial domain to reduce temporal redundancy. A complete description of SAMCo W is provided in [l, 21. 2.1. CEZW: Embedded Coding of Color Images CEZW uses a unique spatial orientation tree (SOT) in the YUV color space. It exploits the interdependence between color components to achieve a higher degree of compression by observing that at spatial locations where chrominance components have large transitions, the luminance component also has large transitions [l, 111. Therefore, each node in the SOT of the luminance component also has descendants in the chrominance components at the same spatial location. The luminance component is scanned first. When a luminance coefficient and all its descendants in both the 0-7803-5467-2/99/ $10.00 0 1999 IEEE 832

luminance and chrominance components are insignificant, a zerotree symbol is assigned. Otherwise, a positive significant, negative significant, or isolated zero symbol is assigned. The chrominance components are scanned after the luminance component. SAMCo W uses CEZW for coding intracoded (I) and predictive error frames. A variation of CEZW, described below, is used for coding the PEFs in SAMCo W+. 3. SAMCOWS In this section we introduce SAMCOW+. In SAM- COW+, CEZ W is used for coding I frames. A modified CEZW algorithm is used for PEFs, as shown in Figure 1. The PEF is preprocessed by using feature emphasis techniques and the elimination of information that is not visually significant. The modified CEZW algorithm uses wavelet shrinkage to selectively encode spatial orientation trees. lntracoded Frame Predictive Error Figure 2: Adaptive gain (AG) function used to emphasize features in a PEF. Soft- and hard-thresholding of wavelet coefficients has been used for signal and image denoising (8, 9, 17, 181. Typical thresholding functions are shown in Figure 3. In (81, a uniform soft-threshold is used across scales of the decomposition, whereas in [17, 181 softthresholding is scale-dependent. The latter approach is consistent with the observation that the statistics of the coefficients change at each scale. Figure 1: Coding of intracoded and predictive error frames in SAMCo W+. 3.1. Preprocessing and Wavelet Shrinkage In the preprocessing stage, an adaptive gain (AG) function is used on the PEF. In this function, the areas where the predictive error is more significant are enhanced. The parameters of the AG function are set dynamically, therefore incorporating flexibility to adapt to the varying content of PEFs in a sequence. This AG function is similar to the GAG operator described in [17]. Figure 2 shows the AG function used in preprocessing the PEFs. The AG function is defined as 0 where tl, t2, and t3 are thresholds that depend on the content of the PEF, K is constant that controls the feature enhancement, and max is the largest pixel magnitude in the PEF. The thresholds are chosen based on the statistics of the frame. Soft-thresholding Hard-thresholding Figure 3: Soft- and hard-thresholding of coefficient w In this paper, we follow the procedure described in [8], using a scale adaptive threshold as in [17]. Let f(m,n) be a PEF, and w = Wj"[f(m,n)] be a wavelet coefficient of f(m, n) at level j (1 5 j 5 J ) and spatial orientation d (d E {HH, HL, LH, LL}). The new wavelet coefficient 5 is obtained as follows: where sign(v) = 6 = sign(w)()w) - tj"), (2) +1, if w > 0, 0, -1, ifw=o, if w < 0, (3) = { [I - t;, if 14 > t;, (1.1 - $1, otherwise, (4) and tjd is Some appropriately chosen threshold. The value oft: depends on the statistics of the wavelet de- 833

composition at level j and orientation d, and is obtained as follows: tj = (T,,, - a(j - l))~;, if T, - a(j - 1) > Tmin { Tmincjjdl otherwise (5) Here, a is a decreasing factor between two consecutive levels, and T, and Tmin are maximum and minimum factors for U?, the empirical standard deviation of the wavelet decomposition at the corresponding level and orientation, respectively. 3.2. Encoding of Significant Trees After the features of the PEF are enhanced and the coefficients of the wavelet decomposition of the PEFs are shrunk using the technique described above, the resulting coefficients are then encoded. When using CEZW to encode the coefficients of a wavelet decomposition, several passes are made to refine the precision of the approximations. As the coefficients are examined, the symbols positive significant (POS), negative significant (NEG), isolated zero (IZ), and zerotree (ZTR) are assigned [lo, 111. A coefficient is assigned the symbol IZ when the coefficient is not significant but some of its descendants are significant with respect to a threshold. In this paper, we modify the CEZW algorithm as follows: In the first dominant pass, we will identify the coefficients that are significant (positive and negative) at the coarsest scale. we refer to these coefficients as tree roots,,, and their descendants are part of a significant tree. The result of this step is that only a select number of trees are considered for further processing. In the remaining dominant passes, until the bit rate is exhausted, only coefficients that belong to the significant trees are examined. compensated frame in a group of pictures (GOP) diverges from that of the original since predictive-coded (P) frames are used as reference for other P frames. This causes PEFs towards the end of a GOP to carry more information, especially in sequences with high degree of motion. In DCT-based video codecs such as MPEG-2 or H.263+, a macroblock can be skipped when all quantized coefficients within that macroblock are zero. In a wavelet-based encoder, the coefficients in the decomposition are examined and refined until the bit budget is exhausted. However, when PEFs such as those occurring near the beginning of a GOP do not carry as much information, bits will be used to encode information that is not visually relevant. The opposite will occur near the end of the GOP. In SAMCo W+, a variable number of bits is allocated to the PEF based on the number of significant trees being examined. This allows the data rate to vary depending on the level of activity in the scene. Furthermore, certain frames are not encoded (skipped), that is, no bits are allocated to them. This is to avoid compromising the quality of the encoded frames. 4. RESULTS AND CONCLUSIONS We used a four-level wavelet decomposition on the PEFs, and applied soft-thresholding to all four levels. A PEF towards the end of the GOP in the akiyo sequence is Shown in Figure 4(a). The PEF after preprocessing, as described in Section 3.1, is shown in Figure 4(b). After preprocessing, the information that is most visually Significant in Figure 4(b) is still preserved, but requires fewer bits to represent it. This strategy effectively skips certain trees in the wavelet deco mp0siti.n. With this modification, we intend to select the most representative information in the decomposition. Therefore, we will use the bit budget for the PEFs as efficiently as possible, encoding the most significant information and disregarding coefficients whose contribution is not significant in terms of quality of the encoding. 3.3. Dynamic Bit Allocation In SAMCo W, all PEFs are assigned an equal number of bits to be used for encoding [2]. However, this approach is not efficient considering that the quality of a motion Figure 4: A predictive error frame from the akiyo sequence. (a) Original PEF. (b) PEF after preprocessing. Figure 5 shows the PSNR of the first 60 frames in the akiyo sequence decoded at 24 kbps using SAM- COW+, SAMCoW, and H.263+. The GOP size for SAMCo W+ and SAMCo Wwas 20. Figure 6 shows the PSNR of frames 200-259 of the foreman sequence decoded at 64 kbps using SAMCOW+, SAMCoW, and H.263+. The GOP size for SAMCo W+ and SAMCo W 834

was 10. For both experiments, the target frame rate was 10 frames per second. In SAMCo W+, some frames are not encoded, that is, they are skipped. When this occurs, the decoder repeats the previously decoded frame. To obtain the PSNR values of skipped frames for Figures 5 and 6, we compared the repeated frame, with the frame in the original sequence that would correspond to the frame that was skipped. Therefore, the PSNR values for these frames are low. PSNR values of 60 frames of the akivo seauence at 24 kbm 10 20 9 U) 50 M Frames Figure 7: A frame in the akiyo sequence, decoded at 24 kbps. (a) Original, (b) SAMCo W+, (c) SAMCo W, and (d) H.263+. Figure 5: PSNR values of the akiyo sequence at 24 kbps. I -30 - a U) a 25. 20- PSNR valuw of 617 frames of the foreman seauenca d 64 kbm PEF to enhance its most important features, and softthresholding of coefficients of the wavelet decomposition. These techniques are integrated to SAMCo W+. A new bit allocation scheme is also used in SAMCo W+. The performance and visual quality of SAMCo W is improved for data rates between 24 and 64 kbps. Preprocessing has the advantage of enhancing the most visually importaht features of the PEFs. A disadvantage is that information about the PEF is being discarded. However, at low data rates, this information would not be encoded anyway due to the limited bit budget. Softthresholding has the effect of a low-pass filter on the wavelet decomposition. Therefore, a post-processing stage may be necessary to reduce this effect. 15 10 20 30 a M 4 Frames Figure 6: PSNR values of the foreman sequence at 64 kbps. Figure 7 shows a frame of the decoded akiyo sequence (frame ll in the decoded sequence, corresponding to frame 33 in the original sequence) at 24 kbps. Figure 8 shows a frame of the decoded foreman sequence (frame 13 in the decoded sequence, corresponding to frame 239 in the original sequence) at 64 kbps. In this paper, we have presented new techniques for coding of PEFs. They include preprocessing the 5. REFERENCES K. Shen and E. J. Delp, Wavelet based rate scalable video compression, IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 1, pp. 109-122, February 1999. E. Asbun, P. Salama, K. Shen, and E. J. Delp, Very low bit rate wavelet-based scalable video compression, Proceedings of the IEEE International Conference on Image Processing, pp. 948-952, Chicago, Illinois, October 1998. J. M. Shapiro, Embedded image coding using zerotrees of wavelets coefficients, IEEE Tran- 835

D. L. Donoho and I. M. Johnstone, Ideal spatial adaptation via wavelet shrinkage, Biometrika, vol. 81, no. 3, pp. 425-455, 1994. M. Saenz, P. Salama, K. Shen, and E. J. Delp, An evaluation of color embedded wavelet image compression techniques, SPIE Conference on Visual Communications and Image Processing 99, pp. 282-293, San Jose, California, January 1999. K. Shen and E. J. Delp, Color image compression using an embedded rate scalable approach, Proceedings of the IEEE International Conference on Image Processing, vol. 111, pp. 34-37, Santa Barbara, California, October 1997. International Organization for Standardization, ISO/IEC 13818-2, Information Technology - Generic coding of moving pictures and associated (c) (4 audio information, 1994.-(MPEG-2 Video). Figure 8: A frame in the foreman sequence, decoded at 64 kbps. (a) Original, (b) SAMCo W+, (c) SAMCo W, (d) H.263+. suctions on Signal Processing, vol. 41, no. 12, pp. 3445-3462, December 1993. A. Said and W. A. Pearlman, New, fast, and efficient image codec based on set partitioning in hierarchical trees, IEEE Transactions on Circuits and Systems for Video Technology, vol. 6, no. 3, pp. 243-250, June 1996. Z. Xiong, K. Ramchandran, and M. T. Orchard, Space-frequency quantization for wavelet image coding, IEEE Transactions on Image Processing, vol. 6, no. 5, pp. 677-693, May 1997. M. Wien and W. Niehsen, Space-frequency adaptive coding of motion compensated frame differences, Proceedings of the 1999 Picture Coding Symposium, pp. 65-68, Portland, Oregon, April 1999. G. M. Schuster and A. K. Katsaggelos, A theory for the optimal bit allocation between displacement vector field and displaced frame difference, IEEE Journal on Selected Areas in Communications, vol. 15, no. 9, pp. 1739-1751, December 1997. D. L. Donoho, De-noising by soft-thresholding, IEEE Transactions on Information Theory, vol. 41, no. 3, pp. 613-627, May 1995. [13] International Organization for Standardization, ISO/IEC 14496-2, Information Technology - Coding of Audio- Visual Objects: Video, October 1998. (MPEG-4 Version 1, Part 2: Final Draft International Standard). [14] ITU-T, Draft ITU-T Recommendation H.263 Version 2: Video Coding for Low Bitrate Communication, September 1997. (H.263+). [15] K. Shen and E. J. Delp, A control scheme for a data rate scalable video codec, Proceedings of the IEEE International Conference on Image Processing, vol. 11, pp. 69-72, Lausanne, Switzerland, September 1996. [16] M. L. Comer, K. Shen, and E. J. Delp, Ratescalable video coding using a zerotree wavelet approach, Proceedings of the Ninth Image and Multidimensional Digital Signal Processing Workshop, pp. 162-163, Belize City, Belize, March 1996. [17] X. Zong, A. Laine, and E. Geiser, Speckle reduction and constrast enhancement of echocardiograms via multiscale nonlinear processing, IEEE Transactions on Medical Imaging, vol. 17, no. 4, pp. 532-540, August 1998. [18] S. G. Chang and M. Vetterli, Spatial adaptive wavelet thresholding for image denoising, Proceedings of the IEEE International Conference on Image Processing, vol. 11, pp. 374-377, Santa Barbara, California, October 1997. 836