Performance Comparison of JPEG2000 and H.264/AVC High Profile Intra Frame Coding on HD Video Sequences

Similar documents
Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

Performance evaluation of Motion-JPEG2000 in comparison with H.264/AVC operated in pure intra coding mode

Selective Intra Prediction Mode Decision for H.264/AVC Encoders

Overview: Video Coding Standards

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Chapter 2 Introduction to

Project Proposal: Sub pixel motion estimation for side information generation in Wyner- Ziv decoder.

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

FAST SPATIAL AND TEMPORAL CORRELATION-BASED REFERENCE PICTURE SELECTION

The H.26L Video Coding Project

Video coding standards

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

Fast Mode Decision Algorithm for Intra prediction in H.264/AVC Video Coding

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Multimedia Communications. Image and Video compression

OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Motion Compensation Techniques Adopted In HEVC

Intra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences

Error concealment techniques in H.264 video transmission over wireless networks

Video Over Mobile Networks

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

Variable Block-Size Transforms for H.264/AVC

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

A video signal consists of a time sequence of images. Typical frame rates are 24, 25, 30, 50 and 60 images per seconds.

WITH the rapid development of high-fidelity video services

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

CODING EFFICIENCY IMPROVEMENT FOR SVC BROADCAST IN THE CONTEXT OF THE EMERGING DVB STANDARDIZATION

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

Multimedia Communications. Video compression

Reduced complexity MPEG2 video post-processing for HD display

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

Highly Efficient Video Codec for Entertainment-Quality

INTRA-FRAME WAVELET VIDEO CODING

Implementation of an MPEG Codec on the Tilera TM 64 Processor

AUDIOVISUAL COMMUNICATION

Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding

Speeding up Dirac s Entropy Coder

COMPLEXITY REDUCTION FOR HEVC INTRAFRAME LUMA MODE DECISION USING IMAGE STATISTICS AND NEURAL NETWORKS.

Line-Adaptive Color Transforms for Lossless Frame Memory Compression

HEVC Subjective Video Quality Test Results

The Multistandard Full Hd Video-Codec Engine On Low Power Devices

Conference object, Postprint version This version is available at

Visual Communication at Limited Colour Display Capability

Understanding Compression Technologies for HD and Megapixel Surveillance

Constant Bit Rate for Video Streaming Over Packet Switching Networks

ENCODING OF PREDICTIVE ERROR FRAMES IN RATE SCALABLE VIDEO CODECS USING WAVELET SHRINKAGE. Eduardo Asbun, Paul Salama, and Edward J.

COMPRESSION OF DICOM IMAGES BASED ON WAVELETS AND SPIHT FOR TELEMEDICINE APPLICATIONS

Systematic Lossy Forward Error Protection for Error-Resilient Digital Video Broadcasting

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

Chapter 10 Basic Video Compression Techniques

Bit Rate Control for Video Transmission Over Wireless Networks

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005.

CERIAS Tech Report Preprocessing and Postprocessing Techniques for Encoding Predictive Error Frames in Rate Scalable Video Codecs by E

Chapter 2 Video Coding Standards and Video Formats

H.264/AVC Baseline Profile Decoder Complexity Analysis

Mauricio Álvarez-Mesa ; Chi Ching Chi ; Ben Juurlink ; Valeri George ; Thomas Schierl Parallel video decoding in the emerging HEVC standard

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS

WHITE PAPER. Perspectives and Challenges for HEVC Encoding Solutions. Xavier DUCLOUX, December >>

Video Compression - From Concepts to the H.264/AVC Standard

Distributed Video Coding Using LDPC Codes for Wireless Video

CHROMA CODING IN DISTRIBUTED VIDEO CODING

Motion Video Compression

The H.263+ Video Coding Standard: Complexity and Performance

A High Performance VLSI Architecture with Half Pel and Quarter Pel Interpolation for A Single Frame

WE CONSIDER an enhancement technique for degraded

SCALABLE video coding (SVC) is currently being developed

Error Resilient Video Coding Using Unequally Protected Key Pictures

Film Grain Technology

Advanced Computer Networks

Systematic Lossy Error Protection of Video based on H.264/AVC Redundant Slices

Parameters optimization for a scalable multiple description coding scheme based on spatial subsampling

Project Interim Report

WITH the demand of higher video quality, lower bit

Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding. Abstract. I. Introduction

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Key Techniques of Bit Rate Reduction for H.264 Streams

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

17 October About H.265/HEVC. Things you should know about the new encoding.

EMBEDDED ZEROTREE WAVELET CODING WITH JOINT HUFFMAN AND ARITHMETIC CODING

PERCEPTUAL QUALITY COMPARISON BETWEEN SINGLE-LAYER AND SCALABLE VIDEOS AT THE SAME SPATIAL, TEMPORAL AND AMPLITUDE RESOLUTIONS. Yuanyi Xue, Yao Wang

ABSTRACT ERROR CONCEALMENT TECHNIQUES IN H.264/AVC, FOR VIDEO TRANSMISSION OVER WIRELESS NETWORK. Vineeth Shetty Kolkeri, M.S.

Color Image Compression Using Colorization Based On Coding Technique

SCENE CHANGE ADAPTATION FOR SCALABLE VIDEO CODING

Scalable Lossless High Definition Image Coding on Multicore Platforms

ROBUST ADAPTIVE INTRA REFRESH FOR MULTIVIEW VIDEO

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding

Robust Joint Source-Channel Coding for Image Transmission Over Wireless Channels

SCALABLE EXTENSION OF HEVC USING ENHANCED INTER-LAYER PREDICTION. Thorsten Laude*, Xiaoyu Xiu, Jie Dong, Yuwen He, Yan Ye, Jörn Ostermann*

Advanced Video Processing for Future Multimedia Communication Systems

Comparative Analysis of Wavelet Transform and Wavelet Packet Transform for Image Compression at Decomposition Level 2

THE new video coding standard H.264/AVC [1] significantly

Video 1 Video October 16, 2001

A Novel Macroblock-Level Filtering Upsampling Architecture for H.264/AVC Scalable Extension

Copyright 2005 IEEE. Reprinted from IEEE Transactions on Circuits and Systems for Video Technology, 2005; 15 (6):

Scalable multiple description coding of video sequences

ERROR CONCEALMENT TECHNIQUES IN H.264 VIDEO TRANSMISSION OVER WIRELESS NETWORKS

Transcription:

Performance Comparison of and H.264/AVC High Profile Intra Frame Coding on HD Video Sequences Pankaj Topiwala, Trac Tran, Wei Dai {pankaj, trac, daisy} @ fastvdo.com FastVDO, LLC, Columbia, MD 210 ABSTRACT This paper reconsiders the rate-distortion performance comparison of with H.264/AVC High Profile I frame coding for high definition (HD) video sequences. This work is a follow-on to our paper at SPIE 05 [14], wherein we further optimize both codecs. This also extends a similar earlier study involving H.264/AVC Main Profile [2]. Coding simulations are performed on a set of 720p and 1080p HD video sequences, which have been commonly used for H.264/AVC standardization work. As expected, our experimental results show that H.264/AVC I frame coding offers consistent R-D performance gains (around 0.2 to 1 db in peak signal-to-noise ratio) over color image coding. As in [1, 2], we do not consider scalability, complexity in this study ( is used in non-scalable, but optimal mode). Keywords: H.264, AVC, High Profile,, image coding, video coding. 1. INTRODUCTION H.264, or MPEG-4 Part 10, is an international video coding standard developed by the ITU-T Video Coding Experts Group (VCEG) together with the ISO/IEC Moving Picture Experts Group (MPEG) as the product of a collective partnership effort known as the Joint Video Team (JVT) [3, 4]. H.264 technology, also known as Advanced Video Coding (AVC), is designed to provide good video quality at substantially lower bit rates comparing to previous standards. It is also intended to have a reasonable level of computational complexity and to be flexible enough for a wide range of applications, from broadcasting, DVD storage, teleconferencing, to wireless multimedia communications. On the other hand, is the wavelet-based still image compression standard [5, 6] whose aim is not only to improve coding performance over the original DCT-based JPEG standard [7] but also to add or improve features such as scalability, editability, and lossless coding capability. Although H.264 and are developed for different signals, there are several application areas where they overlap: video applications requiring fast, frequent, and convenient frame access for editting purposes, for instance, Digital Cinema; high-quality high-resolution medical and satellite imaging; video applications requiring real-time simple encoding, etc. There have been several performance comparisons evaluating and H.264/AVC I frame coding [1, 2]. The rate-distortion performance of Motion- and H.264/AVC Main Profile (MP) intra coding was first reported by Marpe et al. in [2]. Using a set of test video sequences with different resolutions, [2] showed that H.264/AVC intra coding has around 0.2 ~ 2 db PSNR gains over for the low and middle resolution sequences, e.g., CIF and ITU-R 601 720x576i (25Hz) sequences. However, in their opinion, has an advantage over H.264/AVC MP for test sequences with very high-resolution content, e.g., 1080p sequences, while for 720p HD sequences, it is reported that both perform virtually at the same level [1, 2]. By contrast, our paper [14] last year at SPIE showed that for 720p, AVC High Profile (HP) holds a clear advantage over, while for 1080p, the results are more even, or a slight advantage for AVC. Recently, it has also been shown that H.264/AVC Fidelity Range Extensions (FRExt) amendment [8] HP provides a major breakthrough in compression efficiency over MP generally. Consistent R-D gains in favor of H.264/AVC FRExt intra coding over for a set of monochrome ISO/IEC images are also reported in [1]. In this paper, we reinvestigate the intra-frame coding performance of H.264/AVC HP in comparison with (Part 1) [5] for six 720p (1280x720-pixel resolution) HD sequences and four 1080p (1920x1080-pixel resolution) HD sequences. The chosen test sequences cover three different classes of still

frames based on their level of spatial contents, ranging from smooth to moderate to high. We confirm that H.264 HP intra coding consistently outperforms in the rate-distortion sense on almost all of our test sequences, especially at high bit-rate high-quality setting. A key difference in this paper over the paper [14] a year ago is that we have further optimized both H.264/AVC HP and. In particular, we have removed the visual weighting in (a default parameter setting), while using some more advanced settings in the AVC HP coding [15-18]. In fact, H.264/AVC HP has so many encoder parameter settings, even for Intra coding, that optimizing it parametrically is itself a fine art. The important test conditions and coding tools used are described in more detail below in the experimental results section. The organization of the paper is as follows. To make this paper self-consistent, we provide a quick overview of the two codecs first in Section 2. Our evaluating methodology including various H.264 and settings and our high-definition test video sequences are discussed in Section 3, followed by our experimental results and discussions of RD performance in Section 4. Finally, Section 5 concludes the paper with a few remarks. 2. DESCRIPTION OF EVALUATING COMPRESSION ALGORITHMS We provide in this section a brief description of three image compression algorithms under common performance evaluation:, H.264/AVC Main Profile intra-frame coding, and H.264/AVC High Profile FRExt intra-frame coding. All three compression schemes are based on the classic three-stage transform-coding paradigm, consisting of a signal decomposition stage, followed by uniform scalar quantization, and context-adaptive entropy coding. Our tests dispense with the Main Profile (MP) in favor of HP, since HP is a superset of MP, and in fact superior. We discuss MP mainly to illustrate HP better. 2.1. Unlike its predecessor JPEG [7], which is based on the 8x8 block DCT decomposition, relies on the wavelet transform as its main de-correlation engine. This multi-resolution transform with lengthvarying basis functions decomposes an input image into wavelet coefficients grouped by sub-bands, representing different spatial-frequency components. The set of resulting wavelet coefficients are furthers split into small coding units called code-blocks, which are independently processed by a coding scheme called Embedded Bitplane Coding with Optimal Truncation (EBCOT) followed by adaptive context-based binary arithmetic coding. has a few distinctive features that we did not enable in this evaluation. Most notable is the scalability feature, allowing one to extract different regions, components, images of different fidelities and/or spatial resolutions out of one single compressed bit-stream. The drawback of scalability is its adverse effect on rate-distortion performance. To max its performance, our comparisons are conducted with the non-scalable single-layer mode. Another feature that we also elect to disable is the tiling mode, which partitions the input image into non-overlapped rectangular tiles to be encoded independently. The tiling feature, intended for lower-complexity and parallel processing, also most likely lowers R-D performance. 2.2. H.264/AVC Main Profile Intra-Frame Coding Both based on the transform-coding paradigm, the main difference between H.264/AVC Main Profile intra coding and is at the transformation stage. Other differences in the quantization and entropy coding stage are dictated by the characteristics of the produced transform coefficients. While employs the global wavelet transform (tiling is its only option for image partitioning), H.264 follows the block coding philosophy, which is more in line of the block-translational motion model employed in its inter-frame coding framework. Unlike all of its video coding standard predecessors, H.264 s transform block size is reduced from 8x8 to 4x4. As a pre-processing step, H.264 relies on spatial prediction using neighboring pixels from previously encoded blocks to take advantage of inter-block spatial correlation. The residual prediction error is de-correlated by a 4x4 low-complexity multiplier-less integer transform that approximates the original 4x4 DCT well but can be implemented in 16-bit fixed-point architectures. The DC coefficients of neighboring blocks are collected into 4x4 blocks and then further processed using the same 4x4 integer transform (2x2 blocks and 2x2 Hadamard transform are used in the chrominance space). The combination of spatial prediction and the wavelet-like 2-level transform iteration has proven to be very effective in

smooth image regions one reason why H.264 can stay competitive with in high-resolution high-quality applications whereas the block-coding based JPEG is not. This H.264 R-D performance result is rather consistent with a few recent reports that the block DCT coding framework can be very competitive with the global wavelet coding framework if inter-block correlation is properly taken into account coupled with appropriately designed context-adaptive entropy coding [9, 10, 11]. After transformation, the H.264 transform coefficients are scalar quantized, zig-zag scanned, and entropy coded by Context-based Adaptive Binary Arithmetic Coding (CABAC). Another entropy coding choice that provides a faster simpler implementation but sacrifices some coding efficiency is called Context-Adaptive Variable-Length Coding (CAVLC), switching from different VLC tables designed from exponential-golomb codes based on locally available contexts collected from neighboring blocks. 2.3. H.264/AVC FRExt High Profile Intra-Frame Coding The JVT completed the development of some extensions to the original H.264 standard in July, 2003. The resulting codec is known as H.264 Fidelity Range Extensions (FRExt), also known as the High Profile [8]. Amongst the extensions as expected from the naming are support for higher-fidelity video pixel resolution (including 10-bit and 12-bit video samples) and support for higher-resolution color spaces such as YUV 4:2:2 and YUV 4:4:4. The main FRExt feature that improves coding efficiency (our top criterion in this paper) is the addition of the 8x8 integer transform another DCT approximation and all coding modes as well as prediction schemes associated with the adaptive selection between the 4 4 and 8 8 integer transforms. The addition of the larger block size of 8x8 is critical in high-resolution high bit-rate applications as shown in later sections. 3. EVALUATING METHODOLOGY 3.1. Video Test Sequences In our performance evaluation, we select six progressive-scan high-definition video sequences (60Hz) at 720p resolution and four at 1080p resolution. All of the test sequences are in the YUV 4:2:0 color format where two chrominance components (U, V) are down-sampled by a factor of two in each spatial dimension. These six 720p sequences can be grouped into three different categories according to their different spatial contents: Smooth spatial details: Jets (first 60 frames) and ShuttleStart (first 60 frames) Moderate spatial details: BigShip (first 60 frames) and Crew (first 60 frames) High spatial details: City (first 60 frames) and Harbour (first 120 frames). Since we are conducting I-frame coding, 60 frames is quite adequate to establish the trend. The only exception is Harbour where we consider the first 120 frames. Both 1080p video sequences Hollywood and Kungfu can be classified as having moderate spatial details: a typical landscape panning sequence of the famous Hollywood hill and a fast-action Kungfu sequence with relatively smooth, stable background. 1080p video sequences Duck and Crowd can be classified as having high spatial details. All sequences are available on our web site at www.fastvdo.com. 3.2. Codec Settings In our coding experiments, we use publicly available software implementations of H.264/AVC and. The latest release of the reference software (JM 11.0) is used for H.264/AVC encoder, and each frame of the test sequences is coded in the I frame mode. For coding, D. Taubman s "Kakadu" (version 2.2) software [6] is used to code each frame to reach the target bit rates. Note that we turned off the visual weighting. The configuration of the H.264/AVC JM encoder [4] is chosen as follows: 8x8 transform mode: enabled, allowing adaptive choice between 4x4/8x8 transform and all associated prediction modes CABAC: enabled R-D optimization: enabled De-blocking filter: enabled AdaptiveRounding: 1, AdaptRndPeriod: 1, AdaptRndChroma: 1, AdaptRndWFactorIRef: 8, AdaptRndWFactorINRef : 8, OffsetMatrixPresentFlag: 0

DisableThresholding: 1 Log2MaxFNumMinus4: 0 Log2MaxPOCLsbMinus4: 0 ReferenceReorder: 0 PocMemoryManagement: 0 PicOrderCntType: 2 Q_Matrix: disabled, ScalingMatrixPresentFlag: 0 1 The "Kakadu" encoder [6] is driven in default mode (except visual weighting is turned off): One tile per frame (no tiling); some limited results with 4 tiles for comparison. 9/7-tap biorthogonal Daubechies wavelet filters (default floating-point transform) 5 levels of wavelet decomposition Single-layer mode (no scalability option) Code-block size of 64x64 wavelet coefficients EBCOT encoding scheme R-D optimization for a given target bit rate. No_weights on (in particular, visual weighting is off) 3.3. Evaluating Criteria To compare the objective performance, we illustrate the curves of average PSNR values of the luminance and chrominance components over all encoded frames versus the final bit rate. For each experiment, H.264/AVC FRExt codes each frame in I-frame mode with one fixed quantization step size. Also, the quantization values for luminance and chrominance components are the same (which is the default mode of HP profile). For, we code each frame with the target bit rate derived from the set total bit rates, frame rate, and sequence resolution. Since this experiment concerns high-bit-rate scenarios, we choose the test psnr points to cover the range between 30 ~ 55dB. 4. EXPERIMENTAL RESULTS Fig. 1 to Fig. 4 depicts the rate-distortion curves as the outcome of our coding experiments for each test sequence. Overall, H.264/AVC FRExt has an average 0.5dB PSNR gain over in the Luminance component and it also works well with the Chrominance components for all three types of HD sequences. The typical result comes from the ShuttleStart sequence, where H264 FRExt has a consistent 0.5+ db gain over as shown in Fig. 2. From Fig. 1 ~ Fig.2 and Fig. 4 ~ Fig.6, we can observe that H264/AVC FRExt has higher gains in all three components than at high bit rates. The PSNR gains reduce while the bit rates decreases. At the high end, we generally get a clear differentiation of performance in favor of H.264/AVC HP, with different asymptotic behavior. That asymptotic difference we attribute to recent encoder optimizations, mainly to the adaptive quantization method of [15-18]. 1 In the experiments, we find that enabling the scaling quantization matrix in FRExt (i.e., ScalingMatrixPresentFlag=1 which is the default setting in FRExt configuration file) severely lowers the coding efficiency, especially at the high bit-rate setting.

Jets(720p,60frames) Jets(720p,60frames) 0 20 60 80 100 120 1 160.5.5.5.5.5.5.5.5.5.5 0 20 60 80 100 120 1 160 ShuttleStart(720p,60frames).5.5.5.5.5.5.5.5.5.5 0 10 20 30 60 70 80 90 Jets(720p,60frames) ShuttleStart(720p,60frames) 55 0 20 60 80 100 120 1 160 56 55.5 55.5.5.5.5.5.5.5.5.5 0 10 20 30 60 70 80 90 Jets(720p,60frames) ShuttleStart(720p,60frames) 56 55 0 20 60 80 100 120 1 160.5.5.5.5.5.5.5 0 10 20 30 60 70 80 90 Figure 1: R-D curves for three components of the smooth spatial content 720p Jets sequence (left) and the ShuttleStart sequence (right) comparing H.264/AVC FRExt intra coding and.

BigShips(720p,60frames) City(720p,60frames).5.5.5.5.5.5.5.5.5.5.5.5.5 60 80 100 120 1 160 180 200 220.5.5.5.5.5.5.5.5.5.5 70 90 110 130 1 BigShips(720p,60frames) City(720p,60frames).5.5.5.5.5.5.5.5.5.5.5.5.5.5.5 0 20 60 80 100 120 1 160 180 200 220.5.5.5.5.5.5.5.5.5.5.5.5.5 0 20 60 80 100 120 1 160 180 BigShips(720p,60frames) City(720p,60frames).5.5.5.5.5.5.5.5.5.5.5.5.5.5 0 20 60 80 100 120 1 160 180 200 220.5.5.5.5.5.5.5.5.5.5.5.5 0 20 60 80 100 120 1 160 180 Figure 2: R-D curves for three components of the moderate spatial content 720p BigShip sequence and the City sequence comparing H.264/AVC High Profile FRExt intra coding and.

Hollywood(1080p,30frames) Kungfu(1080p,30frames).5.5.5.5.5.5.5.5.5.5.5.5 0 20 60 80 100 120 1 160.5.5.5.5.5.5.5.5.5.5.5.5.5 0 10 20 30 60 70 80 Hollywood(1080p,30frames) Kungfu(1080p,30frames).5.5.5.5.5.5.5.5.5 0 100 1 200 2 300.5.5.5.5.5.5.5.5.5 0 10 20 30 60 70 80 Hollywood(1080p,30frames) Kungfu(1080p,30frames).5.5.5.5.5.5.5.5 0 100 1 200 2 300.5.5.5.5.5.5.5.5 0 10 20 30 60 70 80 Figure 3: R-D curves for three components of the 1080p Hollywood sequence (left) and the Kungfu sequence (right) comparing H.264/AVC High Profile FRExt intra coding and.

Duck(1080p,30frames) Crowd(1080p,30frames).5.5.5.5.5.5.5.5.5.5.5.5 120 1 160 180 200 220 2 260 280 300 320 3.5.5.5.5.5.5.5.5.5.5.5.5 120 1 160 180 200 220 2 260 280 300 320 3 Duck(1080p,30frames) Crowd(1080p,30frames).5.5.5.5.5.5.5.5.5.5.5.5 120 1 160 180 200 220 2 260 280 300 320 3.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5 35.5.5 35 33.5 32.5 33 31.5 32 31 0 100 1 200 2 300 3 Duck(1080p,30frames) Crowd(1080p,30frames).5.5.5.5.5.5.5.5.5.5.5.5 1 160 180 200 220 2 260 280 300 320 3.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5.5 35.5.5 35 33.5 32.5 33 32 0 100 1 200 2 300 3 Figure 4: R-D curves for three components of the 1080p Duck sequence (left) and the Crowd sequence (right) comparing H.264/AVC High Profile FRExt intra coding and.

Duck(1080p,60frames) Crowd(1080p,60frames) JPEK2000 1TILE 4 TILES JPEK2000 1TILE 4 TILES 0 100 200 300 0 32 0 100 1 200 2 300 3 Duck(1080p,60frames) Crowd(1080p,60frames) 1 TILE 4 TILES 32 0 100 1 200 2 300 3 0 35 1 TILE 33 4 TILES 31 0 100 1 200 2 300 3 Duck(1080p,60frames) Crowd(1080p,60frames) 1 TILE 4 TILES 1 TILE 4 TILES 0 100 1 200 2 300 3 0 32 0 100 1 200 2 300 3 Figure 5: R-D curves for three components of the 1080p Duck sequence (left) and the Crowd sequence (right) comparing H.264/AVC High Profile FRExt intra coding and. 5. CONCLUSION This comparative study points out the objective RD-performance superiority of the latest H.264/AVC FRExt I-frame coding scheme of the High Profile over the international still image compression standard in high-resolution high-bit-rate video coding applications where fast and convenient frame access is of highest priority. Along with benchmarks in [1, 2], on our experiment again confirms that as far as R-D performance is concerned, H.264/AVC FRExt is currently the leader not only in video compression but also in still image compression as well. At the 720p resolution, H.264 consistently offers significant

improvement in peak signal-to-noise ratio, especially in the luminance component. In fact, we even notice an asymptotic difference in slopes between the two codecs (e.g., with the Jets sequence, figure 1). The asymptotic difference, we believe, is mainly due to the use of adaptive quantization offset [15], in which the dequantization is more accurately near the centroid of a quantization bin than in the default settings (wherein a constant rather than computed offset from the center of the bin is used), as motivated by RD optimization. This advance, included in the AVC standard, is not currently available in. In fact, the concept of [15] may not be properly implemented in the chroma channels yet, and there may be room for further improvements. In any case, there is little bitrate allocated to chroma channels, and differences in chroma psnr values are hard to observe. And at the 1080p resolution, contrasting to popular conceptions and results from previous coding experiments using H.264 Main-Profile [1, 2], our study points out that H.264 FRExt High-Profile I-frame coding algorithm is at least very competitive with in the rate distortion sense. Furthermore, we remark that modern codecs, especially when used in high quality, high bandwidth applications, are more memory constrained rather than processor constrained, especially memory bandwidth. In this context, since the wavelet transform is a global transform (whereas AVC uses block transforms), the memory bandwidth requirements of far exceed those of AVC. When tiling is used in to constrain memory bandwidth (e.g., for 128x128 tiles, 1080p would have 16x8=128 tiles), H.264 HP may in fact be superior, even at 1080p or higher. In this paper, we tested only up to 4 tiles, and noted no significant variation in performance. In future papers, we will conduct more realistic tiling tests. In figures 6-9, we reprint our results from the previous SPIE 05 paper [14], for comparison. In fact, in that paper, the parametric settings for neither the AVC HP nor the codec were fully optimized. AVC HP did not have adaptive quantization offset turned on, for example, which is responsible for the asymptotic difference at high bit rate between HP and. Also, we had one default setting for, which included visual-weighting. However, note that even then, visual weighting turns off automatically when the bitrate exceeds one bit per sample, the threshold for which is at about 55 Mb/s. Thus, in retrospect, we can now be assured that the results were valid, since there is little difference between the new results and the previous ones. 6. ACKNOWLEDGEMENTS We thank Gary Sullivan for useful comments and suggestions regarding optimizing AVC encoding, and especially in the use of the adaptive quantization offset method of [15-18].

REFERENCES [1] D. Marpe, V. George, and T. Wiegand, Performance comparison of intra-only H.264/AVC HP and for a set of monochrome ISO/IEC test images, JVT-M014, 18-22 Oct., 2004. [2] D. Marpe, V. George, H. L. Cycon, and K. U. Barthel, Performance Evaluation of Motion- in Comparison with H.264 / AVC Operated in Intra Coding Mode, Proc. SPIE, Vol. 66, pp. 129-1, Feb. 2004. [3] ITU-T Recommendation H.264 and ISO/IEC 196-10 MPEG-4 Part 10, Advanced Video Coding (AVC), 2003. [4] I. E. G. Richardson, H.264 and MPEG-4 Video Compression, John Wiley & Sons, Sep. 2003. [5] ISO/IEC 1-3, Motion- ( Part 3), 2002. [6] D. S. Taubman and M. W. Marcellin, : Image Compression Fundamentals, standards, and Practice, Kluwer Academic Publishers, 2001. [7] W. B. Pennebaker and J. L. Mitchell, JPEG: Still Image Data Compression Standard, Kluwer Academic, Jan. 1993. [8] G. J. Sullivan, P. N. Topiwala, and A. Luthra, The H.264/AVC Advanced Video Coding standard: overview and introduction to the Fidelity Range Extensions, Proc. SPIE, Aug. 2004. [9] H. S. Malvar, Fast progressive image coding without wavelets, IEEE Data Compression Conference, Snowbird, UT, pp. 2-2, Mar. 2000. [10] C. Tu and T. D. Tran, Context based entropy coding of block transform coefficients for image compression, IEEE Trans. on Image Processing, vol. 11, pp. 1271-1283, Nov. 2002. [11] W. Dai, L. Liu, and T. D. Tran, Adaptive block-based image coding with pre-/post-filtering, IEEE Data Compression Conference, pp. 73-82, Snowbird, UT, Mar. 2005. [12] G. Sullivan et al., Draft of Version 3 of H.264/AVC (ITU-T Recommendation H.264 and ISO/IEC 196-10 (MPEG-4 part 10) Advanced Video Coding), Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG Doc. JVT-L0, 12th Meeting: Redmond, WA, USA, July, 2004. [13] ITU-T Rec. T.800 and ISO/IEC 1-1, Image Coding System: Core Coding System ( Part 1), 2000. [14] P. Topiwala, Comparative Study of and H.264/AVC FRExt I-Frame Coding on High Definition Video Sequences, Proc. SPIE Int l Symposium, Dig. Im. Proc., San Diego, Aug., 2005. [15] G. J. Sullivan, Adaptive quantization encoding technique using an equal expected-value rule, JVT- N011, 14 th JVT Meeting, Hong Kong, January, 2005. [16] G. J. Sullivan, One page 12-point font summary of good scalar quantization for symmetric pdfsources, JVT-O014, 15 th JVT Meeting, Busan, Korea, April, 2005. [17] G. J. Sullivan, Efficient Scalar Quantization of Exponential and Laplacian Random Variables, IEEE Transactions on Information Theory, Vol., pp. 15-14, Sept. 1996. [18] G. J. Sullivan, Adaptive quantization yielding 10dB gain, JVT-P015, 16 th JVT Meeting, Poznan, PO, July, 2005.

Jets (720p, 60 frames).5 ShuttleStart (720p, 60 frames) 25 35 55 65 75 85.5.5.5.5.5.5 25 35 55 65 75 Jets (720p, 60 frames) 55 ShuttleStart (720p, 60 frames).5.5.5 25 35 55 65 75 85.5.5.5.5 25 35 55 65 75 Jets (720p, 60 frames).5 ShuttleStart (720p, 60 frames).5.5.5 25 35 55 65 75 85.5.5 25 35 55 65 75 Figure 6: R-D curves for three components of the smooth spatial content 720p Jets sequence (left) and the ShuttleStart sequence (right) comparing H.264/AVC FRExt intra coding and. This figure is quoted from [14].

BigShip (720p, 60 frames) 20 30 60 70 80 90 ) Y - PSNR (db Crew (720p, 60 frames) 5 15 25 35 BigShip (720p, 60 frames).5.5.5.5.5.5 20 30 60 70 80 90 ) U - PSNR (db.5.5.5.5 Crew (720p, 60 frames) 5 15 25 35.5.5.5.5.5.5 BigShip (720p, 60 frames) 20 30 60 70 80 90.5.5.5.5.5 Crew (720p, 60 frames) 5 15 25 35 Figure 7: R-D curves for three components of the moderate spatial content 720p BigShip sequence and the Crew sequence comparing H.264/AVC High Profile FRExt intra coding and. This figure is quoted from [14].

35 33 City (720p, 60 frames) 20 30 60 70 80 Harbour (720p, 120 frames) 35 25 65 85.5.5.5.5.5.5 City (720p, 60 frames) 20 30 60 70 80 Harbour (720p, 120 frames) 25 65 85.5.5.5.5.5 City (720p, 60 frames) 20 60 80 V-PSNR (db) Harbour (720p, 120 frames) 25 65 85 Figure 8: R-D curves for three components of the high spatial content 720p City sequence and the Harbour sequence comparing H.264/AVC High Profile FRExt intra coding and. This figure is quoted from [14].

Y- PSNR 35 Hollywood (1080p, 60 frames) H.264 Frext 0 20 60 80 100 Y-PSNR Kungfu (1080p, 60 frames) H.264 Frext 0 20 60 80 U- PSNR Hollyw ood (1080p, 60 frames) H.264 Frext 0 20 60 80 100 R U- PSN Kungfu (1080p, 60 frames) H.264 Frext 0 20 60 80 Hollywood (1080p, 60 frames) Kungfu (1080p, 60 frames) V- PSNR H.264 Frext 0 20 60 80 100 V- PSNR H.264 Frext 0 20 60 80 Figure 9: R-D curves for three components of the 1080p Hollywood sequence (left) and the Kungfu sequence (right) comparing H.264/AVC High Profile FRExt intra coding and. This figure is quoted from [14].