A joint source channel coding strategy for video transmission

Similar documents
Embedding Multilevel Image Encryption in the LAR Codec

On viewing distance and visual quality assessment in the age of Ultra High Definition TV

Video Over Mobile Networks

No title. Matthieu Arzel, Fabrice Seguin, Cyril Lahuec, Michel Jezequel. HAL Id: hal

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

INTRA-FRAME WAVELET VIDEO CODING

Masking effects in vertical whole body vibrations

Video Transmission. Thomas Wiegand: Digital Image Communication Video Transmission 1. Transmission of Hybrid Coded Video. Channel Encoder.

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

QUEUES IN CINEMAS. Mehri Houda, Djemal Taoufik. Mehri Houda, Djemal Taoufik. QUEUES IN CINEMAS. 47 pages <hal >

OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding

Minimax Disappointment Video Broadcasting

Compte-rendu : Patrick Dunleavy, Authoring a PhD. How to Plan, Draft, Write and Finish a Doctoral Thesis or Dissertation, 2007

Influence of lexical markers on the production of contextual factors inducing irony

Learning Geometry and Music through Computer-aided Music Analysis and Composition: A Pedagogical Approach

From SD to HD television: effects of H.264 distortions versus display size on quality of experience

Reply to Romero and Soria

Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices

Comparative Analysis of Wavelet Transform and Wavelet Packet Transform for Image Compression at Decomposition Level 2

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

A new conservation treatment for strengthening and deacidification of paper using polysiloxane networks

Robust Joint Source-Channel Coding for Image Transmission Over Wireless Channels

Error-Resilience Video Transcoding for Wireless Communications

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Error Resilient Video Coding Using Unequally Protected Key Pictures

Video coding standards

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

Interactive Collaborative Books

Robust Transmission of H.264/AVC Video using 64-QAM and unequal error protection

Bit Rate Control for Video Transmission Over Wireless Networks

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

Multimedia Communications. Image and Video compression

Sound quality in railstation : users perceptions and predictability

ENCODING OF PREDICTIVE ERROR FRAMES IN RATE SCALABLE VIDEO CODECS USING WAVELET SHRINKAGE. Eduardo Asbun, Paul Salama, and Edward J.

On the Citation Advantage of linking to data

ROBUST IMAGE AND VIDEO CODING WITH ADAPTIVE RATE CONTROL

Chapter 10 Basic Video Compression Techniques

Scalable Foveated Visual Information Coding and Communications

REBUILDING OF AN ORCHESTRA REHEARSAL ROOM: COMPARISON BETWEEN OBJECTIVE AND PERCEPTIVE MEASUREMENTS FOR ROOM ACOUSTIC PREDICTIONS

Robust Transmission of H.264/AVC Video Using 64-QAM and Unequal Error Protection

Error Concealment for SNR Scalable Video Coding

176 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 13, NO. 2, FEBRUARY 2003

PaperTonnetz: Supporting Music Composition with Interactive Paper

Workshop on Narrative Empathy - When the first person becomes secondary : empathy and embedded narrative

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005.

Color Quantization of Compressed Video Sequences. Wan-Fung Cheung, and Yuk-Hee Chan, Member, IEEE 1 CSVT

SCALABLE video coding (SVC) is currently being developed

THE popularity of multimedia applications demands support

Adaptive Key Frame Selection for Efficient Video Coding

A PRELIMINARY STUDY ON THE INFLUENCE OF ROOM ACOUSTICS ON PIANO PERFORMANCE

Parameters optimization for a scalable multiple description coding scheme based on spatial subsampling

The Brassiness Potential of Chromatic Instruments

Digital Video Telemetry System

Improvement of MPEG-2 Compression by Position-Dependent Encoding

Constant Bit Rate for Video Streaming Over Packet Switching Networks

Laurent Romary. To cite this version: HAL Id: hal

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

Open access publishing and peer reviews : new models

Artefacts as a Cultural and Collaborative Probe in Interaction Design

AN UNEQUAL ERROR PROTECTION SCHEME FOR MULTIPLE INPUT MULTIPLE OUTPUT SYSTEMS. M. Farooq Sabir, Robert W. Heath and Alan C. Bovik

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Motion blur estimation on LCDs

Unequal Error Protection Codes for Wavelet Image Transmission over W-CDMA, AWGN and Rayleigh Fading Channels

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding

Robust 3-D Video System Based on Modified Prediction Coding and Adaptive Selection Mode Error Concealment Algorithm

Performance Improvement of AMBE 3600 bps Vocoder with Improved FEC

A study of the influence of room acoustics on piano performance

Hierarchical SNR Scalable Video Coding with Adaptive Quantization for Reduced Drift Error

AUDIOVISUAL COMMUNICATION

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Primo. Michael Cotta-Schønberg. To cite this version: HAL Id: hprints

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Regularity and irregularity in wind instruments with toneholes or bells

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

La convergence des acteurs de l opposition égyptienne autour des notions de société civile et de démocratie

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

Multimedia Communication Systems 1 MULTIMEDIA SIGNAL CODING AND TRANSMISSION DR. AFSHIN EBRAHIMI

DICOM medical image watermarking of ECG signals using EZW algorithm. A. Kannammal* and S. Subha Rani

Natural and warm? A critical perspective on a feminine and ecological aesthetics in architecture

CERIAS Tech Report Preprocessing and Postprocessing Techniques for Encoding Predictive Error Frames in Rate Scalable Video Codecs by E

Chapter 2. Advanced Telecommunications and Signal Processing Program. E. Galarza, Raynard O. Hinds, Eric C. Reed, Lon E. Sun-

HEVC: Future Video Encoding Landscape

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

Understanding Compression Technologies for HD and Megapixel Surveillance

Error concealment techniques in H.264 video transmission over wireless networks

INFORMATION THEORY INSPIRED VIDEO CODING METHODS : TRUTH IS SOMETIMES BETTER THAN FICTION

Error prevention and concealment for scalable video coding with dual-priority transmission q

Analysis of Video Transmission over Lossy Channels

COMP 9519: Tutorial 1

ABSTRACT ERROR CONCEALMENT TECHNIQUES IN H.264/AVC, FOR VIDEO TRANSMISSION OVER WIRELESS NETWORK. Vineeth Shetty Kolkeri, M.S.

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

Enabling Error-Resilient Internet Broadcasting using Motion Compensated Spatial Partitioning and Packet FEC for the Dirac Video Codec

PAPER Wireless Multi-view Video Streaming with Subcarrier Allocation

Scalable multiple description coding of video sequences

3D MR Image Compression Techniques based on Decimated Wavelet Thresholding Scheme

DWT Based-Video Compression Using (4SS) Matching Algorithm

Dual frame motion compensation for a rate switching network

FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS

Transcription:

A joint source channel coding strategy for video transmission Clency Perrine, Christian Chatellier, Shan Wang, Christian Olivier To cite this version: Clency Perrine, Christian Chatellier, Shan Wang, Christian Olivier. A joint source channel coding strategy for video transmission. A joint source channel coding strategy for video transmission, Apr 2008, Damascus, Syria. pp.5, 2008. <hal-00348732> HAL Id: hal-00348732 https://hal.archives-ouvertes.fr/hal-00348732 Submitted on 21 Dec 2008 HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

A joint source channel coding strategy for video transmission C. PERRINE, C. CHATELLLIER, S. WANG and C. OLIVIER Laboratoire XLIM-SIC, Université de Poitiers Téléport 2, Bvd M. et P. Curie, BP 30179, F-86962 Futuroscope-Chasseneuil cedex Tel : +33 5 49 49 74 41 clency.perrine@sic.sp2mi.univ-poitiers.fr Abstract - This paper presents a joint source channel coding designed for video transmission application. The aim is to improve the visual quality of the reconstructed video even when transmission errors occur while keeping the processing as simple as possible. The BER excepted is higher than 10-4. This coding is based on an association of a wavelet transform (WT) and a vector quantization (VQ) optimally mapped on a QAM-M modulation. The whole of the transmission chain is jointly exploited in order to make it well-adapted to the complex and low data rate channel. Keywords : Joint source channel coding, vector quantization, video transmission, GOP. I. INTRODUCTION Nowadays, most of telecommunication systems allow transmitting video in real time. Furthermore, the quality of the received video is an important parameter for the Quality of Service (QoS) supplied by the providers. This quality depends on the compression algorithm, the channel coding strategy and the transmission conditions in the wireless channel. The latest norms as H.264 or MPEG- 4 [1] [2] have shown good performances related to the compression rate (optimized motion vector coding) and the scalability aspects. However, when errors occur over the wireless channel, video remains sensitive. The aim of this paper is to present a joint source channel coding strategy based on a wavelet transform increasing the robustness of the transmission chain by reducing the impact of transmission errors (Bit Error Rate BER > 10-4 ) on the quality of restored images while contributing a good rate-distorsion. Wavelet-based coding is promising for efficient and scalable compression of video [7]. The real-time application constraint is taken into account insomuch as the complexity is reduced. The video coding we suggest in this article is based on a vector quantization [3] for each sub-band of wavelet coefficients followed by an optimally QAM (16 or 256) mapped strategy. This video coding is named (Wavelet Transform Self Organized Map) video. This algorithm based on fixed length coding has the advantage in opposition to the variable length coding (entropy coding) to be more robust regarding the transmission error but obviously with a lower compression rate. The application expected in this study is videoconference which means limited scene change and low data rate channel. In section II, the video coding method is described, resorting to strategy for fixed images [4] [10]. It is shown its adequacy with a digital QAM-256 modulation and the high performances related to the High BER. We propose, as for MPEG-4, to use the Group Of Pictures (GOP) layer containing 12 successive frames but from wavelet transform coefficients. In section III, several transmission results through AWGN channels are presented involving the video coding. The BER expected is higher than 10-4. This video coding is more efficient in terms of robustness than in terms of compression aspects. II. VIDEO PRINCIPLE After showing the difficulties to transmit video sequences when BER is high, the principle of for fixed images is described [4] [10] and an extension to the video applications is detailed. A. Issue An example of video transmission is given in the figure 1. The standard of compression is MPEG4. It is transmitted through the AWGN channel using à QAM-16 modulation. The BER is closed to 10-4. We can notice that

MPEG-4 encoding is extremely sensitive to errors due to the Variable Length Codes (VLC). (a) Original sequence Susi_025 Susi_026 Susi_027 The second stage is to apply a vector quantization on the remaining subbands of the wavelet decomposition. To do this, we use Kohonen s Self-Organizing Map (SOM) algorithm, because of its interesting property of autoorganization: close vectors in terms of Euclidian distance correspond to close indices in terms of location in the selforganized codebook. This method is named coding [5]. It requires 5 codebooks known by the transmitter and the receiver. The number and the vector shape depend on the subbands, the compression rate and the restored image quality expected. For our example in the figure 2, each codebook is composed of 256 vectors V i for which the size is all the more important since the subband is more informative. The compression rate for the quantization step is 4,4 [11] with n =m = 512 (see table 1) : 64 64 3 + 128 128 2 T = 4,4 (2) c 64 64 2 + 128 128 /8 (b) MPEG-4, BER = 1,9.10-4, QAM-16, PSNR = 15.84 db Figure 1. Transmission through ideal channel (without errors) and through AWGN channel: (a) original sequence (b) MPEG-4, BER = 1,9.10-4 In order to avoid this phenomenon, a source coding based on vector quantization is used which has the particularity to be a fixed length coding and well adapted to loss data compression. Before presenting the algorithm for video, we suggest to give a brief description of algorithm for fixed images [10]. Sub-band Original size Size after quantization LL 3 n/8 x m/8 n/8 x m/8 HL 3 n/8 x m/8 (n/8 x m/8)/2 LH 3 n/8 x m/8 (n/8 x m/8)/2 HL 2 n/4 x m/4 (n/4 x m/4)/16 LH 2 n/4 x m/4 (n/4 x m/4)/16 Table 1 : Size of the entire data before and after quantization The global compression rate is 25,6 (5,88 x 4,4). B. algorithm for fixed images In our application, we use the Daubechies (9/7) biorthogonal wavelet such as in the JPEG 2000 case. The wavelet decomposition is applied at scale three and we preserve the five most significant sub images (LL 3, HL 3, LH 3, HL 2 et LH 2 ) and discard the less important subbands. This choice is a good compromise between compression rate and visual quality of restored images. It is strengthened by an information entropy study for the 10 subbands related to our image sequence training basis. The size of the entire data before and after quantization is given in table 1. The compression rate for this first step is 5,88 for any n x m image [11] : n m T c = 5,88 ( n / 8 m / 8) 3 + ( n / 8 m / 8) 2 (1) Figure 2. video method : VQ principle for 1 plan. 1) Robustness against transmission errors Only the indexes of the codebook s vectors are transmitted through the channel using the right digital modulation. The size of the codebook is well-justified by the QAM-256 or QAM-16 modulation (figure 3). The codebook is optimally mapped on high spectral efficiency modulation and Kohonen topological maps (figure 3). They are more appropriate to QAM modulation because the layout of the codebook elements on the digital modulation points (in red) minimizes the average distortion during the transmission of codebook elements.

The robustness of the scheme suggested in this article is justified by the interdependence between the video coding and the QAM modulation. - the P-frame (predictive coded picture) contains difference information from the preceding I or P- frame - the B-frame (bidirectionnally predictive coded picture) contains difference information from preceding and/or I or P-frame The figure 4 illustrates the GOP structure with 13 frames. I B B P I Figure 3. Codebook superimposing to QAM- 256 modulation I 1 I 2 I 3 I 13 Figure 4. GOP structure with 13 frames In the figure 3, a 256-QAM has been used but in order to minimize the BER a 16-QAM modulation can be used. In this case, the optimal mapping of a 256-vector codebook is no longer possible. In order to solve this problem, Pyndiah [9] suggests reorganizing the codebook differently, that is to say using a four-dimension codebook (4x4x4x4) suitable for 4 bits (16-QAM). The results are not as good (1,5 db loss in PSNR) as in the 256-QAM case but the system remains robust. C. video In this section, video (based on for fixed images) is presented. This algorithm allows us to code image sequences by introducing GOP technique in order to increase the compression rate while keeping a good visual quality taking inspiration from the latest video standards. These techniques are applied to wavelet subband and a quantization. 1) Principle We adopt the same strategy as they use in MPEG-4 standard, that is to say based on GOP technique using 12 successive images. Our method requires the construction of 2x5 codebooks obtained with differential video references : Mom-daughter, Claire, Missa, Grand-mom. For the codebooks, each vector has variable size as mentioned in section 2.2. The method we use based on GOP technique is performed from DWT sub-band. In order to exploit the temporal redundancy GOP scheme is defined as follows: - the I-frame (intra coded picture) reference picture corresponds to a fixed image and is independent of other picture types The reference pictures (I-frame) are transformed related to the DWT algorithm. These pictures are encoded according to the algorithm for fixed images as mentioned before. These pictures will serve as reference to calculate P and B-frame and for decoding. The P- frames are predictively coded. A motion compensation of preceding pictures is done so that this image prediction could be realized. The B-frames are bidirectionally coded. Motion-compensated prediction is obtained with preceding and succeeding pictures. Differential wavelet transform coefficients are calculated for each subband and between these images. Dif 1, Dif 2, Dif 3,, will be the reference vectors. Thus, we are able to construct 5 codebooks for reference images and from difference vectors we are able to construct the new difference codebooks. The difference equations are the following: Dif 1 = (I 4 ) (I 1 ) Dif 2 = (I 2 ) [(I 1 ) + 1/3Dif 1 ] Dif 3 = (I 3 ) [(I 4 ) 1/3Dif 1 ] Dif 4 = (I 7 ) (I 4 ) Dif 5 = (I 5 ) [(I 4 ) + 1/3Dif 1 ] Dif 6 = (I 6 ) [(I 7 ) 1/3Dif 1 ] Dif 7 = (I 10 ) (I 7 ) Dif 8 = (I 8 ) [(I 7 ) + 1/3Dif 1 ] Dif 9 = (I 9 ) [(I 10 ) 1/3Dif 1 ] Dif 10 = (I 11 ) [(I 10 ) + 1/3Dif 1 ] Dif 11 = (I 12 ) [(I 13 ) 1/3Dif 1 ] (3)

where (I j ) means a wavelet coefficient of I j plan. Dif 4, Dif 5, Dif 6, etc, are obtained in the same way. In the figure 5 is shown the GOP principle. I 1(I) I 4(P) (I 1) (I 4) (I 1) Dif 1 The results obtained with video are illustrated in the figure 6c and those obtained with MPEG-4 in the figure 6d. In the figure 6c, the PSNR value remains constant compared to the figure 6b. In the figure 6e, we show a result with a high BER (BER = 1,34.10-2 ) still using video coding and with the same compression rate. This underlines the robustness of the method suggested in this article and its interest for video transmission through complex channels compared to MPEG-4 where the result is unacceptable. I 2(B) (I 2) Dif 2 I 3(B) (I 3) Dif 3 (a) Original sequence Mom_024 Mom_025 Mom_035 Figure 5. GOP (b) video, without errors (Average PSNR = 27,28 db) I. VIDEO TRANSMISSION THROUGH AWGN CHANNEL In this section, we show the results obtained with video coding detailed above compared to MPEG-4 standard. The digital modulation is a QAM-16. Each symbol is coded using 4 bits what is easy to adapt to our method [9]. The main advantage, in comparison with QAM-256 modulation, is that it is easy to implement and more robust against the noise without disorganizing the initial scheme designed for 256 vectors (coded using 8 = 2x4 bits). The compression rate is about 44:1. The video Mom is transmitted over the AWGN channel (figure 6a : video clip from Mom: original : Mom_024, Mom_025, Mom_035). The codebooks are constructed in the same way as previously relatively to the chosen compression rate and from 4 basis video : Mom-daughter, Claire, Missa, Grand-mom. The result of this sequence obtained with video is given in the figure 6b. The restored sequence appears quite good in terms of visual quality. The PSNR equals 27, 28 db, that is obviously less efficient than MPEG-4 or H264 standards. The original sequence (figure 6a.) is then transmitted through an AWGN channel with the following features: - BER 2.10-4 - SNR = 8 db - T c = 63.5 for video - T c = 61 for MPEG-4 standard - Modulation : QAM-16 (c) video, BER = 1,8.10-4, QAM-16, Average PSNR = 27,26 db (d) MPEG-4, BER = 1,9.10-4, QAM-16, Average PSNR = 13.68 db (e) video, de BER =1,34.10-2, QAM-16 Average PSNR = 25,52dB Figure 6. Sequence transmitted through ideal and AWGN channel comparable compression rate : (a) original sequence (b) video, without errors, (c) video, BER = 1,8.10-4, (d) MPEG-4, BER =1,9.10-4 (e) video, BER = 1,32.10-2

IV. CONCLUSION AND OUTLOOKS This paper presents a joint source channel coding designed for video application, the video coding. It is based on an association of a wavelet transform, an appropriate codebook for each image of decomposition and a classical predictive coding involving a sequence of 12 images. We show that this strategy is particularly welladapted to video transmission through complex and low data rate channel while keeping the useful bit rate as optimum. A comparison with MPEG-4 shows clearly the performances of video in terms of visual quality and compression rate compromise. The visual quality can be improved by introducing as for video standard motion vector what will allow a significant reduction in the residual distortion on the restored images [7]. Its encoding will be done by using vector quantization adapted to the modulation in order to keep the global strategy related to the robustness and the optimal useful bit rate. Finally, some errors (figures 6c and 6e) could be attenuated by using video PDE (Partial Differential Equations) -based restoring algorithm as we developed [8] for fixed images. International Conference on Artificial Neural Networks, vol. 1, Paris, 1995, p. 287 291. [10] C. Chatellier, H. Boeglen, C. Perrine, C. Olivier, O. Haeberlé, "A robust joint source channel coding scheme for image transmission over the ionospheric channel" Signal Processing: Image Communication, 22, pp 543-556, 2007. [11] S. Wang, Stratégie de codage conjoint de séquences video base ondelettes Ph.D thesis, University of Poitiers, 2008 REFERENCES [1] H264/MPEG-4 Part 10 Tutorials (Richardson), http://www.vcodex.com/h264.html [2] MPEG-4 Video Group, Coding of audio-visual objects: Video, ISO/IEC JTC1/SC29/WG11 N2202, March 1998 [3] Aitsab O. "Turbo codes et codage conjoint source canal : application à la transmission d images". PhD thesis, Ecole Nationale Supérieure des Télécommunications de Bretagne, Février 1998 [4] Souhard, B. "Codage conjoint source canal : Application à la transmission d images fixes sur canal ionosphérique", Thèse de l Université de Poitiers, Mars 2004. [5] Kohonen T. ''Self Organization and associative memory". Springer-Verlag, 1994. [6] Wang S., Chatellier C., et Olivier C. "Codage conjoint source canal pour des séquences d images visioconférences", CORESA, Caen, pp 89-93, Novembre 2006 [7] Agostini M.A., Antonini M., Barlaud M. "Model-based bit allocation between wavelet subbabds and motion information in MCWT video coders", EUSIPCO 2006, Florence (Italy), September 2006 [8] Bourdon P., Augereau B., Chatellier C., Olivier C.: "A multi-resolution, geometry-driven error concealment method for corrupted JPEG color images" - Signal Processing: Image and Communication, 20, pp 681-694, August 2005 [9] J. Kangas, in: Increasing the error tolerance in transmission of vector quantize image by self organizing map,