Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding

Similar documents
WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

MULTIVIEW DISTRIBUTED VIDEO CODING WITH ENCODER DRIVEN FUSION

Systematic Lossy Error Protection of Video based on H.264/AVC Redundant Slices

Project Proposal: Sub pixel motion estimation for side information generation in Wyner- Ziv decoder.

Visual Communication at Limited Colour Display Capability

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding

Distributed Video Coding Using LDPC Codes for Wireless Video

Wyner-Ziv Coding of Motion Video

Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices

Systematic Lossy Forward Error Protection for Error-Resilient Digital Video Broadcasting

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005.

Robust 3-D Video System Based on Modified Prediction Coding and Adaptive Selection Mode Error Concealment Algorithm

CHROMA CODING IN DISTRIBUTED VIDEO CODING

Systematic Lossy Error Protection of Video Signals Shantanu Rane, Member, IEEE, Pierpaolo Baccichet, Member, IEEE, and Bernd Girod, Fellow, IEEE

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

INFORMATION THEORY INSPIRED VIDEO CODING METHODS : TRUTH IS SOMETIMES BETTER THAN FICTION

Systematic Lossy Error Protection based on H.264/AVC Redundant Slices and Flexible Macroblock Ordering

New Approach to Multi-Modal Multi-View Video Coding

Dual Frame Video Encoding with Feedback

ROBUST REGION-OF-INTEREST SCALABLE CODING WITH LEAKY PREDICTION IN H.264/AVC. Qian Chen, Li Song, Xiaokang Yang, Wenjun Zhang

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Bit Rate Control for Video Transmission Over Wireless Networks

Energy Efficient Video Compression for Wireless Sensor Networks *

Scalable Foveated Visual Information Coding and Communications

Interactive multiview video system with non-complex navigation at the decoder

Chapter 2 Introduction to

WITH the rapid development of high-fidelity video services

Key Techniques of Bit Rate Reduction for H.264 Streams

Decoder-driven mode decision in a block-based distributed video codec

Robust Joint Source-Channel Coding for Image Transmission Over Wireless Channels

PACKET-SWITCHED networks have become ubiquitous

Dual frame motion compensation for a rate switching network

Rate-Distortion Analysis for H.264/AVC Video Coding and its Application to Rate Control

ROBUST IMAGE AND VIDEO CODING WITH ADAPTIVE RATE CONTROL

ENCODING OF PREDICTIVE ERROR FRAMES IN RATE SCALABLE VIDEO CODECS USING WAVELET SHRINKAGE. Eduardo Asbun, Paul Salama, and Edward J.

CERIAS Tech Report Preprocessing and Postprocessing Techniques for Encoding Predictive Error Frames in Rate Scalable Video Codecs by E

Constant Bit Rate for Video Streaming Over Packet Switching Networks

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

Error Resilient Video Coding Using Unequally Protected Key Pictures

Analysis of Packet Loss for Compressed Video: Does Burst-Length Matter?

THE CAPABILITY of real-time transmission of video over

Multiview Video Coding

Video Quality Monitoring for Mobile Multicast Peers Using Distributed Source Coding

Wireless Multi-view Video Streaming with Subcarrier Allocation by Frame Significance

ROBUST ADAPTIVE INTRA REFRESH FOR MULTIVIEW VIDEO

Error Resilience for Compressed Sensing with Multiple-Channel Transmission

Compressed-Sensing-Enabled Video Streaming for Wireless Multimedia Sensor Networks Abstract:

Analysis of Video Transmission over Lossy Channels

1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010

INTRA-FRAME WAVELET VIDEO CODING

UC San Diego UC San Diego Previously Published Works

Chapter 10 Basic Video Compression Techniques

Adaptive Key Frame Selection for Efficient Video Coding

Reduced complexity MPEG2 video post-processing for HD display

UNBALANCED QUANTIZED MULTI-STATE VIDEO CODING

Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding. Abstract. I. Introduction

Color Quantization of Compressed Video Sequences. Wan-Fung Cheung, and Yuk-Hee Chan, Member, IEEE 1 CSVT

SCALABLE video coding (SVC) is currently being developed

1. INTRODUCTION. Index Terms Video Transcoding, Video Streaming, Frame skipping, Interpolation frame, Decoder, Encoder.

GLOBAL DISPARITY COMPENSATION FOR MULTI-VIEW VIDEO CODING. Kwan-Jung Oh and Yo-Sung Ho

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Error Concealment for SNR Scalable Video Coding

FAST SPATIAL AND TEMPORAL CORRELATION-BASED REFERENCE PICTURE SELECTION

AUDIOVISUAL COMMUNICATION

Multi-view Video Streaming with Mobile Cameras

Concealment of Whole-Picture Loss in Hierarchical B-Picture Scalable Video Coding Xiangyang Ji, Debin Zhao, and Wen Gao, Senior Member, IEEE

PAPER Wireless Multi-view Video Streaming with Subcarrier Allocation

Robust Transmission of H.264/AVC Video Using 64-QAM and Unequal Error Protection

Robust Transmission of H.264/AVC Video using 64-QAM and unequal error protection

Adaptive Distributed Compressed Video Sensing

Error-Resilience Video Transcoding for Wireless Communications

Reduced Decoder Complexity and Latency in Pixel-Domain Wyner-Ziv Video Coders

Minimax Disappointment Video Broadcasting

HIGH Efficiency Video Coding (HEVC) version 1 was

Robust wireless video multicast based on a distributed source coding approach $

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

ARTICLE IN PRESS. Signal Processing: Image Communication

Marie Ramon, François-XavierCoudoux, andmarcgazalet. 1. Introduction

The H.263+ Video Coding Standard: Complexity and Performance

SCALABLE EXTENSION OF HEVC USING ENHANCED INTER-LAYER PREDICTION. Thorsten Laude*, Xiaoyu Xiu, Jie Dong, Yuwen He, Yan Ye, Jörn Ostermann*

Video coding standards

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

Overview: Video Coding Standards

OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS

Exploring the Distributed Video Coding in a Quality Assessment Context

Drift Compensation for Reduced Spatial Resolution Transcoding

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

IN OBJECT-BASED video coding, such as MPEG-4 [1], an. A Robust and Adaptive Rate Control Algorithm for Object-Based Video Coding

Video Transmission. Thomas Wiegand: Digital Image Communication Video Transmission 1. Transmission of Hybrid Coded Video. Channel Encoder.

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

SUMMIT LAW GROUP PLLC 315 FIFTH AVENUE SOUTH, SUITE 1000 SEATTLE, WASHINGTON Telephone: (206) Fax: (206)

Distributed Video Coding

Wyner-Ziv to H.264 Video Transcoder for Low Cost Video Encoding

Popularity-Aware Rate Allocation in Multi-View Video

Multimedia Communications. Image and Video compression

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

RATE-REDUCTION TRANSCODING DESIGN FOR WIRELESS VIDEO STREAMING

ISSN (Print) Original Research Article. Coimbatore, Tamil Nadu, India

Hierarchical SNR Scalable Video Coding with Adaptive Quantization for Reduced Drift Error

Transcription:

Free Viewpoint Switching in Multi-view Video Streaming Using Wyner-Ziv Video Coding Xun Guo 1,, Yan Lu 2, Feng Wu 2, Wen Gao 1, 3, Shipeng Li 2 1 School of Computer Sciences, Harbin Institute of Technology, Harbin, 150001, China 2 Microsoft Research Asia, Beijing, 100080, China 3 Institute of Computing Technology, Chinese Science Academy, Beijing, 100080, China ABSTRACT The free viewpoint switching is one of the most important features of multi-view video streaming. The key problem lies in how to achieve the best performance when the camera processing capability and the network bandwidth are limited. In this paper, we propose a novel free viewpoint switching scheme for multi-view video scenario, in which the distributed video coding technique is employed. In this scheme, the multi-camera video sources are encoded separately with the traditional hybrid video coding scheme, and meanwhile an alternative bitstream is produced for every frame based on the Wyner-Ziv coding method for the purpose of error correction when the viewpoint switching occurs. When switching happens, the Wyner-Ziv bits corresponding to the actual reference frame at the switching point is transmitted and used to recover the true reference. Instead of completely removing the mismatch, the proposed switching scheme tries to reduce the mismatch to an acceptable level so as to save the bits for the switching frame. A wavelet transform domain Wyner- Ziv coding method is proposed to produce the Wyner-Ziv bits for the switching frame. Conclusively, with the proposed scheme, the inter-camera communication can be avoided and the drifting error can be controlled efficiently when the viewpoint switching occurs. Keywords: Multi-view video coding, free viewpoint switching, Wyner-Ziv coding, wavelet transform, turbo codes 1. INTRODUCTION The key characteristic of multi-view video lies in the interactivity, which can give users the opportunity to choose their favorite viewpoint freely and get the three-dimensional panoramic scene. Because of the large data amount, multi-view video needs much more bandwidth than traditional video in transmission. In the past decade, a number of multi-view video coding (MVC) techniques have been proposed [1]-[4], which usually employ the multi-hypothesis prediction scheme to utilize either temporal or view correlation in terms of the coding cost. With the inter-view prediction, multiple views have to be transmitted for decoding a single view. Therefore, it is not a practical way to adopt these MVC techniques in the real multi-view video streaming systems. Intuitively, it is desirable to develop a free viewpoint switching scheme, which can switch between the different views upon request instead of transmitting all the views. In general, the switching between these bitstreams is restricted only on the key frames to avoid the drifting problem. As we know, the more key frames the worse coding efficiency. Thus, key frames are normally encoded periodically far apart from each other in the bitstream, which will reduce the flexibility of the viewpoint switching. Recently, a multi-view video streaming system has been reported in [5], which offers the feature of free viewpoint switching at any desired position. In this system, only one of the multi-view bitsreams is transmitted if the viewpoint switching does not occur. When the viewpoint switching occurs at the P frame, the additional global motion parameters between the two cameras are transmitted to reduce the mismatch. A new rate-distortion optimization (RDO) model considering both the source distortion and the potential distortion caused by the viewpoint switching has been employed in the motion vector and coding mode selection in [5]. However, this system also has some disadvantages in the practical applications. Firstly, the RDO scheme with error control will inevitably decrease the absolute coding efficiency, which is The work was done when the author was with Microsoft Research Asia as an intern. Visual Communications and Image Processing 2006, edited by John G. Apostolopoulos, Amir Said, Proc. of SPIE-IS&T Electronic Imaging, SPIE Vol. 6077, 60770U, 2005 SPIE-IS&T 0277-786X/05/$15 SPIE-IS&T/ Vol. 6077 60770U-1

not good if the viewpoint switching never or seldom occurs. Secondly, the optimization process requires the data exchange between two cameras, which is also unpractical in many real applications. And thirdly, the above RDO scheme requires forecasting the potential switching points based on some user model, which is also a difficult task. Besides the technique directly developed for multi-view video streaming, some other algorithms developed for the multiple bit-rate (MBR) streaming, e.g. the SP frame in H.264, can also be employ in the free viewpoint switching scenario. However, they still suffer from the similar problems. For example, in the SP frame based switching, the coding efficiency of the SP frames is worse than the regular P frames. Therefore, a desirable free viewpoint switching scheme should meet the following two requirements. Firstly, it should not lower the absolute coding performance of the original bit-streams. Secondly, it should support the viewpoint switching at any desired frame. Distributed source coding (DSC) may be a promising technology to solve the above problems. Theory of Slepian-Wolf shows that, even if the correlated sources are encoded without getting information from each other, the coding performance can be as good as joint encoding if the compressed signals can be jointly decoded [6]. Wyner and Ziv have extended this theory to the case of lossy source coding [7]. Recently, several practical Slepian-Wolf and Wyner-Ziv coding techniques have been proposed in the field of video coding, namely, distributed video coding (DVC), e.g. the techniques in [8] and in [9]. Inspired by the basic idea of DVC, we propose a new free viewpoint switching scheme for multi-view video streaming in this paper. Wyner-Ziv coding method is employed for the purpose of error correction when the viewpoint switching occurs. In the proposed system, the different views are coded independently with the traditional hybrid video coding method. Each inter frame is also encoded with Wyner-Ziv encoder and the coding bits are stored in a buffer. If there is no switching in this frame, the stored bits are discarded after the normal coding bits are transmitted. If switching happens, the stored bits are transmitted as Wyner-Ziv frame and used to limit the mismatch between reference frames in encoding and decoding to the maximum extent. Actually, the Wyner-Ziv frame can also be on-line generated only at the switching point for the real-time multi-view video streaming scenario. The rest of this paper is organized as follow. Section 2 describes the proposed viewpoint switching system including two practical implementations and the Wyner-Ziv coding employed in this scheme. Experimental results and the corresponding analysis are given in section 3. Conclusions are drawn in section 4. 2.1 Problem statement 2. PROPOSED VIEWPOINT SWITCHING SYSTEM Figure 1 shows the working flow of a typical multi-view video streaming system. The multiple views are encoded with standard-compliant coding scheme, e.g. H.263+. The normal bitstream of one view is transmitted when there is no switching. The user at the client can freely select the preferred view to watch, and send the selection to the server via a feedback channel. Then, the server sends the new bitstream corresponding to the selected view to the client hereafter. Throughout the streaming process, the user can switch from one view to another freely. Figure 1: Typical multi-view video streaming system. Since only one bitstream is transmitted at one time during the streaming process, the above scenario can be effortlessly implemented with the existing streaming system and network conditions. However, the limitation of this architecture is that the view switching can only occurs at the key frames, i.e. at the Intra frames, as shown in Figure 2. If the switch point happens to be a P frame, the mismatch between the received and reconstructed reference frame at the decoder and the originally reconstructed reference frame at the encoder may result in severe drifting problem. Therefore, in order to provide more flexibilities of viewpoint switching to the users, it is very desirable to reduce this kind of mismatch. SPIE-IS&T/ Vol. 6077 60770U-2

Figure 2: Switching at the key frame. An efficient solution to the mismatch problem described above is to add the inter-view prediction information into the bitstream. When the switch happens, the embedded side information should be helpful to generate the new reference frames that are equivalent or very similar to the original reference frames of the current frame to be decoded. But as we know, in many cases, the processing ability of the cameras is limited. If inter-view prediction is done between two cameras when encoding, the large data exchange will become the main problem. Therefore, we propose a novel multiview switching system based on Wyner-Ziv coding. The inherent characteristic of Wyner-Ziv coding that can be intra encoded and inter decoded can solve the problem gracefully. The mismatch between different references can be compensated without adding much complexity in encoder. 2.2 Proposed DVC based switching scheme Figure 3 illustrates the proposed viewpoint switching scheme in terms of the above multi-view video streaming scenario. 1,n 2,n 3,n n,n 1,n+1 1,n+1 2,n+1 2,n+1 3,n+1 3,n+1 n,n+1 1,n+2 1,n+2 2,n+2 2,n+2 3,n+2 3,n+2 n,n+2 1,n+3 1,n+3 2,n+3 2,n+3 3,n+3 3,n+3 n,n+3 Figure 3: Free viewpoint switching in multi-view video streaming. As shown in the figure, frames of all views are coded as the traditional intra or inter frames. When the viewpoint switching happens, bitstream from the new selected view will be transmitted instead of the current bitstream. Supposing the switching occurs at a P frame, the key problem lies in how to limit the mismatch between the reference and reconstructed frames of the switched frame with the least cost at any position. We tackle this problem with the distributed video coding technique. In particular, besides the traditional inter frame coding, we also encode each inter frame with Wyner-Ziv encoder. The produced Wyner-Ziv bits are stored along with the corresponding traditionally intercoded frame, as described with the dashed block in Figure 3. If the viewpoint switching does not occur at this frame, the SPIE-IS&T/ Vol. 6077 60770U-3

Wyner-Ziv frame will be dropped; otherwise, the Wyner-Ziv frame will be transmitted prior to the next inter-coded frame so as to correct the mismatch. In this paper, we propose two practical implementation methods based on the above viewpoint switching framework, as shown in Figure 4. The switching is from View 1 to View 2 and the switching paths for the two methods are different. One is from P 1,n+1 to P 2,n+2, another is from P 1,n+1 to P 2,n+3. We will describe the two methods in details as follows. Method 1 The left diagram in Figure 4 shows this method. The start point of the switching is P 1, n+1 and the end point is P 2, n+2.. When switching happens, the encoder transmits frame P 1, n+1 and Wyner-Ziv frame W 2, n+1 corresponding to P 2, n+1 to the decoder. At the decoder, the decoded P 1, n+1 is warped toward P 1, n+2 using global motion model. A six-parameter affine model is used in this paper. The warped version of P 1, n+1, i.e. Y 1, n+1, is used as the side information of P 2, n+1 and is sent to the Wyner-Ziv decoder together with W 2, n+1. After decoding, the reconstructed frame P 2, n+1 that is the prediction P 2, n+1 can be used as the reference of P 2, n+2. Thus, the following frames in View 2 can be almost correctly decoded continuously. Method 2 The right diagram in Figure 4 shows this method. Different from method 1, the goal of this method is to avoid transmitting the whole frame P 2, n+2. As shown in the right diagram in Figure 4, the Wyner-Ziv frame W 2, n+2 correspond to frame P 2, n+2 is transmitted instead of P 2, n+1. In the meanwhile, the motion vectors of P 2, n+2 and some intra-coded blocks are also transmitted to help producing the side information frame. At the decoder, the reconstructed frame P 1, n+1 is warped towards P 2, n+1. The warped frame, the motion information M 2, n+2 and the intra-coded blocks in frame P 2, n+2 are jointly used to produce the side information frame Y 1, n+1. Then, Y 1, n+1 and W 2, n+2 can be sent to the Wyner-Ziv decoder to achieve the reconstructed frame P 2, n+1, which can serve as the reference to decode frame P 2, n+3 as well as the following frames in View 2. Figure 4: Structures of two switching methods. 2.3 Wyner-Ziv coding scheme For the coding of the Wyner-Ziv frame, we employ the basic idea proposed in [7], i.e. source coding with side information using the turbo codes. Figure 5 shows the Wnyer-Ziv encoder/decoder diagram used in our system. At the encoder, the reconstructed frame of the current inter frame is coded with a low complexity method, including wavelet transform, scalar quantization and turbo encoder. The wavelet transform is used to decompose the frame into several levels. Uniform scalar quantization is employed to quantize the low-pass and high-pass subband coefficients into M levels. Note that the Wyner-Ziv encoder is independent of the side information in the proposed system. The quantized wavelet coefficients are input into turbo encoder, and only parts of the produced parity bits are selected based on a puncture matrix. Without the feedback information from the decoder, the rate allocation may not be very accurate. In other words, sometimes we have to waste some bits to guarantee the convergence in the turbo decoder. However, it is still valuable in the viewpoint of flexibility. SPIE-IS&T/ Vol. 6077 60770U-4

At the encoder, the reconstructed frame, e.g. P 2, n+1 in method 1, is taken as the source frame of Wyner-Ziv coding. It is firstly decomposed into several levels by wavelet transform. In this paper, we use 9/7 wavelet filter and 3 decomposition levels. Then, the transform coefficients are quantized into M levels using a uniform scalar quantizer. Note that different subbands may have different M values. M is related to the selection of quantization step size for each subband. The quantized coefficients are input into turbo encoder bit-plane by bit-plane. Coefficients of the same subband are encoded together. Only the parity bits are stored in the parity buffer and transmitted to the decoder. A puncture matrix is employed to select parity bits. At the decoder, both the side information frame, e.g. Y 1, n+1 in method 1, and the Wyner-Ziv bits, e.g. W 2, n+1, are available. The key problem is how to reconstruct the Wyner-Ziv frame using the side information and the Wyner-Ziv bits. As shown in Figure 5, the turbo decoder and the reconstruction module assume a Laplacian residual distribution between the frame X to be decoded and the side information frame Y. Let d be the difference between the corresponding coefficients in X and α α d Y. Then, the distribution of d can be approximated as f ( d) = 2 e for each subbands [9]. Let c i j denotes the ith bit of a coefficient c j, and c i j denotes the estimated reconstruction value for c i j. The probability P can be computed using the residual distribution model as follows: C i P= e α d j i', with d = ( m I( c ) + offset) I( y ). (1) α 2 i C j i j j Here m i represents the magnitude at ith bitplane. I (c j i ) indicates the possible value of c j i, which is equal to 1 or 0. y j is the coefficient of side information corresponding to c j. offset is a estimated value used to compensate the lower part of c j, because the lower bitpane of c j is still not decoded now. The value of offset is according to the distribution parameter and the quantization step size. After the current bitplane is decoded, it will be used to help decoding the next bitplane. Figure 5: Block diagram of the Wyner-Ziv coding scheme used in the proposed system. The advantages of the proposed scheme are summarized as follows: 1) The communications between the different cameras can be removed. In the previous bitstream switching schemes, e.g. the SP frame in H.264, the reconstructed frames from both bitstreams should be jointly analyzed and processed at the encoder. However, in proposed scheme, in addition to the traditional intra/inter coded frames, the Wyner-Ziv frame is also independently encoded. Thus, no data has to be exchanged in cameras. This advantage will be enlarged in the case of dense multi-view system. 2) The drifting errors can be limited to a minimum extent. This is also one of the most significant features of the proposed scheme. Assuming P 1,n+1 or its global warped version is used as the reference frame after the viewpoint SPIE-IS&T/ Vol. 6077 60770U-5

switching, there are probably some large local errors, which will propagate to the following inter-coded frames. However, in the proposed system, the Wyner-Ziv coding can correct most of the large local errors. 3) Switching point can be selected at any frame without the needs of forecasting the switching position and frequency. And also, the coding efficiency as well as the transmission efficiency is the same as the traditional single view streaming if the viewpoint switching has never occur. 3. EXPERIMENTAL RESULTS In order to verify the efficiency of the proposed scheme, experiments on the real multi-view video sequence are carried out. Test sequence, race1 and golf2 provided by KDDI Lab are used in the test. In each sequence, two views with 300 frames each are selected. IPPP structure with GOP size 30 is used in the H.263+ coding. Each reconstructed P frame is also encoded with the Wyner-Ziv encoder and the punctured parity bits are stored in a buffer. We assume that each GOP has a switching point at the middle position. According to the application scenario without inter-camera communication, four switching methods are compared, including the direct switching, global warping-based switching and the two proposed Wyner-Ziv switching methods. Note that the proposed methods require extra bits to be transmitted at the switching point. The numbers of bits for different QP are shown in Table 1, in which W-Z represents Wyner-Ziv bits, and P/MV means P frame or MV bits. In method 1, Wyner-Ziv bits and a P frame are transmitted when switching happens. While in method 2 Wyner-Ziv bits are transmitted together with some MV bits and intra-coded blocks. Figure 6 shows the last reconstructed frames of a GOP through the three different schemes. Because the visual qualities of the two Wyner-Ziv methods are similar, only the result of method 1 is given. Based on Figure 6, both direct switching and the global warping-based switching suffer from the error propagation. Actually, although sometimes the global warping can achieve visually acceptable reference frame at the switching point, the mismatch between it and the true reference frame usually concentrates on some local regions, which will inevitably propagate to the following frames, as shown in Figure 6. The testing results along the whole sequences at different bitrates are shown in Figure 7. Note that the anchor results, noted by Theory bound, are calculated by always using the perfect reconstructed frames, which cannot be achieved in practice. Obviously, the proposed viewpoint switching scheme can achieve the best rate-distortion performance. 4. CONCLUSIONS In this paper, a novel free viewpoint switching scheme has been proposed for the multi-view video streaming, in which the distributed video coding technique is employed to reduce the mismatch between the available reference frame and the true reconstruction frame when the switching happens. In particular, a wavelet transform domain Wyner-Ziv coding method is proposed to produce the switching frame. With the proposed scheme, inter-camera communication is avoided, and the drifting error can be controlled efficiently. REFERENCES 1. R. S. Wang, Y. Wang, Multiview Video Sequence Analysis, Compression, and Virtual Viewpoint Synthesis, IEEE Transactions on Circuits and Systems for Video Technology, Vol.10, pp.397-410, April 2000. 2. N. Grammalidis, M. G. Strintzis, Disparity and Occlusion Estimation in Multiocular Systems and Their Coding for the Communication of Multiview Image Sequences, IEEE Transactions on Circuits and Systems for Video Technology, Vol.8. pp.328-344, June 1998. 3. H. Kimata and M. Kitahara, Multi-view video coding based on scalable video coding for free-viewpoint video, ISO/IEC JTC1/SC29/WG11 M11571, Hong Kong, Jan. 2005. 4. Y. Ho, S. Yoon and S. Kim, A framework for multi-view video coding using layered depth image, ISO/IEC JTC1/SC29/WG11 M11582, Hong Kong, Jan. 2005. 5. X. Guo, Y. Lu, W. Gao and Q. Huang, Viewpoint switching in multi-view video streaming, Proc. International Symposium on Circuits and Systems, Kobe, Japan, May 2005. 6. D. Slepian and J. Wolf, Noiseless coding of correlated information sources, IEEE Transactions on Information Theory, vol. 19, pp.471-480, July 1973. 7. A. D. Wyner and J. Ziv, The rate-distortion function for source coding with side information at the decoder, IEEE Transaction on Information Theory, vol. 22, pp.1-10, Jan.1976. SPIE-IS&T/ Vol. 6077 60770U-6

8. R. Puri, and K. Ramchandran, PRISM: a new robust video coding architecture based on distributed compression principles, Proc. of 40th Allerton Conference on Communication, Control, and Computing, Allerton, IL, Oct. 2002. 9. Aaron, S. Rane and B. Girod, Transform-domain Wyner-Ziv codec for video, Proc. Visual Communications and Image Processing, VCIP-2004, San Jose, CA, January 2004. Table 1: Number of bits for switching frame. Method Method 1 Method 2 Switching Bits Type Race1 Golf2 QP QP 4 6 8 10 12 4 6 8 10 12 W-Z bits 144880 110400 91392 86400 73200 96000 73200 57024 50688 44352 P/MV bits 73438 45268 31359 23654 19053 15224 8688 5784 3912 3032 Total 218318 155668 122751 110054 92253 111224 81888 62808 54600 47384 W-Z bits 186920 144880 110400 104064 91392 104064 86400 66048 59712 47040 P/MV bits 31435 22306 19201 15233 13230 8723 5960 3920 2124 1911 Total 218355 167186 129601 119297 104622 112787 92360 69968 61836 48951 (a) (b) (c) Figure 6: Comparison of the visual quality of the three last P frames in a GOP after switching occurs. Direct switching (a), Global warping-based switching (b) and Wyner-Ziv switching (c). SPIE-IS&T/ Vol. 6077 60770U-7

Race1 PSNR-Y [db] 38 36 34 32 30 28 26 24 22 Theory bound Direct switching Warping-based switching Scheme 1 Scheme 2 20 200 400 600 800 1000 1200 1400 1600 1800 2000 Bitrate (kbps) Golf2 41 39 37 35 PSNR-Y [db] 33 31 29 27 Theory bound 25 Direct switching Warping-based switching 23 Scheme 1 Scheme 2 21 100 200 300 400 500 600 700 800 Bitrate (kbps) Figure 7: Results of the sequence of Race1 and Golf2. SPIE-IS&T/ Vol. 6077 60770U-8