Advanced Video Processing for Future Multimedia Communication Systems

Similar documents
Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

The H.26L Video Coding Project

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

Chapter 2 Introduction to

Robust 3-D Video System Based on Modified Prediction Coding and Adaptive Selection Mode Error Concealment Algorithm

Overview: Video Coding Standards

ESTIMATING THE HEVC DECODING ENERGY USING HIGH-LEVEL VIDEO FEATURES. Christian Herglotz and André Kaup

Video coding standards

A video signal consists of a time sequence of images. Typical frame rates are 24, 25, 30, 50 and 60 images per seconds.

HEVC: Future Video Encoding Landscape

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Mauricio Álvarez-Mesa ; Chi Ching Chi ; Ben Juurlink ; Valeri George ; Thomas Schierl Parallel video decoding in the emerging HEVC standard

Chapter 10 Basic Video Compression Techniques

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Motion Compensation Techniques Adopted In HEVC

4 6 July 2018 Hannover, Germany

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005.

The H.263+ Video Coding Standard: Complexity and Performance

Principles of Video Compression

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Multimedia Communications. Video compression

Intra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences

AUDIOVISUAL COMMUNICATION

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

WE CONSIDER an enhancement technique for degraded

17 October About H.265/HEVC. Things you should know about the new encoding.

Video Over Mobile Networks

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding

HEVC Subjective Video Quality Test Results

COMPLEXITY REDUCTION FOR HEVC INTRAFRAME LUMA MODE DECISION USING IMAGE STATISTICS AND NEURAL NETWORKS.

Joint source-channel video coding for H.264 using FEC

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS

WITH the rapid development of high-fidelity video services

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

Multimedia Communications. Image and Video compression

MULTI-CORE SOFTWARE ARCHITECTURE FOR THE SCALABLE HEVC DECODER. Wassim Hamidouche, Mickael Raulet and Olivier Déforges

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

PACKET-SWITCHED networks have become ubiquitous

OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0. General Description. Applications. Features

Into the Depths: The Technical Details Behind AV1. Nathan Egge Mile High Video Workshop 2018 July 31, 2018

Parameters optimization for a scalable multiple description coding scheme based on spatial subsampling

Analysis of Video Transmission over Lossy Channels

Hardware Implementation for the HEVC Fractional Motion Estimation Targeting Real-Time and Low-Energy

HEVC Real-time Decoding

Project Proposal: Sub pixel motion estimation for side information generation in Wyner- Ziv decoder.

Conference object, Postprint version This version is available at

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

SCALABLE video coding (SVC) is currently being developed

ERROR CONCEALMENT TECHNIQUES IN H.264 VIDEO TRANSMISSION OVER WIRELESS NETWORKS

H.264/AVC Baseline Profile Decoder Complexity Analysis

Region Adaptive Unsharp Masking based DCT Interpolation for Efficient Video Intra Frame Up-sampling

FAST SPATIAL AND TEMPORAL CORRELATION-BASED REFERENCE PICTURE SELECTION

WHITE PAPER. Perspectives and Challenges for HEVC Encoding Solutions. Xavier DUCLOUX, December >>

MPEG + Compression of Moving Pictures for Digital Cinema Using the MPEG-2 Toolkit. A Digital Cinema Accelerator

Error concealment techniques in H.264 video transmission over wireless networks

CODING EFFICIENCY IMPROVEMENT FOR SVC BROADCAST IN THE CONTEXT OF THE EMERGING DVB STANDARDIZATION

Video Compression. Representations. Multimedia Systems and Applications. Analog Video Representations. Digitizing. Digital Video Block Structure

Multiview Video Coding

1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010

Reduced complexity MPEG2 video post-processing for HD display

Speeding up Dirac s Entropy Coder

A parallel HEVC encoder scheme based on Multi-core platform Shu Jun1,2,3,a, Hu Dong1,2,3,b

CONSTRAINING delay is critical for real-time communication

A HIGH THROUGHPUT CABAC ALGORITHM USING SYNTAX ELEMENT PARTITIONING. Vivienne Sze Anantha P. Chandrakasan 2009 ICIP Cairo, Egypt

ITU-T Video Coding Standards

Free Viewpoint Switching in Multi-view Video Streaming Using. Wyner-Ziv Video Coding

Understanding Compression Technologies for HD and Megapixel Surveillance

Error Concealment for SNR Scalable Video Coding

A robust video encoding scheme to enhance error concealment of intra frames

OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0. General Description. Applications. Features

STUDY OF AVS CHINA PART 7 JIBEN PROFILE FOR MOBILE APPLICATIONS

On Complexity Modeling of H.264/AVC Video Decoding and Its Application for Energy Efficient Decoding

Constant Bit Rate for Video Streaming Over Packet Switching Networks

Study of AVS China Part 7 for Mobile Applications. By Jay Mehta EE 5359 Multimedia Processing Spring 2010

Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding. Abstract. I. Introduction

Distributed Video Coding Using LDPC Codes for Wireless Video

H.261: A Standard for VideoConferencing Applications. Nimrod Peleg Update: Nov. 2003

Video Transmission. Thomas Wiegand: Digital Image Communication Video Transmission 1. Transmission of Hybrid Coded Video. Channel Encoder.

Real-time SHVC Software Decoding with Multi-threaded Parallel Processing

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

MULTIVIEW DISTRIBUTED VIDEO CODING WITH ENCODER DRIVEN FUSION

Highly Efficient Video Codec for Entertainment-Quality

The Multistandard Full Hd Video-Codec Engine On Low Power Devices

Authors: Glenn Van Wallendael, Sebastiaan Van Leuven, Jan De Cock, Peter Lambert, Joeri Barbarien, Adrian Munteanu, and Rik Van de Walle

Impact of scan conversion methods on the performance of scalable. video coding. E. Dubois, N. Baaziz and M. Matta. INRS-Telecommunications

Lecture 23: Digital Video. The Digital World of Multimedia Guest lecture: Jayson Bowen

Interactive multiview video system with non-complex navigation at the decoder

Dual Frame Video Encoding with Feedback

Video Compression - From Concepts to the H.264/AVC Standard

Implementation of an MPEG Codec on the Tilera TM 64 Processor

A Study on AVS-M video standard

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

An Overview of Video Coding Algorithms

A High Performance VLSI Architecture with Half Pel and Quarter Pel Interpolation for A Single Frame

Spatially scalable HEVC for layered division multiplexing in broadcast

Scalable multiple description coding of video sequences

Transcription:

Advanced Video Processing for Future Multimedia Communication Systems André Kaup Friedrich-Alexander University Erlangen-Nürnberg

Future Multimedia Communication Systems Trend in video to make communication more immersive by higher image resolution and quality stereo and multi-view systems Data rate of uncompressed UHD video: 3840 x 2160 x 24 bit/pel x 30 frames/s = 6 Gbit/s or 746 MB/s One CD every second One DVD every 6 seconds One Blu-ray disc every 36 seconds Conclusion: compression is necessary and higher resolutions are more challenging with respect to efficiency and picture quality Page 2

Intraframe Prediction Assumption: images have (locally) high spatial correlation Intraframe prediction uses already decoded image blocks in causal neighborhood Best prediction mode according to optimization constraint is transmitted as side information Page 3

Interframe Prediction Assumption: video has (locally) high temporal correlation Interframe predicion using motion compensated reference Multiple reference images for long term prediction Motion vector is transmitted as side information Page 4

Overview Advancing compression efficiency Spatiotemporal prediction In-loop video denoising Measurement and prediction of energy consumption Improving displaying quality Scalable lossless compression Multi-view video using super-resolution Random sampling and reconstruction Conclusions and outlook Page 6

Spatiotemporal Prediction State-of-the-art: switching between interframe und intraframe prediction modes Decision taken using rate-distortion optimization Extended approach: joint spatiotemporal prediction as additional mode Page 7

Non-Local Means Refined Prediction Processing area is regarded for prediction Motion compensated block Reconstructed neighboring blocks Page 10

Non-Local Means Refined Prediction Task: Recover original signal in area from given signal samples Non-local means: Estimate refined samples in using weighted non-local average filter Page 11

Non-Local Means Refined Prediction Example for weight calculation: Samples with similar neighborhood get a large weight Samples with dissimilar neighborhood get a small weight Page 13

Test Sequences Crew Foreman Vimto Page 14

Simulation Results H.264/AVC JM10.2 Baseline Profile, Level 2.0 CIF sequences IPPP, 100 frames Search range: 16 sample 1 bit/block for signaling the new mode NLM-RP parameters: [Seiler, Richter, Kaup, PCS 2010] Page 15

Simulation Results Motion compensated prediction Non-local means refined prediction QP34: 33.54 db @ 447 kbit/s QP34: 34.25 db @ 434 kbit/s Page 16

Simulation Results Motion compensated prediction Non-local means refined prediction QP34: 33.54 db @ 447 kbit/s QP34: 34.25 db @ 434 kbit/s Page 17

Overview Advancing compression efficiency Spatiotemporal prediction In-loop video denoising Measurement and prediction of energy consumption Improving displaying quality Scalable lossless compression Multi-view video using super-resolution Random sampling and reconstruction Conclusions and outlook Page 18

Prediction Error Signal Problem Prediction error signal has more noise than the current frame itself Solution Remove noise from the predictor Page 19

Inter-Frame Encoder with In-Loop Denoising Simplified block diagram of an inter-frame encoder with in-loop denoising Page 20

Inter-Frame Decoder with In-Loop Denoising Simplified block diagram of an inter-frame decoder with in-loop denoising Denoising is performed after displaying the decoded frames Page 21

Quantization of Noise Noise filtering works on transformed and quantized reference signal Analytical model for Gaussian noise and perfect prediction Observation: Noise depends on Page 22

Simulation Conditions HEVC reference software HM-2.2 Coding of 100 frames QP 2 {12 37} Coding configurations: ldlc_p, ldhe_p, ldlc, ldhe SVT test sequences from ftp://vqeg.its.bldrdoc.gov/ Resolution of 3840x2160 pixels with 50 frames per second Using a centrically cropped version of 2560x2160 pixels Denoising parameters AWF: window of 3x3 pixels L =1, H =3 Page 26

Simulation Results for ParkJoy (ldlc) Estimated noise of the input sequence σ n 1.8 Page 27

Overview Advancing compression efficiency Spatiotemporal prediction In-loop video denoising Measurement and prediction of energy consumption Improving displaying quality Scalable lossless compression Multi-view video using super-resolution Random sampling and reconstruction Conclusions and outlook Page 30

Energy Consumption of a Video Decoder Battery constraint: Operating time of portable devices is limited by battery capacity Especially critical for HD and UHD content Goal: Extend operating times by reducing the required decoding energy Page 31

Modeling the Decoding Energy Decoding energy estimated through processing time: [Herglotz, Kaup, ISCAS 2015] Mean power Offset Energy Decoding time Decoding energy estimated through processor events: M=3 events considered: - Number of instruction fetches - Level 1 data cache misses - Hardware interrupts Page 32

Modeling the Decoding Energy Cont d Decoding energy estimated through high-level features: Frame width (pixels) Frame height (pixels) Number of frames Bit stream file size System specific variables [Herglotz, Kaup, EUSIPCO 2015] Page 33

Modeling the Decoding Energy Cont d Decoding energy estimated through bit stream features: [Herglotz, Kaup, ICIP 2014] Feature index (up to 90) Number of occurrences Feature specific energy Examples: Intra prediction (mode and block size) Coefficient decoding 110000101011110011001101000011111...... 0...... 0.. 0-1 0 0 0 0 0 Page 34

Estimation Accuracies Test set: 120 sequences, 16-40 frames, QP=10,32,45 Encoder configurations: intra, low delay (P), random access, Software: HM-13.0, libde165, FFmpeg Hardware: Pandaboard, Beagleboard, FPGA Estimation error: Mean absolute estimation errors: Page 35

Overview Advancing compression efficiency Spatiotemporal prediction In-loop video denoising Measurement and prediction of energy consumption Improving displaying quality Scalable lossless compression Multi-view video using super-resolution Random sampling and reconstruction Conclusions and outlook Page 36

Motivation New video coding standard HEVC primarily targeting consumer applications with lossy compression Need for lossless compression in professional applications Medical imaging (telemedicine) Archiving (cinema) High bitrate limited channel capacity Scalable lossless coding using two layers Lossy base layer (BL) Lossless enhancement layer (EL) en.wikipedia.org/wiki/file:rupturedaaa.png Page 37

System Overview Page 38

Base Layer Lossy BL compression using HEVC Page 39

Enhancement Layer Lossless EL coding using the proposed Sample-based Weighted Prediction for Enhancement Layer Coding (SELC) Page 40

Enhancement Layer Coding SELC Encoder SELC Decoder Intra prediction: Non-linear sample-based weighted prediction (SWP) Implemented using fast lockup tables Entropy coding/decoding: Modified context-adaptive binary arithmetic coding (CABAC) [Wige, Kaup, ICIP 2013] Page 41

Intra Prediction (SWP) I Four-pixel neighborhood and four-pixel patch Neighborhood of current pixel Patch around a pixel Current pixel Patch pixel current pixel Patch around the current pixel is compared to the patches of the neighborhood pixels... (-1,-1) Current pixel shift=(0,0) (0,-1) (1,1) (-1,0) [3] P. Amon et al., RCE2: Sample-based weighted intra prediction for lossless coding, document JCTVC-M0052, JCT-VC, Apr. 2013. Page 42

Experimental Results Coding efficiency: Relative bitrate differences 1 for EL coding compared to SHM-2.1 HM-11.0 SELC QP22 QP27 QP32 QP37 QP22 QP27 QP32 QP37 1.2% 1.0% 0.3% 0.8% -2.6% -4.7% -6.5% -7.3% Runtime: Relative runtime increase 2 for EL processing compared to BL processing only SHM-2.1 HM-11.0 SELC QP22 QP27 QP32 QP37 QP22 QP27 QP32 QP37 QP22 QP27 QP32 QP37 Enc 25.3% 30.6% 34.9% 37.7% 18.5% 22.5% 25.5% 27.7% 0.6% 0.7% 0.9% 0.8% Dec 244.4% 338.9% 443.6% 536.2% 260.1% 361.9% 451.4% 533.8% 202.8% 279.6% 334.3% 374.8% 1 : average values w/o ElFuente 2 : average values for all sequences Page 45

Overview Advancing compression efficiency Spatiotemporal prediction In-loop video denoising Measurement and prediction of energy consumption Improving displaying quality Scalable lossless compression Multi-view video using super-resolution Random sampling and reconstruction Conclusions and outlook Page 46

Super-Resolution Super-Resolution (SR) is a key issue in image and video processing domain Goal: create reasonable high-frequency content for a low-resolution image or video sequence +? = Page 47

Motivation Mixed-resolution multi-view video plus depth format (MR-MVD) Goal: Usage of neighboring high-frequency content to refine lowresolution destination view Page 48

Super-Resolution Based on High-Frequency Synthesis State of the art: l( u, l l ( u, + l ( u, l h ( u, left view right view r( u, r l ( u, d r ( u, warping r h ( u, Page 49

Super-Resolution Based on High-Frequency Synthesis Impact of additional depth inaccuracies on visual SR quality: original translation scale zoom Different depth distortion scenarios have different impact on SR quality Goal: Create an algorithm that is robust to each of those distortions Page 52

Displacement-Compensated Super-Resolution l( u, l l ( u, + l dc ( u, Displacement estimation Displacement compensation l l ( u, l h ( u, left view right view warping warping r( u, r l ( u, d r ( u, r h ( u, [Richter, Kaup, CSVT 2015] Page 53

Simulation Results Translation: Shifting all depth entries 5 pixel positions to the top right. Scaling: Limiting the 8 bit depth entries [0; 255] to [0; 127]. Zoom: Dropping 10% of rows and columns and resizing the cropped depth map via nearest neighbor interpolation. Page 59

Simulation Results PSNR evaluation, 2 Original depth Translated depth Scaled depth Zoomed depth l l l l dc ( u, ( u, l dc ( u, l ( u, l l dc ( u, ( u, l dc ( u, l ( u, ( u, Ballet Breakdancers Cones Teddy Avg. gain 36.97 38.82 33.10 33.71 36.68 37.38 34.63 35.09 38.01 37.95 34.71 35.43 0.58 35.98 37.47 34.18 34.82 38.11 38.04 34.61 35.48 0.95 34.77 36.11 30.35 31.04 37.83 38.01 34.24 35.06 3.22 34.16 36.86 32.23 33.16 37.80 37.86 34.09 35.07 2.10 Page 60

Simulation Results Visual comparison: ballet l l ( u, l ( u, l( u, l dc ( u, Page 62

Overview Advancing compression efficiency Spatiotemporal prediction In-loop video denoising Measurement and prediction of energy consumption Improving displaying quality Scalable lossless compression Multi-view video using super-resolution Random sampling and reconstruction Conclusions and outlook Page 64

Example: ¼ Sampling Mask FSE Low-resolution Sensor Masked Sensor High-resolution Image Large pixel Acquired pixel Reconstructed pixel [Schöberl, Seiler, Kaup, ICIP 2011] Page 66

Aliasing Regular versus non-regular sub-sampling Page 67

Frequency Selective Extrapolation Sparse signal model generation as a weighted superposition of Fourier basis functions [Seiler, Kaup, SPL 2010] Page 68

Frequency Selective Extrapolation Measured image Signal model Reconstructed image Page 69

Reconstruction by Frequency Selective Extrapolation (it.500) (it.100) (it.200) (it.50) (it.10) (it.5) (it.1) Sampled image Reconstructed image Page 70

Comparison Low resolution image Reconstructed image Page 75

Simulation Results on Image Data Base Reconstruction algorithm PSNR [db] (KODAK) PSNR [db] (TECNICK) Frequency Selective Extrapolation 28.80 31.50 Linear Interpolation 27.31 29.81 Steering Kernel Regression 27.55 30.30 [M. Jonscher, J. Seiler, T. Richter, M. Bätz, A. Kaup, ICIP 2014] Page 76

Summary and Conclusions Future video communication systems will require more efficient compression and be more immersive Efficient compression Video is a cube: Spatiotemporal prediction Noise might be significant: In-loop denoising Energy will play a role: Decoding energy measurement Improved immersiveness Picture quality matters: Scalable lossless coding 3D is on the way: Super-resolution for multi-view Sampling revisited: Random pixel reconstruction Page 77

About the Future Page 78