Perceptual Coding: Hype or Hope?

Similar documents
Project No. LLIV-343 Use of multimedia and interactive television to improve effectiveness of education and training (Interactive TV)

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

KEY INDICATORS FOR MONITORING AUDIOVISUAL QUALITY

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

Lecture 2 Video Formation and Representation

Evaluation of video quality metrics on transmission distortions in H.264 coded video

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

PERCEPTUAL QUALITY COMPARISON BETWEEN SINGLE-LAYER AND SCALABLE VIDEOS AT THE SAME SPATIAL, TEMPORAL AND AMPLITUDE RESOLUTIONS. Yuanyi Xue, Yao Wang

UC San Diego UC San Diego Previously Published Works

Intra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences

PERCEPTUAL QUALITY ASSESSMENT FOR VIDEO WATERMARKING. Stefan Winkler, Elisa Drelie Gelasca, Touradj Ebrahimi

A SUBJECTIVE STUDY OF THE INFLUENCE OF COLOR INFORMATION ON VISUAL QUALITY ASSESSMENT OF HIGH RESOLUTION PICTURES

PERCEPTUAL VIDEO QUALITY ASSESSMENT ON A MOBILE PLATFORM CONSIDERING BOTH SPATIAL RESOLUTION AND QUANTIZATION ARTIFACTS

Inputs and Outputs. Review. Outline. May 4, Image and video coding: A big picture

DATA hiding technologies have been widely studied in

Understanding IP Video for

HEBS: Histogram Equalization for Backlight Scaling

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

Monitoring video quality inside a network

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

Selective Intra Prediction Mode Decision for H.264/AVC Encoders

Understanding PQR, DMOS, and PSNR Measurements

P SNR r,f -MOS r : An Easy-To-Compute Multiuser

Minimax Disappointment Video Broadcasting

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

An Evaluation of Video Quality Assessment Metrics for Passive Gaming Video Streaming

Video Quality Monitoring for Mobile Multicast Peers Using Distributed Source Coding

A Study of Encoding and Decoding Techniques for Syndrome-Based Video Coding

Systematic Lossy Error Protection of Video based on H.264/AVC Redundant Slices

Quantify. The Subjective. PQM: A New Quantitative Tool for Evaluating Display Design Options

HIGH DYNAMIC RANGE SUBJECTIVE TESTING

Reconstruction of Ca 2+ dynamics from low frame rate Ca 2+ imaging data CS229 final project. Submitted by: Limor Bursztyn

Deliverable reference number: D2.1 Deliverable title: Criteria specification for the QoE research

ERROR CONCEALMENT TECHNIQUES IN H.264 VIDEO TRANSMISSION OVER WIRELESS NETWORKS

ARTEFACTS. Dr Amal Punchihewa Distinguished Lecturer of IEEE Broadcast Technology Society

Scalable Foveated Visual Information Coding and Communications

Lecture 1: Introduction & Image and Video Coding Techniques (I)

Video Codec Requirements and Evaluation Methodology

A Statistical Framework to Enlarge the Potential of Digital TV Broadcasting

COMPRESSION OF DICOM IMAGES BASED ON WAVELETS AND SPIHT FOR TELEMEDICINE APPLICATIONS

On viewing distance and visual quality assessment in the age of Ultra High Definition TV

3D MR Image Compression Techniques based on Decimated Wavelet Thresholding Scheme

STAT 113: Statistics and Society Ellen Gundlach, Purdue University. (Chapters refer to Moore and Notz, Statistics: Concepts and Controversies, 8e)

Case Study: Can Video Quality Testing be Scripted?

Feasibility Study of Stochastic Streaming with 4K UHD Video Traces

Digital Representation

Advantages of Incorporating Perceptual Component Models into a Machine Learning framework for Prediction of Display Quality

Image and video encoding: A big picture. Predictive. Predictive Coding. Post- Processing (Post-filtering) Lossy. Pre-

Estimating the impact of single and multiple freezes on video quality

Image Compression Techniques Using Discrete Wavelet Decomposition with Its Thresholding Approaches

Project Proposal: Sub pixel motion estimation for side information generation in Wyner- Ziv decoder.

Lund, Sweden, 5 Mid Sweden University, Sundsvall, Sweden

IEEE TRANSACTIONS ON BROADCASTING 1

Real Time PQoS Enhancement of IP Multimedia Services Over Fading and Noisy DVB-T Channel

Agenda minutes each

A Novel Parallel-friendly Rate Control Scheme for HEVC

Spatially scalable HEVC for layered division multiplexing in broadcast

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

OBJECTIVE VIDEO QUALITY METRICS: A PERFORMANCE ANALYSIS

RATE-DISTORTION OPTIMISED QUANTISATION FOR HEVC USING SPATIAL JUST NOTICEABLE DISTORTION

Data Representation. signals can vary continuously across an infinite range of values e.g., frequencies on an old-fashioned radio with a dial

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

A Color Gamut Mapping Scheme for Backward Compatible UHD Video Distribution

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Browsing News and Talk Video on a Consumer Electronics Platform Using Face Detection

Pick your Layers wisely - A Quality Assessment of H.264 Scalable Video Coding for Mobile Devices

Schemes for Wireless JPEG2000

SUBJECTIVE QUALITY OF VIDEO BIT-RATE REDUCTION BY DISTANCE ADAPTATION

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ICASSP.2016.

1022 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 19, NO. 4, APRIL 2010

Distributed Video Coding Using LDPC Codes for Wireless Video

ACHIEVING HIGH QOE ACROSS THE COMPUTE CONTINUUM: HOW COMPRESSION, CONTENT, AND DEVICES INTERACT

Technology Cycles in AV. An Industry Insight Paper

MULTIMEDIA TECHNOLOGIES

SCALABLE video coding (SVC) is currently being developed

Introduction to image compression

Reduced-reference image quality assessment using energy change in reorganized DCT domain

THE CAPABILITY of real-time transmission of video over

Visual Communication at Limited Colour Display Capability

Frame Processing Time Deviations in Video Processors

PSNR r,f : Assessment of Delivered AVC/H.264

Objective Quality Analysis of MPEG-1, MPEGZ & Windows Media Video

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

ROBUST ADAPTIVE INTRA REFRESH FOR MULTIVIEW VIDEO

INTRA-FRAME WAVELET VIDEO CODING

SUBJECTIVE ASSESSMENT OF H.264/AVC VIDEO SEQUENCES TRANSMITTED OVER A NOISY CHANNEL

How to Manage Video Frame- Processing Time Deviations in ASIC and SOC Video Processors

NO-REFERENCE QUALITY ASSESSMENT OF HEVC VIDEOS IN LOSS-PRONE NETWORKS. Mohammed A. Aabed and Ghassan AlRegib

CODING EFFICIENCY IMPROVEMENT FOR SVC BROADCAST IN THE CONTEXT OF THE EMERGING DVB STANDARDIZATION

What is Statistics? 13.1 What is Statistics? Statistics

PER-TITLE ENCODING. Jan Ozer Jan Ozer, 2017, all rights reserved

Objective Video Quality Assessment of Direct Recording and Datavideo HDR-40 Recording System

Constant Bit Rate for Video Streaming Over Packet Switching Networks

Computer Audio and Music

Essence of Image and Video

Design Challenge of a QuadHDTV Video Decoder

Quality impact of video format and scaling in the context of IPTV.

06 Video. Multimedia Systems. Video Standards, Compression, Post Production

Interleaved Source Coding (ISC) for Predictive Video Coded Frames over the Internet

Frame Interpolation and Motion Blur for Film Production and Presentation GTC Conference, San Jose

Transcription:

QoMEX 2016 Keynote Speech Perceptual Coding: Hype or Hope? June 6, 2016 C.-C. Jay Kuo University of Southern California 1

Is There Anything Left in Video Coding? First Asked in Late 90 s Background After MPEG-4 Bottleneck in low-bit-rate video coding In Response H.264/AVC & H.265 Asked Again Recently Background After great success of H.264 and completion of H.265 Industrial companies taking the lead nowadays & focusing on performance fine-tuning In Response Perceptual coding?? 2

Video Coding in Next Decade (2015-25) Two driving applications Ultra High Definition (UHD) Video UHD video (4K or 8K) at a bit rate of 25Mbps or above TV broadcasting Streaming Video HD/UHD video at a rate of 1-10Mbps Streaming to mobile devices Both somehow can be covered by H.264/H.265 standards Will video coding R&D still be vibrant? Business perspective: If demand (video streaming applications) is still higher than supply (channel bandwidth, 4G+WiFi), there will be a need Technology perspective: any new idea? 3

New Quality Metrics Mathematically Driven Mostly data fitting with a parametric model QA databases are needed for parameter/weight training Examples: SSIM, FSIM, MMF, etc. Lin and Kuo, Perceptual visual quality metrics: a survey, Journal of Visual Communication and Image Representation, 2011. No psychovisual support Coarse-grain quality metrics Psychovisual Model Driven Built upon human visual system (HVS) characteristics Hu, et al., Compressed image quality metric based on perceptually weighted distortion, IEEE T-IP, Vol. 24, No. 12, December 2015. Hu, et al., Objective video quality assessment based on perceptually weighted mean-squared-error, to appear in IEEE T-CSVT 4

Nothing expected to be dramatically different along this direction FRESH IDEA IS ESSENTIAL! 5

Perceived Quality: Continuous or Discrete? JPEG libjpeg Quality Factor: 1~100 + Original = 101 quality levels Can you differentiate all of them? 6

Stair Quality Function (SQF) 7

Just-Noticeable-Difference (JND) Revisited 1. Anchor 2. Non-noticeable 3. Just Noticeable 4. Noticeable 8

From QoS to QoE Quality of Services (QoS) System Centric Quality of Experience (QoE) Human Centric 9

MCL-JCI Dataset Media Communications Lab (MCL) JND-based Compressed Image (JCI) Dataset 50 source images Resolution: 1920x1080 Source image + 100 JPEG-coded image with QF=1,,100 Each was viewed by 30 subjects Just released in the MCL website http://mcl.usc.edu/mcl-jci-dataset/

50 Source Images in MCL-JCI Dataset Image Resolution: 1920 x 1080 Each is coded by JPEG with QF=1,, 100 Total dataset size: 50x101=5,050 11

SI & Colorfulness Distribution of Source Images 12

Semantics and Property-Specific Categories Number People 5 Animals 3 Plants 4 Buildings 8 Water or Lake 5 Sky 3 Bridge 3 Boats or Cars 5 Indoor 8 Dark Scene 6 13

Subjective Tests Participants About 150 people Age: 20-40 Each source image viewed by 30 subjects Controlled environment Viewing distance: 2 meters (1.6 times the picture height) Displayed on a 65 TV with native resolution of 3840x2160 Two images displayed side-by-side: anchor and comparison images 14

Sequential JND Point Search 15

Bisection Search 16

Statistics of JND Numbers and Highest/Lowest QF Values Left scale: Mean of JND numbers (blue bar): 5-9 Standard Deviation of JND numbers (green bar): 1-4 17 Right scale: Highest QF values of JND points (yellow curve): 36-88 Lowest QF values of JND points (orange curve): 5

Number of Level = 5 (Minimum cases) 18

Number of Level = 8 (Maximum cases) 19

JND Distribution JND is a random variable Different from person to person Highly dependent on visual content and test environment 20

JND Processing Pipeline GMM (Gaussian Mixture Model) JND number: the number of mixtures JND location: the mean of each mixture JND height: the weight of each mixture JND Histogram Processed JND Points SQF 21

Variance of JND Variance Can be used to determine the Q-function The tail region indicates the percentage of viewers who can tell the difference between the two quality levels (unsatisfied viewers) JND-based Rate Control Smaller variance: easier to meet the need of most viewers Small variance Large variance 22

Extension from Image to Video Divide a video sequence into smaller intervals that have similar content Apply the JND idea to each interval The JND becomes a time-varying function

24 SQF for FoxBird of 5-Second Length

MCL-JCV Dataset Media Communications Lab (MCL) JND-based Compressed Video (JCV) Dataset 30 video clips Duration: 5 seconds Frame Rate: 24, 25 or 30 fps Resolution: 1920x1080 Each was viewed by 50 subjects All data have been collected. They are under analysis now and will be released soon

Video Sources Representative thumbnail frames of 30 selected source sequences

Stair Quality Function (SQF) and Rate Control Rate control via satisfied user ratio (SUR) Coding bit rate is determined by perception experience of a large number of audience (rather than several gold eyes) Choose the proper rate so that X% cannot see the difference while (100-X)% can -> X% is called the satisfied user ratio (SUR) 27

From PSNR to SUR What is the desired bit rate for certain content within a short interval? We can use the measured 1 st JND statistics to answer this question The 1 st JND point -> from perceptually lossless to perceptually lossy

Histogram of 1st JND for Kimono raw samples cleaned samples (same as raw, no outliers)

Video Source: Kimono (slide 26) What is the desired bit rate for Kimono? 27 Mbps provides a good tradeoff between the SUR and bandwidth requirement from the right figure No such information is available from the left figure

Kimono 85Mbps

Kimono 20Mbps

Histogram of 1st JND for CarFirework Hist of raw samples Hist of cleaned samples (1 outlier was removed)

Video Source: CarFirework What is the desired bit rate for CarFirework? 19 Mbps provides a good tradeoff between the SUR (80%) and bandwidth

CarFirework 44Mbps

CarFirework 19Mbps

Histogram of 1st JND for FoxBird Hist of raw samples Hist of cleaned samples (2 outliers were removed)

Video Source: FoxBird What is the desired bit rate for FoxBird? 2.6 Mbps provides a good tradeoff between the SUR (85%) and bandwidth The SUR vs BR is highly content dependent

FoxBird 4.1Mbps

FoxBird 2.6Mbps

Prediction of the 1 st JND Distribution Linking perceptual video coding to big data and machine learning Feature extraction Machine learning Need a training dataset Diversified content 41

Completely New Ball Game Insufficiency of rate-distortion theory Source coding theory or joint source-channel coding theory does not take human perception into account Human perception is a statistical phenomenon Varying between individuals HVS is a nonlinear system The system fires only when the signal is above a certain threshold Foundation of JND 42

Future R&D (1) Prediction of SQF for arbitrary image/video content Machine learning Training: demanding ground truth data (MCL-JCI and MCL-JCV) The first JND point is the most critical one Mean (location) and spread (standard deviation) Time varying: demanding dynamic prediction Rate control can be done based on the satisfied user ratio (SUR) criterion Develop JND-based perceptual coder Push the first JND point as far as possible (smaller QF or larger QP) May lead to the next generation video coding standard 43

Future R&D (2) Other JND-based QoE Applications Image/video super-resolution Image/video retargeting Image/video content delivery 44

Acknowledgements USC Haiqiang Wang Sudeng Hu Joe Yuchieh Lin Lina Jin Netflix Ioannis Katsavounidis Anne Aaron David Ronca

Related Publications MCL-JCV dataset Haiqiang Wang, Weihao Gan, Sudeng Hu, Joe Yuchieh Lin, Lina Jin, Longguang Song, Ping Wang, Ioannis Katsavounidis, Anne Aaron and C.-C. Jay Kuo, MCL-JCV: a JND-based H.264/AVC video quality assessment dataset, IEEE ICIP, Phoenix, Arizona, USA, September 25-28, 2016. GMM-based stair quality model Sudeng Hu, Haiqiang Wang and C.-C. Jay Kuo, A GMM-based stair quality model for human perceived JPEG images, IEEE ICASSP, Shanghai, China, March 20-25, 2016. MCL-JCI dataset Lina Jin, Joe Yuchieh Lin, Sudeng Hu, Haiqiang Wang, Ping Wang, Ioannis Katsavounidis, Anne Aaron and C.-C. Jay Kuo. Statistical Study on Perceived JPEG Image Quality via MCL-JCI Dataset Construction and Analysis, IS&T Conference on Human Vision and Electronic Imaging (HVEI), San Francisco, CA, USA, February 14-18, 2016. JND-based quality measure of coded image/video Joe Yuchieh Lin, Lina Jin, Sudeng Hu, Ioannis Katsavounidis, Anne Aaron and C.-C. Jay Kuo. Experimental Design and Analysis of JND Test on Coded Image/Video. SPIE Optical Engineering+ Applications. International Society for Optics and Photonics, August 12, 2015.