An Evaluation of Video Quality Assessment Metrics for Passive Gaming Video Streaming

Nabajeet Barman*, Steven Schmidt, Saman Zadtootaghaj, Maria G. Martini*, Sebastian Möller
*Wireless Multimedia & Networking Research Group, Kingston University, London, U.K.
Quality and Usability Lab, TU Berlin, Germany
Telekom Innovation Labs, Deutsche Telekom AG, Berlin, Germany

Motivation
- Gaming videos are increasing in popularity.
- The number of gaming OTT providers is increasing: Twitch.tv, YouTube Gaming, Hitbox.tv.
- Twitch.tv alone has approximately 2 million streamers and 15 million daily active users.
- Twitch.tv ranked 4th in total traffic during peak hours, after Netflix, Google and Apple.

Motivation (continued)
- Gaming videos consist of synthetic and artificial content.
- Streaming requirements: real-time, CBR, 1-pass encoding.

Motivation (continued)
- Many metrics are based on properties inherent to natural images and videos: SSIM, NIQE, BRISQUE, etc.
- The applicability and performance of VQA metrics on gaming videos remain an open question.
- Gaming videos: no reference available.
- Performance analysis and the design of good no-reference (NR) metrics are critical for QoE estimation.

GamingVideoSET
- 24 videos, 2 each from 12 different games.
- Reference: N. Barman, S. Zadtootaghaj, S. Schmidt, M. G. Martini, and S. Möller, "GamingVideoSET: A Dataset for Gaming Video Streaming Applications," in 2018 16th Annual Workshop on Network and Systems Support for Games (NetGames 2018), Amsterdam, 2018.

Spatial Information (SI) vs. Temporal Information (TI)

Video Encoding Parameters

Parameter                    Value
Duration                     30 sec
Resolution                   1080p, 720p, 480p
Frame Rate                   30 fps
Number of Reference Videos   24
Encoder                      FFmpeg
Encoding Mode                CBR
Video Compression Standard   H.264, Main 4.0
Preset                       Veryfast

Resolution-Bitrate Pairs

Resolution   Bitrate (kbps)
1080p        600, 750, 1000, 1200, 1500, 2000, 3000, 4000
720p         500, 600, 750, 900, 1200, 1600, 2000, 2500, 4000
480p         300, 400, 600, 900, 1200, 2000, 4000

On the slide, the bitrates set in bold are those used in the subjective quality assessment.
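The encoding parameters and the bitrate ladder can be sketched as a small script. This is only an illustration: the slides do not show the FFmpeg command line that was actually used, so the helper name `ffmpeg_cbr_command`, the file names, the `-bufsize` value, and the use of equal `-b:v`/`-minrate`/`-maxrate` values to approximate CBR are all assumptions.

```python
# Bitrate ladder from the slide (kbps), keyed by frame height.
LADDER_KBPS = {
    1080: [600, 750, 1000, 1200, 1500, 2000, 3000, 4000],
    720: [500, 600, 750, 900, 1200, 1600, 2000, 2500, 4000],
    480: [300, 400, 600, 900, 1200, 2000, 4000],
}


def ffmpeg_cbr_command(src, dst, height, kbps):
    """Build a 1-pass, CBR-like libx264 command (hypothetical helper).

    Setting -b:v, -minrate and -maxrate to the same value is one common
    way to approximate CBR; the buffer size is an assumed value.
    """
    rate = f"{kbps}k"
    return [
        "ffmpeg", "-y", "-i", src,
        "-vf", f"scale=-2:{height}",   # rescale to the target height
        "-r", "30",                    # 30 fps, as in the parameter table
        "-c:v", "libx264",
        "-preset", "veryfast",
        "-profile:v", "main",
        "-level", "4.0",
        "-b:v", rate, "-minrate", rate, "-maxrate", rate,
        "-bufsize", f"{2 * kbps}k",    # assumption, not from the slides
        dst,
    ]


# Enumerate every resolution-bitrate pair in the ladder.
pairs = [(h, b) for h, rates in LADDER_KBPS.items() for b in rates]
print(len(pairs))        # 24 resolution-bitrate pairs
print(len(pairs) * 24)   # 576 encoded stimuli for 24 reference videos

print(" ".join(ffmpeg_cbr_command("reference.mp4", "out_720p_600k.mp4", 720, 600)))
```

The pair count (8 + 9 + 7 = 24) matches the 24 resolution-bitrate pairs and 576 stimuli reported for the full dataset.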

Evaluated VQA Metrics
- Full Reference (FR)
  - Peak Signal to Noise Ratio (PSNR)
  - Structural Similarity Index Metric (SSIM)
  - Video Multi-Method Assessment Fusion (VMAF)
- Reduced Reference (RR)
  - ST-RREDOpt: optimized version of the Spatio-Temporal Reduced Reference Entropic Difference (ST-RRED) metric
  - Spatial Efficient Entropic Differencing for Quality Assessment (SpEED-QA)
- No Reference (NR)
  - Blind Image Quality Index (BIQI)
  - Blind/Referenceless Image Spatial Quality Evaluator (BRISQUE)
  - Natural Image Quality Evaluator (NIQE)
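As a concrete example of a full-reference metric, PSNR compares the distorted frame with the reference pixel by pixel. The sketch below assumes 8-bit samples (peak value 255) passed as flat lists; the function name `psnr` and the sample values are illustrative and not taken from the presentation.

```python
import math


def psnr(reference, distorted, peak=255.0):
    """PSNR in dB between two equal-length sequences of pixel values."""
    if len(reference) != len(distorted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixels.
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(peak ** 2 / mse)


# Made-up luma samples standing in for a reference and a compressed frame.
ref = [52, 55, 61, 59, 79, 61, 76, 61]
dist = [54, 55, 60, 58, 80, 63, 75, 60]
print(round(psnr(ref, dist), 2))
```

For video, a per-frame PSNR of this kind is typically averaged over all frames of the sequence.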

Quality vs. Bitrate Curves
Figure: quality vs. bitrate curves for the eight VQA metrics, considering all videos at 1080p resolution.

Subjective Test
- Six videos: CSGO, H1Z1, HS, FIFA, LoL and PC.
- 3 resolutions, 5 bitrates each: 90 stimuli.
- Subjective test methodology: ACR (scale: 1-5).
- Test environment: ITU-R Rec. BT.500.
- Number of test participants: 25.
- Display monitor: 22-inch FHD ViewSonic.
- 720p and 480p videos decoded and rescaled to 1080p.
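The ACR ratings for each stimulus are summarized as a mean opinion score (MOS) with a confidence interval. The sketch below assumes a normal approximation for the 95% interval; the function name `mos_with_ci` and the ratings are made up for illustration, and the slides do not state how the intervals were computed.

```python
from statistics import NormalDist, mean, stdev


def mos_with_ci(ratings, confidence=0.95):
    """Return (MOS, half-width of the confidence interval) for ACR ratings."""
    n = len(ratings)
    if n < 2:
        raise ValueError("need at least two ratings")
    mos = mean(ratings)
    # Normal approximation; with about 25 subjects a Student's t quantile
    # would give a slightly wider interval.
    z = NormalDist().inv_cdf(0.5 + confidence / 2.0)
    half_width = z * stdev(ratings) / n ** 0.5
    return mos, half_width


# Hypothetical ratings from 25 participants for one stimulus (scale 1-5).
ratings = [4] * 10 + [3] * 10 + [5] * 5
mos, ci = mos_with_ci(ratings)
print(f"MOS = {mos:.2f} +/- {ci:.2f}")

# Number of stimuli in the test: 6 videos x 3 resolutions x 5 bitrates.
print(6 * 3 * 5)
```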

Correlation Analysis
- Performance of the VQA metric scores with respect to MOS.
- "All Data" refers to the combined data of all three resolutions and their bitrates.
- FR and RR metrics are evaluated on the rescaled videos; NR metrics are evaluated on the non-rescaled videos.
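Metric performance is reported as the Pearson linear correlation coefficient (PLCC) and the Spearman rank-order correlation coefficient (SROCC) between metric scores and MOS. A minimal pure-Python sketch of both follows; the function names and the sample scores are illustrative, and any fitting step applied before computing PLCC is not shown on the slides and is therefore omitted.

```python
import math


def pearson(x, y):
    """Pearson linear correlation coefficient (PLCC)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)


def average_ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank-order correlation (SROCC): Pearson on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Made-up metric scores and MOS values for five stimuli.
metric_scores = [30.0, 40.0, 55.0, 70.0, 90.0]
mos_values = [1.5, 2.1, 3.0, 3.8, 4.6]
print(round(pearson(metric_scores, mos_values), 3))
print(round(spearman(metric_scores, mos_values), 3))
```

PLCC measures how linear the relationship is, while SROCC only checks whether the metric orders the stimuli in the same way as the subjects did.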

Discussion 1: Comparison of VQA Metrics with respect to MOS
- VMAF results in the highest correlation values.
- Both RR metrics give almost the same results.
  - SpEED-QA is seven times faster than ST-RREDOpt.
- NR metrics:
  - BIQI performs the worst.
  - BRISQUE and NIQE perform almost equally at 1080p and 720p.
  - At 480p and on All Data, NIQE performs better than BRISQUE.

Discussion 2: Impact of resolution on VQA metrics
- For FR and NR metrics, performance decreases from higher to lower resolution.
- For RR metrics, 720p results in the highest correlation value.
- Fisher's Z-test is used to assess the significance of the difference between the correlation values of two resolutions:
  - The difference between 1080p and 720p is not statistically significant.
  - The difference between 720p and 480p is statistically significant.
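Fisher's Z-test compares two correlation coefficients after transforming each with the inverse hyperbolic tangent. The sketch below uses the standard form for two independent samples, which is a simplification here because the same participants rated all resolutions. The sample size of 30 per resolution follows from 6 videos and 5 bitrates; the correlation values are invented for illustration and are not results from the presentation.

```python
import math
from statistics import NormalDist


def fisher_z_test(r1, n1, r2, n2):
    """Two-sided Fisher Z-test for the difference of two correlations.

    Returns (z statistic, p-value), assuming independent samples.
    """
    z1, z2 = math.atanh(r1), math.atanh(r2)       # Fisher transformation
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = (z1 - z2) / se
    p = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return z, p


N_PER_RESOLUTION = 6 * 5  # 6 videos x 5 bitrates rated at each resolution

# Hypothetical correlation values for one metric at two resolution pairs.
z_a, p_a = fisher_z_test(0.90, N_PER_RESOLUTION, 0.88, N_PER_RESOLUTION)
z_b, p_b = fisher_z_test(0.88, N_PER_RESOLUTION, 0.60, N_PER_RESOLUTION)
print(f"small gap: z = {z_a:.2f}, p = {p_a:.3f}")
print(f"large gap: z = {z_b:.2f}, p = {p_b:.3f}")
```

With only 30 stimuli per resolution, a small gap between two high correlations does not reach significance at the 5% level, whereas a large gap does.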

Discussion 3: Performance degradation at lower resolutions
- For each VQA metric, performance at 480p is considerably lower than at 720p and 1080p.
- The decrease in performance is larger for some metrics than for others.

Figure: MOS (with 95% confidence interval), PSNR and VMAF values for the CSGO video sequence.

Evaluation over the full dataset
- Use VMAF scores as ground truth.
- Evaluate the performance of the remaining seven metrics on the full dataset.
- 24 videos × 24 resolution-bitrate pairs = 576 stimuli.

Correlation Analysis
- Performance of the VQA metric scores with respect to VMAF.
- "All Data" refers to the combined data of all three resolutions and their bitrates.
- Best-performing metric in terms of PLCC: PSNR at 1080p; NIQE at 720p and 480p.
- Best-performing metric in terms of SROCC: SpEED-QA at 1080p, 720p and 480p.
- Best-performing metric on All Data, for both PLCC and SROCC: PSNR.

Discussion 4: Decreased NR metric performance for All Data

NR metrics vs. VMAF scores
- Scatter plot of NR metric scores against VMAF scores, considering all three resolutions over the whole dataset.
- NR metrics may not be suitable for applications that adapt the spatial resolution (e.g. typical HTTP adaptive streaming solutions).

Conclusions
- VMAF results in the highest correlation with MOS in terms of both PLCC and SROCC values. SSIM and NIQE also perform quite well.
- The performance of all VQA metrics is worse at 480p than at 720p and 1080p.
- With VMAF values as the benchmark, PSNR results in the highest correlation on All Data.
- On All Data, the performance of the NR metrics decreases significantly.
- VQA metric performance on gaming videos is similar to that on non-gaming videos.
- Future work: investigate the shortcomings of existing NR metrics and improve them or develop new metrics.

Thank you!
This presentation is part of a project that has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 643072.