Video Codec Requirements and Evaluation Methodology

Similar documents
Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Chapter 2 Introduction to

Into the Depths: The Technical Details Behind AV1. Nathan Egge Mile High Video Workshop 2018 July 31, 2018

JPEG2000: An Introduction Part II

CODING EFFICIENCY IMPROVEMENT FOR SVC BROADCAST IN THE CONTEXT OF THE EMERGING DVB STANDARDIZATION

Video coding standards

Motion Video Compression

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

The H.263+ Video Coding Standard: Complexity and Performance

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Video Transmission. Thomas Wiegand: Digital Image Communication Video Transmission 1. Transmission of Hybrid Coded Video. Channel Encoder.

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

Error Resilient Video Coding Using Unequally Protected Key Pictures

Error concealment techniques in H.264 video transmission over wireless networks

1 Overview of MPEG-2 multi-view profile (MVP)

ROBUST ADAPTIVE INTRA REFRESH FOR MULTIVIEW VIDEO

Lecture 1: Introduction & Image and Video Coding Techniques (I)

MPEG-2. ISO/IEC (or ITU-T H.262)

A two-stage approach for robust HEVC coding and streaming

Compressed-Sensing-Enabled Video Streaming for Wireless Multimedia Sensor Networks Abstract:

RECOMMENDATION ITU-R BT * Video coding for digital terrestrial television broadcasting

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0. General Description. Applications. Features

The H.26L Video Coding Project

Joint source-channel video coding for H.264 using FEC

ERROR CONCEALMENT TECHNIQUES IN H.264 VIDEO TRANSMISSION OVER WIRELESS NETWORKS

Dual Frame Video Encoding with Feedback

AV1 Image File Format (AVIF)

DVB-UHD in TS

White Paper. Video-over-IP: Network Performance Analysis

Video 1 Video October 16, 2001

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

Error resilient H.264/AVC Video over Satellite for low Packet Loss Rates

Video Over Mobile Networks

INF5080 Multimedia Coding and Transmission Vårsemester 2005, Ifi, UiO. Wavelet Coding & JPEG Wolfgang Leister.

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Mauricio Álvarez-Mesa ; Chi Ching Chi ; Ben Juurlink ; Valeri George ; Thomas Schierl Parallel video decoding in the emerging HEVC standard

Chapter 2 Video Coding Standards and Video Formats

Video Coding IPR Issues

Embedding Multilevel Image Encryption in the LAR Codec

UHD + HDR SFO Mark Gregotski, Director LHG

MPEG Solutions. Transition to H.264 Video. Equipment Under Test. Test Domain. Multiplexer. TX/RTX or TS Player TSCA

17 October About H.265/HEVC. Things you should know about the new encoding.

OPEN STANDARD GIGABIT ETHERNET LOW LATENCY VIDEO DISTRIBUTION ARCHITECTURE

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

Improved H.264 /AVC video broadcast /multicast

SG16-TD155-A3/PLEN. STUDY GROUP 16 Original: English TELECOMMUNICATION STANDARDIZATION SECTOR. Question(s): 6/16 Macao, China, October 2017

Coding. Multiple Description. Packet networks [1][2] a new technology for video streaming over the Internet. Andrea Vitali STMicroelectronics

Improved Error Concealment Using Scene Information

SVC Uncovered W H I T E P A P E R. A short primer on the basics of Scalable Video Coding and its benefits

AV1: The Quest is Nearly Complete

Video Compression - From Concepts to the H.264/AVC Standard

OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0. General Description. Applications. Features

Modeling and Optimization of a Systematic Lossy Error Protection System based on H.264/AVC Redundant Slices

Advanced Computer Networks

AV1 Update. Thomas Daede October 5, Mozilla & The Xiph.Org Foundation

Video (Fundamentals, Compression Techniques & Standards) Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

A HIGH THROUGHPUT CABAC ALGORITHM USING SYNTAX ELEMENT PARTITIONING. Vivienne Sze Anantha P. Chandrakasan 2009 ICIP Cairo, Egypt

Deliverable reference number: D2.1 Deliverable title: Criteria specification for the QoE research

Robust Transmission of H.264/AVC Video Using 64-QAM and Unequal Error Protection

A robust video encoding scheme to enhance error concealment of intra frames

HIGH DYNAMIC RANGE SUBJECTIVE TESTING

Modeling and Evaluating Feedback-Based Error Control for Video Transfer

UHD 4K Transmissions on the EBU Network

Error Resilient Coding Techniques for Video Delivery over Vehicular Networks

HEVC Subjective Video Quality Test Results

Multimedia Communications. Image and Video compression

Study of AVS China Part 7 for Mobile Applications. By Jay Mehta EE 5359 Multimedia Processing Spring 2010

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /ISCAS.2005.

Using the VideoEdge IP Encoder with Intellex IP

Lecture 2 Video Formation and Representation

SCALABLE EXTENSION OF HEVC USING ENHANCED INTER-LAYER PREDICTION. Thorsten Laude*, Xiaoyu Xiu, Jie Dong, Yuwen He, Yan Ye, Jörn Ostermann*

Introduction. Packet Loss Recovery for Streaming Video. Introduction (2) Outline. Problem Description. Model (Outline)

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

The Multistandard Full Hd Video-Codec Engine On Low Power Devices

Multimedia Communications. Video compression

FINAL REPORT PERFORMANCE ANALYSIS OF AVS-M AND ITS APPLICATION IN MOBILE ENVIRONMENT

Multimedia Networking

MULTI-CORE SOFTWARE ARCHITECTURE FOR THE SCALABLE HEVC DECODER. Wassim Hamidouche, Mickael Raulet and Olivier Déforges

OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS

RECOMMENDATION ITU-R BT.1203 *

Robust Transmission of H.264/AVC Video using 64-QAM and unequal error protection

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 19, NO. 6, JUNE

MPEG-1 and MPEG-2 Digital Video Coding Standards

International Journal for Research in Applied Science & Engineering Technology (IJRASET) Motion Compensation Techniques Adopted In HEVC

Quality impact of video format and scaling in the context of IPTV.

Compressed Domain Video Compositing with HEVC

Understanding Compression Technologies for HD and Megapixel Surveillance

ATSC vs NTSC Spectrum. ATSC 8VSB Data Framing

Lund, Sweden, 5 Mid Sweden University, Sundsvall, Sweden

Content storage architectures

RECOMMENDATION ITU-R BT Methodology for the subjective assessment of video quality in multimedia applications

Analysis of Packet Loss for Compressed Video: Does Burst-Length Matter?

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

Error Resilience for Compressed Sensing with Multiple-Channel Transmission

Video System Characteristics of AVC in the ATSC Digital Television System

Inputs and Outputs. Review. Outline. May 4, Image and video coding: A big picture

Transcription:

Video Codec Reuirements and Evaluation Methodology www.huawei.com draft-ietf-netvc-reuirements-02 Alexey Filippov (Huawei Technologies), Andrey Norkin (Netflix), Jose Alvarez (Huawei Technologies)

Contents An overview of applications Reuirements Evaluation methodology Conclusions Slide 2

Applications Internet Video Streaming Internet Protocol Television (IPTV) Video conferencing Video sharing Screencasting Game streaming Video monitoring / surveillance Slide 3

Internet Video Streaming Basic reuirements: Significant improvements in compression efficiency between codec generations Random access to pictures Random Access Period (RAP) usually 2-5 seconds Support of wide range of content types and formats HDR and WCG Gains on lower resolutions is important for adaptive streaming (many resolution) Gains on easy content are also important for overall bitrate savings Efficiency for film grain encoding which is present in a lot of content Tools for perceptually optimized encoding High encoding complexity can be tolerated in software encoders (up to 10x) Bitstream should have a model allowing easy parsing and identification of components (frames, etc) Optional reuirements: Resolution, uality (SNR) and temporal (frame-rate) scalability Slide 4

Internet Video Streaming Resolution Frame-rate, fps Picture access mode 2160p (4K),3840x2160 24/1.001, 24, 25, 1080p (2K), 1920x1080 RA 30/1.001, 30, 50, 1080i, 1920x1080 * RA 720p, 1280x720 60/1.001, 60, 100, RA 576p (EDTV), 720x576 576i (SDTV), 720x576 * 480p (EDTV), 720x480 480i (SDTV), 720x480 * 512x384 QVGA, 320x240 120/1.001, 120 (Table 2 in ITU-R BT-2020) NB *: interlaced content can be handled at the higher system level and not necessarily by using specialized video coding tools. It is included in this table only for the sake of completeness as most video content today is in progressive format. RA RA RA RA RA RA RA Slide 5

Internet Protocol Television (IPTV) Basic reuirements: Significant improvements in compression efficiency between codec generations Random access to pictures Random Access Period (RAP) usually 0.5-1 seconds Support of wide range of content types and formats HDR and WCG Efficiency for film grain encoding which is present in a lot of content Tools for perceptually optimized encoding Bitstream should have a model allowing easy parsing and identification of components (frames, etc) Optional reuirements: Resolution, uality (SNR) and temporal (frame-rate) scalability Slide 6

Internet Protocol Television (IPTV) Resolution Frame-rate, fps Picture access mode 2160p (4K),3840x2160 24/1.001, 24, 25, 1080p, 1920x1080 RA 30/1.001, 30, 50, 1080i, 1920x1080 * RA 720p, 1280x720 60/1.001, 60, 100, RA 576p (EDTV), 720x576 576i (SDTV), 720x576 * 480p (EDTV), 720x480 480i (SDTV), 720x480 * 120/1.001, 120 (Table 2 in ITU-R BT-2020) NB *: interlaced content can be handled at the higher system level and not necessarily by using specialized video coding tools. It is included in this table only for the sake of completeness as most video content today is in progressive format. RA RA RA RA RA Slide 7

Video conferencing Basic reuirements: Delay should be kept as low as possible The preferable and maximum end-to-end delay values should be less than 100 ms and 320 ms, respectively Error robustness Low-complexity encoder Optional reuirements: Temporal (frame-rate), resolution and uality (SNR) scalability Slide 8

Video conferencing Resolution Frame-rate, fps Picture access mode 1080p, 1920x1080 15, 30 FIZD 720p, 1280x720 30, 60 FIZD 4CIF, 704x576 30, 60 FIZD 4SIF, 704x480 30, 60 FIZD VGA, 640x480 30, 60 FIZD 360p, 640x360 30, 60 FIZD Slide 9

Video sharing Basic reuirements: Random access to pictures for downloaded video data Temporal (frame-rate) scalability Resolution and uality (SNR) scalability Optional reuirements: Error robustness Typical scenarios: GoPro camera Cameras integrated into smartphones Slide 10

Video sharing* Resolution Frame-rate, fps Picture access mode 2160p (4K), 3840x2160 24, 25, 30, 48, 50, 60 RA 1440p (2K), 2560x1440 24, 25, 30, 48, 50, 60 RA 1080p, 1920x1080 24, 25, 30, 48, 50, 60 RA 720p, 1280x720 24, 25, 30, 48, 50, 60 RA 480p, 854x480 24, 25, 30, 48, 50, 60 RA 360p, 640x360 24, 25, 30, 48, 50, 60 RA * - Sources of these data: "Recommended upload encoding settings (Advanced)" https://support.google.com/youtube/answer/1722171?hl=en Slide 11

Screencasting Basic reuirements: Support of a wide range of input video formats RGB and YCbCr 4:4:4 in addition to YCbCr 4:2:0 and YCbCr 4:2:2 High visual uality up to visually and mathematically lossless Optional reuirements: Error robustness Slide 12

Screencasting Resolution Frame-rate, fps Picture access mode Input color format: RBG 4:4:4 5k, 5120x2880 15, 30, 60 AI, RA, FIZD 4k, 3840x2160 15, 30, 60 AI, RA, FIZD WQXGA, 2560x1600 15, 30, 60 AI, RA, FIZD WUXGA, 1920x1200 15, 30, 60 AI, RA, FIZD WSXGA+, 1680x1050 15, 30, 60 AI, RA, FIZD WXGA, 1280x800 15, 30, 60 AI, RA, FIZD XGA, 1024x768 15, 30, 60 AI, RA, FIZD SVGA, 800x600 15, 30, 60 AI, RA, FIZD VGA, 640x480 15, 30, 60 AI, RA, FIZD Slide 13

Screencasting Resolution Frame-rate, fps Picture access mode Input color format: YCbCr 4:4:4 5k, 5120x2880 15, 30, 60 AI, RA, FIZD 4k, 3840x2160 15, 30, 60 AI, RA, FIZD 1440p (2K), 2560x1440 15, 30, 60 AI, RA, FIZD 1080p, 1920x1080 15, 30, 60 AI, RA, FIZD 720p, 1280x720 15, 30, 60 AI, RA, FIZD Slide 14

Game streaming Basic reuirements: Random access to pictures Temporal (frame-rate) scalability Error robustness Optional reuirements: Resolution and uality (SNR) scalability Specific features: This content typically contains many sharp edges and large motion Slide 15

Video monitoring / surveillance Basic reuirements: Random access to pictures for downloaded video data Random Access Period (RAP) should be kept in the range of 1-5 seconds Low-complexity encoder Support of HDR In some cases, high uality (fidelity) of a video signal is reuired after lossy compression Optional reuirements: Support of WCG Support of a monochrome mode e.g., for infrared cameras Temporal, resolution and uality (SNR) scalability Slide 16

Video monitoring / surveillance Resolution Frame-rate, fps Picture access mode 2160p (4K),3840x2160 12 RA 5Mpixels, 2560x1920 12 RA 1080p, 1920x1080 25 RA 1.3Mpixels, 1280x960 25, 30 RA 720p, 1280x720 25, 30 RA SVGA, 800x600 25, 30 RA Slide 17

Reuirements General reuirements Basic reuirements Optional reuirements Slide 18

General reuirements Coding efficiency / compression performance Improvements over state-of-the-art video codecs such as HEVC/H.265 and VP9, at least, by 20-25%, preferably more Good uality specification and well-defined profiles and levels: They are reuired to enable device interoperability and facilitate decoder implementations High-level syntax should allow extensibility New features can be supported easily by using metadata such as SEI messages, VUI, headers Bit-stream should have a model that allows easy parsing and identification of components (such as frames) Similar to ISO/IEC14496-10, Annex B or ISO/IEC 14496-15 In particular, information needed for packet handling (e.g., frame type) should not reuire parsing anything below the header level. Slide 19

General reuirements (cont d) Support of perceptual uality tools such as adaptive QP and uantization matrices The codec specification should define a buffer model Such as hypothetical reference decoder (HRD) Specifications providing integration with system and delivery layers should be developed Slide 20

Basic reuirements Input source formats: Bit depth: 8- and 10-bits per color component Up to 12-bits for a high profile Color sampling formats: YCbCr 4:2:0 YCbCr 4:4:4, YCbCr 4:2:2 and YCbCr 4:0:0 (preferably in different profile(s)) Support of HDR and WCG For profiles with bit depth of 10 bits per sample or higher Support of arbitrary resolution (constrained to level limits) for such applications where a picture can have an arbitrary size e.g., in screencasting Slide 21

Basic reuirements (cont d) Coding delay Support of configurations with zero structural delay also referred to as lowdelay configurations Note: End-to-end delay should be up to 320 ms but its preferable value should be less than 100 ms Support of configurations with non-zero structural delay such as out-of-order or multi-pass encoding to provide additional compression efficiency improvements Scalability Temporal (frame-rate) scalability Slide 22

Basic reuirements (cont d) Complexity Feasible real-time implementation of both an encoder and a decoder for hardware and software implementation based on a wide range of state-of-the-art platforms Real-time encoder should provide sufficient improvement in compression efficiency at reasonable encoder complexity increase High-complexity software encoder implementations used by offline encoding applications They can have 10x or more complexity increase compared to state-of-the-art video compression technologies such as HEVC/H.265 and VP9 Slide 23

Basic reuirements (cont d) Error resilience Error resilience tools that are complementary to the error protection mechanisms implemented on transport level The codec should support mechanisms that facilitate packetization of a bitstream for common network protocols Packetization mechanisms should enable frame-level error recovery by means of retransmission or error concealment The bitstream specification should support independently decodable sub-frame units similar to slices or independent tiles It should be possible for the encoder to restrict the bit-stream to allow parsing of the bit-stream after a packet loss and to communicate it to the decoder Slide 24

Optional reuirements Input source formats: Bit depth: up to 16-bits per color component Color sampling formats: RGB 4:4:4 Support of auxiliary channel: e.g., alpha channel Scalability: Resolution and uality (SNR) scalability If they provide low compression efficiency penalty, they can be supported in the main profile Computational complexity scalability Computational complexity is decreasing along with degrading picture uality Slide 25

Optional reuirements (cont d) Complexity Tools that enable parallel processing at both encoder and decoder sides are highly desirable for many applications E.g., slices, tiles, wave front propagation processing High-level multi-core parallelism encoder and decoder operation, especially entropy encoding and decoding, should allow multiple frames or sub-frame regions (e.g. 1D slices, 2D tiles, or partitions) to be processed concurrently, either independently or with deterministic dependencies that can be efficiently pipelined Low-level instruction set parallelism favor algorithms that are SIMD/GPU friendly over inherently serial algorithms Coding efficiency Compression efficiency on noisy content, content with film grain, computer generated content, and low resolution materials is desirable Slide 26

Compression performance evaluation Methodology of compression performance evaluation Quality assessment Objective evaluation Subjective evaluation Slide 27

Methodology of compression performance evaluation (cont d) Objective evaluation in 3 ranges: Low-bitrate range Middle-bitrate range High-bitrate range Points are selected using the reference codec uality levels Bjøntegaard Delta (BD)-rate should be computed: An average value over all the 3 ranges should be provided Values for each range should be provided as well Slide 28

Quality assessment Objective evaluation Peak Signal-to-Noise Ration (PSNR) where B is the bit depth of source signal R and T are original and reconstructed signals, respectively PSNR = 20 Log 1 MN M N y= 1 x= 1 B ( 2 1) 2 ( R(x, y) S(x, y) ) Multiscale Structural Similarity (MS-SSIM) ssim(x i,y i )= [ l( x,y )] α [ c( x,y )] β [ s( x, y )] γ i i i i i i ssim(x, y i i )= ( 2µ xi µ yi + C1 )( 2σ xiyi + C2 ) 2 2 2 2 ( µ + µ + C )( σ + σ + C ) xi yi 1 xi yi 2 SSIM(X,Y)= 1 N N i= 1 ssim(x i, yi ) Slide 29

Quality assessment (cont d) Subjective evaluation Final and some intermediate decisions should be made using subjective evaluation Mean Opinion Score (MOS) MOS provides a numerical indication of the perceived uality of a picture or a picture seuence after a process such as compression, uantization, transmission and so on. The MOS is expressed as a single number in the range 1 to 5 in the case of a discrete scale (resp., 1 to 100 in the case of a continuous scale) where 1 is the lowest perceived uality, and 5 (resp., 100) is the highest perceived uality Confidence interval can be calculated Some outliers can be rejected This rejection allows us to correct influences induced by the observer's behavior, or bad choice of test pictures or picture seuences Slide 30

Methodology of compression performance evaluation In this draft, just a high-level evaluation framework is proposed Further details (e.g., a list of video seuences, concrete bit-rates, etc) are described in the testing draft The draft only encompasses an evaluation methodology for compression performance Reference software Reference software provided to the NETVC WG for candidate codecs should comprise a fully operational encoder that supports necessary rate controls, subjective uality optimization features and some degree of speed optimization and a real-time decoder Slide 31

Conclusions This document contains an overview of Internet video codec applications and typical use cases a prioritized list of reuirements for an Internet video codec The authors tried to take into account all the received comments An evaluation methodology for this codec is also proposed We recommend to adopt this document Slide 32

Thank You