New forms of video compression

Similar documents
DIRAC. The video compression family using open technology

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Understanding Compression Technologies for HD and Megapixel Surveillance

New Technologies for Premium Events Contribution over High-capacity IP Networks. By Gunnar Nessa, Appear TV December 13, 2017

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Speeding up Dirac s Entropy Coder

Digital Video Telemetry System

MPEG + Compression of Moving Pictures for Digital Cinema Using the MPEG-2 Toolkit. A Digital Cinema Accelerator

Introduction to image compression

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

06 Video. Multimedia Systems. Video Standards, Compression, Post Production

Implementation of an MPEG Codec on the Tilera TM 64 Processor

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

INTRA-FRAME WAVELET VIDEO CODING

Principles of Video Compression

In MPEG, two-dimensional spatial frequency analysis is performed using the Discrete Cosine Transform

Can JPEG2000 solve the challenge of HDTV Contribution over IP? Author: Janne T. Morstøl Helge Stephansen Ivar Rognstad

Automating the TV production workflow Production logging

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

Video coding standards

CHALLENGES IN NEW MEDIA NETWOKING APPLICATIONS

P1: OTA/XYZ P2: ABC c01 JWBK457-Richardson March 22, :45 Printer Name: Yet to Come

Audio and Video II. Video signal +Color systems Motion estimation Video compression standards +H.261 +MPEG-1, MPEG-2, MPEG-4, MPEG- 7, and MPEG-21

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Case Study: Can Video Quality Testing be Scripted?

Motion Video Compression

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

ATSC vs NTSC Spectrum. ATSC 8VSB Data Framing

AN MPEG-4 BASED HIGH DEFINITION VTR

Film Grain Technology

Video Coding IPR Issues

Digital Image Processing

A video signal consists of a time sequence of images. Typical frame rates are 24, 25, 30, 50 and 60 images per seconds.

How Does H.264 Work? SALIENT SYSTEMS WHITE PAPER. Understanding video compression with a focus on H.264

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Intra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences

Chapter 2 Introduction to

Chapter 10 Basic Video Compression Techniques

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

Digital Television Fundamentals

Content storage architectures

Digital Video Engineering Professional Certification Competencies

HEVC: Future Video Encoding Landscape

Video 1 Video October 16, 2001

Colour Reproduction Performance of JPEG and JPEG2000 Codecs

PixelNet. Jupiter. The Distributed Display Wall System. by InFocus. infocus.com

SUMMIT LAW GROUP PLLC 315 FIFTH AVENUE SOUTH, SUITE 1000 SEATTLE, WASHINGTON Telephone: (206) Fax: (206)

FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS

The Development of a Synthetic Colour Test Image for Subjective and Objective Quality Assessment of Digital Codecs

TECHNICAL MEDIA SPECIFICATION ON THE FILE BASED SUBMISSION OF MATERIALS TO BE AIRED

TECHNICAL SPECIFICATIONS FOR THE DELIVERY OF CLOSE TO TRANSMISSION TELEVISION PROGRAMMES TO THE

Advanced Computer Networks

Hands-On Real Time HD and 3D IPTV Encoding and Distribution over RF and Optical Fiber

Digital Media. Daniel Fuller ITEC 2110

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

OPEN STANDARD GIGABIT ETHERNET LOW LATENCY VIDEO DISTRIBUTION ARCHITECTURE

MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1

Research & Development. White Paper WHP 318. Live subtitles re-timing. proof of concept BRITISH BROADCASTING CORPORATION.

Overview: Video Coding Standards

Information Transmission Chapter 3, image and video

Transparent concatenation of MPEG compression

Avivo and the Video Pipeline. Delivering Video and Display Perfection

OVE EDFORS ELECTRICAL AND INFORMATION TECHNOLOGY

DICOM medical image watermarking of ECG signals using EZW algorithm. A. Kannammal* and S. Subha Rani

Format Conversion Design Challenges for Real-Time Software Implementations

White Paper. Video-over-IP: Network Performance Analysis

Multimedia Communications. Image and Video compression

MULTIMEDIA TECHNOLOGIES

The H.26L Video Coding Project

CERIAS Tech Report Preprocessing and Postprocessing Techniques for Encoding Predictive Error Frames in Rate Scalable Video Codecs by E

Understanding IP Video for

Coded Channel +M r9s i APE/SI '- -' Stream ' Regg'zver :l Decoder El : g I l I

Frame Processing Time Deviations in Video Processors

OL_H264e HDTV H.264/AVC Baseline Video Encoder Rev 1.0. General Description. Applications. Features

Video Over Mobile Networks

3Genie P.1. and centred so they appear as a window within the 1080p raster.

PITZ Introduction to the Video System

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

G406 application note for projector

COMP 9519: Tutorial 1

INTEGRATION, PROCESSING AND RECORDING OF AIRBORNE HIGH RESOLUTION SENSOR IMAGES

Video Processing Applications Image and Video Processing Dr. Anil Kokaram

Jupiter PixelNet. The distributed display wall system. infocus.com

Multiprojection and Capture

An Overview of Video Coding Algorithms

The implementation of HDTV in the European digital TV environment

By David Acker, Broadcast Pix Hardware Engineering Vice President, and SMPTE Fellow Bob Lamm, Broadcast Pix Product Specialist

DVS Whitepaper. DVS and JPEG2000: Hydra OEM board

Transitioning from NTSC (analog) to HD Digital Video

Video Basics. Video Resolution

Digital Terrestrial HDTV Broadcasting in Europe

THINKING ABOUT IP MIGRATION?

PAL uncompressed. 768x576 pixels per frame. 31 MB per second 1.85 GB per minute. x 3 bytes per pixel (24 bit colour) x 25 frames per second

OL_H264MCLD Multi-Channel HDTV H.264/AVC Limited Baseline Video Decoder V1.0. General Description. Applications. Features

A better way to get visual information where you need it.

EBU R The use of DV compression with a sampling raster of 4:2:0 for professional acquisition. Status: Technical Recommendation

The Multistandard Full Hd Video-Codec Engine On Low Power Devices

HDTV compression for storage and transmission over Internet


Datasheet Densité IPG-3901

Transcription:

New forms of video compression New forms of video compression Why is there a need? The move to increasingly higher definition and bigger displays means that we have increasingly large amounts of picture information to move, process and store. This applies not only for content to be consumed in the home but also in the emerging D Cinemas. So these huge amounts of video information need to be moved through the production, post production, versioning, playout and multiple distribution (and no doubt, archive, repurposing and resale). Although the cost of storage is going down, this is at least balanced by the increasing demand for larger amounts of data storage. Video compression has been used very successfully for many years as a way of reducing the amount of data being moved. But the number and complexity of programme paths to consumers via multiple distribution channels benefits from a rethink of how video compression is used. Why Dirac? Traditionally video compression technologies have been proprietary to specific manufacturers. Looked at in isolation these usually give excellent results. But concern has been expressed that when different systems are cascaded there can be concatenation effects, visible as a degradation of picture quality. This is an issue which particularly needs consideration in a production environment, where the signal may be coded and decoded many times across the programme path. The coding / decoding system used in Dirac and Dirac Pro has specifically been designed to minimise this type of effect. Indeed, in principle, the coder can be designed so there is no effect after the first generation. This system has been developed by BBC Research & Innovation using experience built up over tens of years and uses multi-level wavelet (as opposed to convention DCT) transforms to achieve optimum overall performance. The family of codecs has two parts, Dirac Pro (currently submitted to SMPTE for approval as VC-2) and Dirac. Dirac Pro is intended for professional use in environments such as production and post production, where high quality (i.e. either visually or mathematically lossless) is needed and multiple coding / decoding processes are likely. This is an intra-frame codec, designed to accommodate normal production processes such as vision mixing, editing, etc. The compression ratio can lie in the range 2:1 to 16:1. Latency for Dirac Pro through both coder and decoder is never greater than 2ms. The NuMedia DP100 and DP400 are all Dirac Pro products. In contrast, Dirac is designed for distribution applications where there are no production requirements. Therefore this can also include motion compensated prediction across a number of frames, i.e. inter-frame compression, and therefore can achieve up to 100:1 compression and a much lower bit rate. (a) original (b) concatenation effect 1

The benefits of Dirac Pro Dirac Pro compresses each HD- input by approximately 2:1 and multiplexes the compressed bit streams. The resultant bit stream is formatted as an HD- (SMPTE 292M) signal that is fully compliant and compatible with existing HD- equipment, such as switchers of routers. Any metadata, and up to 16 channels of embedded AES / EBU audio, from each source are kept without compression. Unique to the Dirac Pro implementation by NuMedia, the encoded bit stream includes a coarsely quantised interlaced version of each picture. This enables a recognisable image to be recognised as an HD- signal and displayed. So what is in the name, Dirac features can easily be added to Dirac's modular codebase. Architecture Dirac is similar to many of the established video coding systems. However we have adopted established technologies which combine effectiveness, efficiency, and simplicity. Together, these features give us a quality system which is not encumbered with patents. First we use motion compensation to make use of the correlation between picture frames. Good motion compensation can dramatically reduce the amount of data required to code a picture. Then we use wavelets (not the more conventional DCT) to transform the residual error signal. Origin of the name "Dirac" is a reference to Paul A. M. Dirac, British physicist and winner of the 1933 Nobel Prize in Physics. The prize was shared with Erwin Schrödinger, for whom the Schroedinger implementation of Dirac was named. The technology behind Dirac V in -P Є motion compensation wavelet transformation and quantisation Є TQ inverse transformation Є V local motion estimation processed video metadata arithmetic coding output This is a simple introduction to the technology of Dirac. It is hopefully of use to those who are new to the subject, or just want a simple overview. Dirac has three main strands: a compression specification for the bytestream and the decoder software for compression and decompression algorithms designed to support simple and efficient hardware implementations, this is were NuMedia and NuMedia are involved. Unlike many of the other standard video compression systems, the software is not intended simply to provide reference coding and decoding, but also as a prototype implementation that can freely be modified, enhanced and deployed. The decoder implementation in particular is designed to provide fast decoding whilst remaining portable across software platforms. Real time decoding of modern compression systems is difficult without extensively exploiting hardware support (in coprocessors and video cards) or assembly-language code, but these 2 Motion Compensation In Dirac, frames have two essential properties. Firstly, they are either predicted from other frames (Inter) or not (Intra). Secondly they can be used to predict other frames (Reference) or not (Non-reference). All combinations of these properties are possible, and any Inter frame can be predicted from up to two reference frames. This means that Dirac can support conventional MPEG-style structures (Group of Pictures or GOP), but also any other prediction structure that may give better performance. When we get down to pixel level, we can define the reference pixels for motion compensation, either at the global level (through pan, tilt, zoom, rotate etc commands) or by reference to pixels chosen by calculating specific motion vectors for the local block of pixels.

Transform coding Wavelets have commonly been used for still image compression (a recent example is the core of JPEG2000). Now the power of modern chips allows us to use wavelets for motion pictures. The wavelet transform repeatedly filters the signals into low and high frequency parts. This repeated split concentrates the important data in one subband which can be efficiently encoded. We apply different degrees of quantisation to the transformed data. The human eye appears to be insensitive to coarse quantisation in some of the higher wavelet bands, and we exploit this ruthlessly to achieve high compression efficiency. One of the weaknesses of MPEG-2 is the way that the picture goes all blocky when the coder is being worked hard. The use of the Discrete Cosine Transform (DCT) to transform the residual error limits the flexibility of the blocks used in the processing. By using wavelets, we can use varying sizes of blocks, and overlap them to mitigate the impact of block edges. This block structure also results in better motion predictions, again yielding improved compression. Entropy coding The transformed data still has redundancy. Entropy coding is used to reduce the bandwidth. The entropy coding technique used in Dirac is arithmetic coding. This is efficient and flexible. Arithmetic coding separates statistical modelling from the compression process itself, and better compression is afforded when the inter-dependence of data is exploited by switching between models based on previously-coded data. Dirac applies entropy coding to the motion vectors and the output of the wavelet transform process. (a) original (b) first level transform (c) second level transform Bytestream The whole of the compressed data is packaged in a simple bytestream. This has synchronisation, permitting access to any frame quickly and efficiently - making editing simple. The structure is such that the whole bytestream can be packaged in many of the existing transport streams, such as MPEG, MXF, IP, Ogg, etc. This feature means that we are able to use a wide range of sound coding options, as well as easy access to all the other data transport systems required for production or broadcast metadata. Dirac Applications A few of the possible applications include: Moving HD over existing tech infrastructures Sending two camera channels over existing links Desktop production over IP networks Reducing storage in archive Reducing disk storage and bandwidth in D Cinema Input A Input B Deserialise Deserialise A typical hardware implementation Diversity Select Using Dirac Pro A B Audio disembed Multi-Level Wavelet Encoder Colour Space Converter SPDIF Encoder Bypass DVI-D Transmitter Serialise Input A Output B As mentioned above, Dirac Pro was designed with production and post production in mind. It supports a wide range of picture formats including up to 4K and 8K D Cinema images and beyond. It is not limited to 8 or 10 bit signals and can support the large word widths needed in the post production environment. All frame rates plus RGB, 4:4:4, 4:2:2 and 4:2:0 colour formats are supported. The Dirac Pro stream includes essential metadata that describes how the coded signal should be interpreted. A B Warning Class 1 Laser products DVI-D TOSLINK SPF 3

NuMedia products: Integration The DP100 and DP400 codecs are part of the NuMedia range of products. As stand alone units and cards that fit NuMedia s Chameleon. Please see the DP100 and DP400 product brochures for more information of the products available. Applications DP100 allows the transport of 3G HD- over 1.5G infrastructures. Via copper (Coax). 3G HD- In DP100 1.5G Copper Infrastructure DP100 3G HD- Out DP400 allows one 1.5G HD- single to be passed over single 270Mb infrastructures. Either via copper (Coax) and or fibre and or IP. 1.5G HD- In DP400 270Mb Copper Infrastructure DP400 1.5G HD- Out 4

References and further reading 1. Gourier, G.G., 1957. Bandwidth Compression of a Television Signal. Proceedings I.E.E., Paper No. 2357 R, 104 B, pp 265-271 2. http://dirac.sourceforge.net/specification.html 3. Borer, T. & Davies, T. 2005. Dirac - Video Compression using Open Technology. EBU Technical Review No 303, July 2005 4. http://dirac.cvs.sourceforge.net/dirac/diracpro 5. http://www.numediatechnology.com 6. http://sourceforge.net/project/showfiles.php? group_id=102564 7. http://schrodinger.sourceforge.net 8. http://www.jpeg.org/jpeg2000 Acknowledgements The author would like to thank his colleagues at the BBC and Numedia Technology for their contributions to the development of the VC-2 codec. He would also like to thank the BBC for permission to publish this paper. Features and specifications subject to change without notice. All photographs are generic examples only and may not fully represent the product delivered. Numedia Technology Limited Unit 41 Brambles Enterprise Centre, Waterberry Drive, Waterlooville, Hampshire, PO7 7TH Telephone: +44(0) 23 9225 8229 Fax: +44(0) 23 9226 7453 E-mail: info@numediatechnology.com Web: www.numediatechnology.com 5