Digital Image Processing

Similar documents
Video coding standards

Advanced Computer Networks

Multimedia Communications. Image and Video compression

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

Multimedia Communications. Video compression

Chapter 10 Basic Video Compression Techniques

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Overview: Video Coding Standards

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Principles of Video Compression

An Overview of Video Coding Algorithms

Motion Video Compression

A video signal consists of a time sequence of images. Typical frame rates are 24, 25, 30, 50 and 60 images per seconds.

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Video Compression. Representations. Multimedia Systems and Applications. Analog Video Representations. Digitizing. Digital Video Block Structure

PAL uncompressed. 768x576 pixels per frame. 31 MB per second 1.85 GB per minute. x 3 bytes per pixel (24 bit colour) x 25 frames per second

The H.26L Video Coding Project

H.261: A Standard for VideoConferencing Applications. Nimrod Peleg Update: Nov. 2003

Video (Fundamentals, Compression Techniques & Standards) Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Audio and Video II. Video signal +Color systems Motion estimation Video compression standards +H.261 +MPEG-1, MPEG-2, MPEG-4, MPEG- 7, and MPEG-21

Video 1 Video October 16, 2001

So far. Chapter 4 Color spaces Chapter 3 image representations. Bitmap grayscale. 1/21/09 CSE 40373/60373: Multimedia Systems

Introduction to image compression

06 Video. Multimedia Systems. Video Standards, Compression, Post Production

Chapter 2 Introduction to

HEVC: Future Video Encoding Landscape

yintroduction to video compression ytypes of frames ysome video compression standards yinvolves sending:

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

MPEG-2. ISO/IEC (or ITU-T H.262)

Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding. Abstract. I. Introduction

Digital Media. Daniel Fuller ITEC 2110

AUDIOVISUAL COMMUNICATION

Midterm Review. Yao Wang Polytechnic University, Brooklyn, NY11201

Video Over Mobile Networks

Analog and Digital Video Basics

Performance Evaluation of Error Resilience Techniques in H.264/AVC Standard

Multimedia Communication Systems 1 MULTIMEDIA SIGNAL CODING AND TRANSMISSION DR. AFSHIN EBRAHIMI

Multimedia Communication Systems 1 MULTIMEDIA SIGNAL CODING AND TRANSMISSION DR. AFSHIN EBRAHIMI

The H.263+ Video Coding Standard: Complexity and Performance

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

COMP 9519: Tutorial 1

Analog and Digital Video Basics. Nimrod Peleg Update: May. 2006

The Multistandard Full Hd Video-Codec Engine On Low Power Devices

Ch. 1: Audio/Image/Video Fundamentals Multimedia Systems. School of Electrical Engineering and Computer Science Oregon State University

Video Compression Basics. Nimrod Peleg Update: Dec. 2003

Digital Video Telemetry System

Video Processing Applications Image and Video Processing Dr. Anil Kokaram

Video coding. Summary. Visual perception. Hints on video coding. Pag. 1

Part II Video. General Concepts MPEG1 encoding MPEG2 encoding MPEG4 encoding

ITU-T Video Coding Standards

ITU-T Video Coding Standards H.261 and H.263

Implementation of an MPEG Codec on the Tilera TM 64 Processor

MULTIMEDIA TECHNOLOGIES

Chapter 2 Video Coding Standards and Video Formats

CONTEXT-BASED COMPLEXITY REDUCTION

Content storage architectures

FEC FOR EFFICIENT VIDEO TRANSMISSION OVER CDMA

New forms of video compression

Improving Quality of Video Networking

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

Distributed Multimedia Systems. 2.Coding. László Böszörményi Distributed Multimedia Systems Coding - 1

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1

JPEG2000: An Introduction Part II

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

Multimedia. Course Code (Fall 2017) Fundamental Concepts in Video

Analysis of MPEG-2 Video Streams

complex than coding of interlaced data. This is a significant component of the reduced complexity of AVS coding.

Modeling and Evaluating Feedback-Based Error Control for Video Transfer

A Big Umbrella. Content Creation: produce the media, compress it to a format that is portable/ deliverable

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

Lecture 1: Introduction & Image and Video Coding Techniques (I)

ATSC vs NTSC Spectrum. ATSC 8VSB Data Framing

Video Compression - From Concepts to the H.264/AVC Standard

T he Electronic Magazine of O riginal Peer-Reviewed Survey Articles ABSTRACT

FINAL REPORT PERFORMANCE ANALYSIS OF AVS-M AND ITS APPLICATION IN MOBILE ENVIRONMENT

DIRAC. The video compression family using open technology

CERIAS Tech Report Preprocessing and Postprocessing Techniques for Encoding Predictive Error Frames in Rate Scalable Video Codecs by E

FRACTAL AND MULTIFRACTAL ANALYSES OF COMPRESSED VIDEO SEQUENCES

Lecture 2 Video Formation and Representation

Chapter 2. Advanced Telecommunications and Signal Processing Program. E. Galarza, Raynard O. Hinds, Eric C. Reed, Lon E. Sun-

Key Techniques of Bit Rate Reduction for H.264 Streams

MULTIMEDIA COMPRESSION AND COMMUNICATION

Image and video encoding: A big picture. Predictive. Predictive Coding. Post- Processing (Post-filtering) Lossy. Pre-

INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATION ENGINEERING & TECHNOLOGY (IJECET)

17 October About H.265/HEVC. Things you should know about the new encoding.

Video signals are separated into several channels for recording and transmission.

Fast MBAFF/PAFF Motion Estimation and Mode Decision Scheme for H.264

Speeding up Dirac s Entropy Coder

Impact of scan conversion methods on the performance of scalable. video coding. E. Dubois, N. Baaziz and M. Matta. INRS-Telecommunications

Chapter 6 & Chapter 7 Digital Video CS3570

Understanding Compression Technologies for HD and Megapixel Surveillance

INTRA-FRAME WAVELET VIDEO CODING

Film Grain Technology

Into the Depths: The Technical Details Behind AV1. Nathan Egge Mile High Video Workshop 2018 July 31, 2018

The Essence of Image and Video Compression 1E8: Introduction to Engineering Introduction to Image and Video Processing

Visual Communication at Limited Colour Display Capability

A look at the MPEG video coding standard for variable bit rate video transmission 1

Transcription:

Digital Image Processing 25 January 2007 Dr. ir. Aleksandra Pizurica Prof. Dr. Ir. Wilfried Philips Aleksandra.Pizurica @telin.ugent.be Tel: 09/264.3415 UNIVERSITEIT GENT Telecommunicatie en Informatieverwerking Video compression techniques

Units kbps (Mbit/s)=kilobit per second =1 000 bit /s Mbps (Mbit/s)=megabit per second =1 000 000 bit /s MBps (Mbyte/s)=megabyte per second = 8 Mbps GBps (Gbyte/s)=gigabyte per second = 1 000 MBps Kibit/s=kibibit per second = 2 10 bit/s Mibit/s=mebibit per second = 2 20 bit/s Gibit/s=gibibit per second = 2 30 bit/s MiBs (MiByte/s)=mebibyte per second = 8 Mibit/s GiBs (GiByte/s)=gibibyte per second = 1024 MiBps 08c.3 Digital video formats A. Pizurica, Universiteit Gent, 2006-2007 CCIR601 625/50 PAL/SECAM (for TV-studio s) YUV (YCrCb) format: Y: 576 lines, interlaced, 720 pixels/line, 1Byte/pixel U en V: 576 lines, interlaced, 360 pixels/line, 1Byte/pixel 50 fields/s (25 frames/s) Bit rate: 165 Mbit/s = 20 MiB/s = 70 GiB/hour CIF (Common Intermediate Format) YUV (YCrCb) format: Y: 288 lines, progressive, 360 pixels/line, 1Byte/pixel U en V: 288 lines, progressive, 180 pixels/line, 1Byte/pixel 30 fields/s (30 frames/s) Bit rate: 37 Mbit/s = 4.4 MiB/s = 15 GiB/hour QCIF (Quarter Common Intermediate Format) Quarter resolution of CIF A video conferencing format 08c.4

Compression standards for digital video MPEG-1 (CIF-format on cdrom or hard disc): 1.5 Mbit/s, low quality MPEG-2 (TV-broadcast and HDTV): 10-20 Mbit/s, high quality H.261 (video conferencing, ISDN): 284 kbs (or other multiple of 64kbs) H.263: incrementel improvement of H.261, MPEG-1 and MPEG-2 H.264: MPEG-2 + various sophisticated features beter quality at high compression and more flexibility (cfr. scalability in JPEG 2000) MPEG-4: extension of MPEG-1 and MPEG-2 with new features Treats much more than compression alone: Virtual Reality Modeling Language (VRML)-support for 3D rendering, distribution over TCP-IP, file formats Part 2: basic video codec based on MPEG-2 and H.263 + wavelet codec for still images (e.g., statistical background for video compression) Part 10: advanced video codec (AVC: advanced video codec)= H.264 08c.5 MPEG: Video Encoding Regulator Frame Memory + - DCT Quantizer (Q) Variable Length Encoder Preprocessing Input Predictive frame Motion Compensation x Q-1 IDCT + Frame Memory Motion vectors Buffer Output Motion Estimation 08c.6

MPEG MPEG=Motion Pictures Experts Group Temporal prediction and interpolation of the current frame Current image is first predicted based on one (P: prediction) or two (B: bi-directional) earlier coded images The temporal prediction and interpolation makes use of motion estimation and motion compensation Exactly the same prediction is needed at the coder and decoder DCT-block coding of the prediction error (cfr. the JPEG-scheme) Entropy coding of DCT-coefficients and motion vectors Remarks the DCT-coding is applied to blocks of 8x8 pixels (cfr. JPEG) The motion compensation uses blocks of 16x16 pixels (=4 blocks of 8x8) 08c.7 Motion compensation image from past current image Displaced frame difference = prediction error Forward motion compensation: Image is divided into 16x16 blocks, named macro blocks The most similar block from the previous image is used as prediction The prediction error and the motion vector are coded instead of the block itself Remarks The motion vectors can be calculated with pixel- or with sub-pixel-accuracy For sub-pixel-accuracy or for interlaced images blocks have to be interpolated before they can be compared 08c.8

Motion estimation in MPEG Criterion: the optimal motion vector minimizes mean absolute difference (MAD) 15 15 1 mad( vx, v y ) = f ( x, y, t) f ( x vx, y v y, t 1) 256 x= 0 y= 0 Practical method: Try a limited number of possible (v x, v y ) in a given order stop when mad(v x, v y ) becomes sufficiently low The computation time depends on The allowed motion vectors The search strategy The stopping criterion Example: centered spiral search Starts from the idea that motion is usually small investigates small (v x, v y ) first investigates only (v x, v y ) with v x <n, v y <n Stops earlier if low MAD is found 08c.9 Logarithmic search Try first a limited number of vectors on a coarse raster Refine the best vector by investigating a limited number of smaller corrections Repeat untill the desired resolution (e.g. pixel or ¼ of pixel accuracy) is reached Suboptimal (optimum can be missed) but much faster Remarks The MPEG-standards do not define how motion vectors have to be searched, neither the weighting factors for bi-directional compensation This holds for other coding parameters: quantisation labels, Huffman code books Only the data format is determined competition possible between different coders based on their computation time, quality, latency, 08c.10

Bi-directional motion compensation Problem: de-occlusion: when new objects appear for the first time the forward compensation performs bad Solution: backward compensation = compensation based on a future image Bi-directional motion compensation: predict a block based on the weighted mean of the best matching blocks from the previous and future frame For de-occlusion: weight 0 for block from previous image For occlusion: weight 0 for block from future image in other cases: better prediction by using both images 08c.11 I-frames, P-frames and B-frames Not all the images in the sequence are predicted in the same way There are three types of frames: I-frames: coded as image can be decoded based on themselves without information from the past no motion vectors P-frames prediction: motion compensation on latest received I or P-frame motion vectors + prediction error B-frames prediction: weighted mean of Motion compensation on last received I or P-frame And backward motion compensation on the first future I or P-frame can be coded only with a given delay 08c.12

Data stream in MPEG 1 GOP I1 B1 B2 P1 B3 B4 P2 B5 B6 I2 B7 B8 P3 Images in this order appear on the screen I1 P1 B1 B2 P2 B3 B4 I2 B5 B6 P3 B7 B8 Images are coded/decoded in this order Basic unit: Group of Pictures (GOP) one GOP is one I-frame + all frames until the next I-frame typical length: 12 to 15 frames (dynamically adjustable) 08c.13 Data stream in MPEG The choice of the length and the content of GOP influences the latency: delay between coding the frame and showing it; e.g. 2 frames in example from the previous slide + transmission time important in interactive applications The memory needed in the decoder: e.g. 3 frame-buffers in example The computation time: bi-directional motion compensation needs more computation time in the decoder and a bit more time in the coder To reduce computation time: decode only I-frames Simple (cheap) decoders do not need to decode all the data e.g. decode only I frames on slow hardware or I-frames and P-frames on somewhat faster hardware 08c.14

Color coding in MPEG Even in progressive (not interlaced) video chroma componets (U and V) are often subsampled Often used format (also in the basis of MPEG-2): 4:2:0 two 8x8 chrominance blocks per four 8x8 luminance blocks in total twice less chrominance data than luminance data Meaning of the notation: In practice per image line only one chrominance component is subsampled: U in even lines and V in the odd lines 4:2:0 per image line for each 4 luminance samples, 2 chrominance samples are taken of one type (e.g. U) and 0 of another type (e.g. V) Format for high quality (digital studio equipment): 4:2:2 in total same amount of chrominance data as luminance data 4:2:2 per image line for each 4 luminance samples 2 chrominance samples are taken of each type 08c.15 Color coding in MPEG 4:2:0 4:2:2 Sampling positions in the luminance- and chrominance images, for progressive (not-interlaced) video 08c.16

MPEG-4 Video Object Planes in MPEG-4 MPEG-4 is a multimedia format Explicitly treats computer graphics, text, Image is composed of different layers on top of each other: video object plane (VOP): a layer with one video object of arbitrary shape Coding in the form of a VOP binary image: 1=pixel is included in the VOP; 0=pixel is not included Or grey value image (alfa-layer): 255=pixel included and not transparent; 128=pixel included and 50% transparent, Coding image data in a VOP Macro blocks that are completely inside VOP : like in MPEG-2 Blocks that are only partially included in the video-object: with the Shape-Adaptive DCT Remark: if VOP is rectangular no shape information is encoded 08c.18

Example: macro block raster Standard macro blocks (MBs) are coded with DCT; other blocks with SADCT (Shape-Adaptive DCT) 08c.19 Shape coding: Example The binary image that describes the shape of VOP is coded with a context-sensitive arithmetic coder Application: video conferencing 1 VOP for static background: still image or video stream of low bandwidth 1 VOP for the foreground: video stream of high quality 08c.20

Not-overlapping VOPs Advantage: Minimal bit stream if the shape and the position of the foreground object does not change In the case of de-occlusion additional background blocks are sent 08c.21 Overlapping VOPs Advantage: Sending once a static background requires less bits than performing additional processing always when de-occlusion appears 08c.22

Bibliography A. Pizurica, Universiteit Gent, 2006-2007 Tektronix. A guide to mpeg fundamentals and protocol analysis http://www.broadcastpapers.com/sigdis/25w_11418_0.pdf T. Ebrahimi. MPEG-4 Video Verification Model http://lts1pc19.epfl.ch/repository/ebrahimi1997_428.pdf 08c.23