Lecture 1: Introduction & Image and Video Coding Techniques (I)

Similar documents
COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

Lecture 2 Video Formation and Representation

Research Topic. Error Concealment Techniques in H.264/AVC for Wireless Video Transmission in Mobile Networks

Lecture 2 Video Formation and Representation

Video Compression. Representations. Multimedia Systems and Applications. Analog Video Representations. Digitizing. Digital Video Block Structure

Video 1 Video October 16, 2001

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

Module 1: Digital Video Signal Processing Lecture 5: Color coordinates and chromonance subsampling. The Lecture Contains:

Advanced Computer Networks

Ch. 1: Audio/Image/Video Fundamentals Multimedia Systems. School of Electrical Engineering and Computer Science Oregon State University

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Multimedia. Course Code (Fall 2017) Fundamental Concepts in Video

Video (Fundamentals, Compression Techniques & Standards) Hamid R. Rabiee Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi Spring 2011

Contents. xv xxi xxiii xxiv. 1 Introduction 1 References 4

Image and video encoding: A big picture. Predictive. Predictive Coding. Post- Processing (Post-filtering) Lossy. Pre-

Motion Video Compression

Overview: Video Coding Standards

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

H.261: A Standard for VideoConferencing Applications. Nimrod Peleg Update: Nov. 2003

So far. Chapter 4 Color spaces Chapter 3 image representations. Bitmap grayscale. 1/21/09 CSE 40373/60373: Multimedia Systems

Lecture 2 Video Formation and Representation

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Multimedia Communications. Video compression

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

Multimedia Communications. Image and Video compression

Inputs and Outputs. Review. Outline. May 4, Image and video coding: A big picture

An Overview of Video Coding Algorithms

06 Video. Multimedia Systems. Video Standards, Compression, Post Production

OBJECT-BASED IMAGE COMPRESSION WITH SIMULTANEOUS SPATIAL AND SNR SCALABILITY SUPPORT FOR MULTICASTING OVER HETEROGENEOUS NETWORKS

EMBEDDED ZEROTREE WAVELET CODING WITH JOINT HUFFMAN AND ARITHMETIC CODING

A video signal consists of a time sequence of images. Typical frame rates are 24, 25, 30, 50 and 60 images per seconds.

Principles of Video Compression

Video coding standards

Chapter 2 Introduction to

Content storage architectures

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

5.1 Types of Video Signals. Chapter 5 Fundamental Concepts in Video. Component video

Transform Coding of Still Images

Chapter 1 INTRODUCTION

Error concealment techniques in H.264 video transmission over wireless networks

Chapter 3 Fundamental Concepts in Video. 3.1 Types of Video Signals 3.2 Analog Video 3.3 Digital Video

1. Broadcast television

Intra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences

Digital Image Processing

INTRA-FRAME WAVELET VIDEO CODING

Digital Media. Daniel Fuller ITEC 2110

ERROR CONCEALMENT TECHNIQUES IN H.264 VIDEO TRANSMISSION OVER WIRELESS NETWORKS

ECE3296 Digital Image and Video Processing Lab experiment 2 Digital Video Processing using MATLAB

MPEG-2. ISO/IEC (or ITU-T H.262)

Motion Re-estimation for MPEG-2 to MPEG-4 Simple Profile Transcoding. Abstract. I. Introduction

The H.26L Video Coding Project

Video Codec Requirements and Evaluation Methodology

10 Digital TV Introduction Subsampling

EECS150 - Digital Design Lecture 12 Project Description, Part 2

Traditionally video signals have been transmitted along cables in the form of lower energy electrical impulses. As new technologies emerge we are

Essence of Image and Video

MPEG + Compression of Moving Pictures for Digital Cinema Using the MPEG-2 Toolkit. A Digital Cinema Accelerator

Chapter 2 Video Coding Standards and Video Formats

What You ll Learn Today

Lecture 23: Digital Video. The Digital World of Multimedia Guest lecture: Jayson Bowen

Multimedia Systems. Part 13. Mahdi Vasighi

Visual Communication at Limited Colour Display Capability

DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

INTERNATIONAL TELECOMMUNICATION UNION. SERIES H: AUDIOVISUAL AND MULTIMEDIA SYSTEMS Coding of moving video

Midterm Review. Yao Wang Polytechnic University, Brooklyn, NY11201

Interlace and De-interlace Application on Video

Colour Reproduction Performance of JPEG and JPEG2000 Codecs

Chapt er 3 Data Representation

ITU-T Video Coding Standards

T he Electronic Magazine of O riginal Peer-Reviewed Survey Articles ABSTRACT

MULTIMEDIA TECHNOLOGIES

Module 3: Video Sampling Lecture 17: Sampling of raster scan pattern: BT.601 format, Color video signal sampling formats

PAL uncompressed. 768x576 pixels per frame. 31 MB per second 1.85 GB per minute. x 3 bytes per pixel (24 bit colour) x 25 frames per second

Matrox PowerStream Plus

yintroduction to video compression ytypes of frames ysome video compression standards yinvolves sending:

Welcome Back to Fundamentals of Multimedia (MR412) Fall, ZHU Yongxin, Winson

Chapter 10 Basic Video Compression Techniques

To discuss. Types of video signals Analog Video Digital Video. Multimedia Computing (CSIT 410) 2

for File Format for Digital Moving- Picture Exchange (DPX)

Selective Intra Prediction Mode Decision for H.264/AVC Encoders

Television History. Date / Place E. Nemer - 1

Man-Machine-Interface (Video) Nataliya Nadtoka coach: Jens Bialkowski

Comparative Study of JPEG2000 and H.264/AVC FRExt I Frame Coding on High-Definition Video Sequences

Improvement of MPEG-2 Compression by Position-Dependent Encoding

MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1

COMP 9519: Tutorial 1

Audio and Video II. Video signal +Color systems Motion estimation Video compression standards +H.261 +MPEG-1, MPEG-2, MPEG-4, MPEG- 7, and MPEG-21

COMPRESSION OF DICOM IMAGES BASED ON WAVELETS AND SPIHT FOR TELEMEDICINE APPLICATIONS

Understanding Multimedia - Basics

Impact of scan conversion methods on the performance of scalable. video coding. E. Dubois, N. Baaziz and M. Matta. INRS-Telecommunications

Enhanced Frame Buffer Management for HEVC Encoders and Decoders

Study of AVS China Part 7 for Mobile Applications. By Jay Mehta EE 5359 Multimedia Processing Spring 2010

17 October About H.265/HEVC. Things you should know about the new encoding.

Real Time PQoS Enhancement of IP Multimedia Services Over Fading and Noisy DVB-T Channel

Understanding IP Video for

Distributed Video Coding Using LDPC Codes for Wireless Video

SERIES J: CABLE NETWORKS AND TRANSMISSION OF TELEVISION, SOUND PROGRAMME AND OTHER MULTIMEDIA SIGNALS Measurement of the quality of service

Introduction to image compression

Understanding IP Video for

Transcription:

Lecture 1: Introduction & Image and Video Coding Techniques (I) Dr. Reji Mathew Reji@unsw.edu.au School of EE&T UNSW A/Prof. Jian Zhang NICTA & CSE UNSW jzhang@cse.unsw.edu.au COMP9519 Multimedia Systems S2 2010

1.1 Introduction Lecturers Profiles Dr. Reji Mathew http://www.ee.unsw.edu.au/staff/research.htm A/Prof. Jian Zhang http://www.cse.unsw.edu.au/~jzhang This course covers three parts: Image/video coding technology, Streaming multimedia and Multimedia content description, analysis and retrieval COMP9519 Multimedia Systems Lecture 1 Slide 2 R. Mathew & J Zhang

1.2 Course Scope & Arrangement The Scope of this Course: Provide fundamentals of state-of-art multimedia technologies Concepts Principles of these technologies and, Their applications Provide a base of introduction to multimedia system Digital audio and image/video signal coding and compression; Multimedia streaming and multimedia presentation Multimedia content description Video structure analysis; video summarization and representation Multimedia database indexing, browsing and retrieval;. COMP9519 Multimedia Systems Lecture 1 Slide 3 R. Mathew & J Zhang

1.2 Course Scope & Arrangement -- Subject Outline Objectives: On successful completion of this subject, students will: understand fundamental concepts, theory and techniques of digital audio and image/video signal coding and compression; multimedia streaming and multimedia presentation multimedia content description video structure analysis; video summarization and representation multimedia database indexing, browsing and retrieval;. be familiar with applications of multimedia systems and their implementations; be able to apply the techniques in real applications gain skills and knowledge beneficial to future work and post-graduate study in multimedia area COMP9519 Multimedia Systems Lecture 1 Slide 4 R. Mathew & J Zhang

1.2 Course Scope & Arrangement Lecture 1 -- Introduction & image processing Lecture 2 -- Image and Video Coding Techniques (I) Lecture 3 -- Image and Video Coding Techniques (II) Assignment 1 Lecture 4 -- Video Compression Lecture 5 -- Video Compression Standards Lecture 6 -- Internet Streaming Media & Multimedia Presentation COMP9519 Multimedia Systems Lecture 1 Slide 5 R. Mathew & J Zhang

1.2 Course Scope & Arrangement (Cont.) Lecture 7 -- Multimedia Content Description Lecture 8 Image based Content Retrieval (part 1) Lecture 9 Image based Content Retrieval (part 2) Assignment 2 Lecture 10: Tutorial Multimedia information Retrieval Lecture 11 -- Image based Content Retrieval (part 3) Lecture 12 -- Course Overview COMP9519 Multimedia Systems Lecture 1 Slide 6 R. Mathew & J Zhang

1.2 Course Scope & Arrangement http://www.cse.unsw.edu.au/~cs9519/ 12 teaching weeks (11 lectures + one tutorial class) Consultation time: Level 4 NICTA L5 Building. Every Wednesday 2-4 PM from Week 2. Tutors: Werayut Saesue lectures 1-6 Tuan Hue Thi lectures 7-11 Weihong Wang Admin 1-12 Tutorials Some tutorials and in-class tests are included during lectures. Only one tutorial class is planned in week 10. COMP9519 Multimedia Systems Lecture 1 Slide 7 R. Mathew & J Zhang

1.2 Assessment Assignment 1 (30%) Assignment 2 (30%) Final Exam (40%) Understand basic concepts Describe concepts for problem solving COMP9519 Multimedia Systems Lecture 1 Slide 8 R. Mathew & J Zhang

References Reference Books: Yun Q. Shi, Image and video compression for multimedia engineering, 2009 Edition 2 F. Pereira, The MPEG-4 book, 2002 Feng D, Siu W C and Zhang H J (editor), Multimedia Information Retrieval and Management, Springer, 2003 International Standards: RTSP www.ietf.org/rfc/rfc2326.txt SDP www.ietf.org/rfc/rfc2327.txt RTP www.ietf.org/rfc/rfc3550.txt RTP for MPEG-4 www.ietf.org/rfc/rfc3016.txt COMP9519 Multimedia Systems Lecture 1 Slide 9 R. Mathew & J Zhang

1.3 Multimedia Applications: Digital Video Video conference and telephony Multimedia communications Digital video Camera DVD, Blu-Ray Disc HDTV Video surveillance and security Video/image database Interactive multimedia Multimedia data storage and management Digital terrestrial and satellite TV COMP9519 Multimedia Systems Lecture 1 Slide 10 R. Mathew & J Zhang

1.3 Multimedia Applications -- Streaming Video (MPEG-4 standard) Content Creation Delivery Consumption Live MPEG-4 Wireline Multicast E.g. daycare, security Professional MPEG-4 Encoder Content Provider IP Network MPEG-4 Server Internet Content MPEG-4 Local Servers/ Caches 4G Wired Networks GPRS PC M Phone M Phone Live MPEG-4 Wireless Camera Error resilient encoder Home RF Network 3G Network M Phone Home entertainment unit COMP9519 Multimedia Systems Lecture 1 Slide 11 R. Mathew & J Zhang Embedded MPEG-4 Players Error resilient decoder

1.5 Basic Concepts of Image and Video Processing 1.5.1 Image and Video Sequence 1.5.2 Pixel Representation 1.5.3 Chrominance sub-sampling 1.5.4 Digital Video Formats 1.5.5 Information Measure 1.5.6 Image Descriptors & Quality Measure 1.5.7 Introduction to Entropy Coding COMP9519 Multimedia Systems Lecture 1 Slide 12 R. Mathew & J Zhang

1.5.1 Image and Video Sequence Digital Image & Video The basic unit to build a image is called pixel (pel) Image resolution is calculated by pixels in horizontal and vertical coordinates Video sequences consist of a number of motion pictures Spatial sampling (horizontal & vertical resolution in pixels) Temporal sampling (frames per second) time COMP9519 Multimedia Systems Lecture 1 Slide 13 R. Mathew & J Zhang

1.5.1 Video Image and Sequence Spatial Sampling HDTV : 1920x1080 Temporal Sampling 25 or 30 fps Standard frame rates (25 or 30 frames per second) are high enough to provide smooth motion. COMP9519 Multimedia Systems Lecture 1 Slide 14 R. Mathew & J Zhang

1.5.2 Pixel Representation Y,U,V Colour Space The Human Visual System (HVS) is sensitive to three colour components. Colour can be represented by Red, Green and Blue components (RGB). Transform to YUV or YCbCr with less correlated representation:. Y = 0.299R + 0.587G + 0.114B U V t t B Y = 2.03 R Y = 1.14 Y 0.299 0.587 0.114 R U t 0.147 0.289 0.436 G = V t 0.615 0.515 0.100 B 1444442444443 Note: The two chrominance components (U,V) contain considerably less information than the luminance component. For this reason, chrominance is often sub-sampled A COMP9519 Multimedia Systems Lecture 1 Slide 15 R. Mathew & J Zhang

1.5.2 Pixel Representation YCC Colour Space d b r For digital component signal (CCIR Rec 601), 8-bit digital variables are used, however: 1. Full digital range is not used to give working margins for coding and filtering. 2. RGB to YdCbCr conversion is given by Yd 0.257 0.504 0.098 Rd 16 C b 0.148 0.291 0.439 G d 128 = + C r 0.439 0.368 0.071 B d 128 Rd 1.164 0.000 1.596 Yd 16 G d 1.164 0.392 0.813 Cb 128 = B d 1.164 2.017 0.000 Cr 128 The positive/negative values of U and V are scaled and zero shifted in a transformation to the Cb and Cr coordinates. where digital luminance, Yd, has a rang of (16-235) with 220 levels starting at 16, and digital chrominance difference signals, Cb and Cr, have a range of (16-240) with 225 levels centered at 128. COMP9519 Multimedia Systems Lecture 1 Slide 16 R. Mathew & J Zhang

1.5.2 Pixel Representation Yd,Cb,Cr Colour Space (a) Red (d) Yd Transform (b) Green (e) Cb (c) Blue (f) COMP9519 Multimedia Systems Lecture 1 Slide 17 R. Mathew & J Zhang Cr

1.5.3 Chrominance sub-sampling Human vision is relatively insensitive to chrominance. For this reason, chrominance is often sub-sampled. Chrominance sub-sampling is specified as a three-element ratio. COMP9519 Multimedia Systems Lecture 1 Slide 18 R. Mathew & J Zhang

1.5.3 Chrominance sub-sampling In 4:4:4 format: Y, Cr & Cb 720 x 576 pixels per frame In 4:2:2 format: Y 720 x 576 and Cr & Cb 360 x 576 pixels per frame In 4:2:0 format: Y 720 x 576 and Cr & Cb 360 x 288 pixels per frame A commonly used format is 4:2:0 which is obtained by sub-sampling each colour component of 4:2:2 source vertically to reduce the number of lines to 288; COMP9519 Multimedia Systems Lecture 1 Slide 19 R. Mathew & J Zhang

1.5.4 Digital Video Formats International Consultative Committee for Radio (CCIR) Rec. 601: Two display rates: 50Hz: 720x576 pixels at 50 fields per second. 60Hz: 720x480 pixels at 60 fields per second. Both rates are 2:1 interlaced and 4:2:2 chrominance sampling (with optional 4:4:4). 858 pixels 720 pixels 864 pixels 720 pixels 525 lines 480lines 625 lines 576 lines 122 pixels 525/60: 60 fields/s 16 pixels NOTE: Figures referred from Y. Wang et al, 2002. 132 pixels 625/50: 50 fields/s 12 pixels COMP9519 Multimedia Systems Lecture 1 Slide 20 R. Mathew & J Zhang

1.5.4 Digital Video Formats Common Intermediate Format (CIF): This format was defined by CCITT (TSS) for H.261 coding standard (teleconferencing and videophone). Several size formats: SQCIF: 88x72 pixels QCIF: 176x144 pixels. CIF: 352x288 pixels. 4CIF: 704x576 pixels. Non-interlaced (progressive), and chrominance sub-sampling using 4:2:0. Frame rates up to 25 frames/sec COMP9519 Multimedia Systems Lecture 1 Slide 21 R. Mathew & J Zhang

1.5.4 Digital Video Format High Definition Television (HDTV): 1280x720 pixels. 1920x1080 pixels. COMP9519 Multimedia Systems Lecture 1 Slide 22 R. Mathew & J Zhang

1.5.5 Information Measure Information Measure Consider a symbol x with an occurrence probability p, its info. content i(x) (i.e. the amount of info contained in the symbol) [ ] 1 = log p( ) i( x) = log 2 2 p( x) x bits 2-1 The smaller the probability, the more info. the symbol contains The occurrence probability somewhat related to the uncertainty of the symbol A small occurrence probability means large uncertainty or the info. Content of a symbol is about the uncertainty of the symbol. Average Information per Symbol Consider a discrete memoriless information source By discreteness, the source is a countable set of symbols By memoriless, the occurrence of a symbol in the set is independent of that of its preceding symbol. COMP9519 Multimedia Systems Lecture 1 Slide 23 R. Mathew & J Zhang

1.5.5 Information Measure Look at a source that contains m possible symbols: {si, i=1,2..m} The occurrence probabilities: {Pi, i=1,2..m} The info. content of a symbol si; Information Entropy Ii i( s) i = log2 = p bits The Entropy is defined as the average information content per symbol of the source. The Entropy, H, can be expressed as follows: H = m i = 1 p i log 2 p i bits From this definition, the entropy of an information source is a function of occurrence probabilities. The entropy reaches the Max. when all symbols in the set are equally probable. i COMP9519 Multimedia Systems Lecture 1 Slide 24 R. Mathew & J Zhang

1.5.5 Information Measure Information Content Consider the two blocks of binary data shown below which contains the most information? COMP9519 Multimedia Systems Lecture 1 Slide 25 R. Mathew & J Zhang

1.5.6 Image Descriptors Definition-- Image mean Given a two-dimensional (2-D) image field with pixel value, x[n.m], n=1,2,,n and m=1,2,,m, the mean of the image is defined as the spatial average of the luminance values of all pixel, i.e., x = 1 N M N M = = n 1 m 1 xnm [, ] (2-5) Definition--Image variance Given a two-dimensional (2-D) image field with pixel value, x[n.m], n=1,2,,n and m=1,2,,m, the variance of the image is defined as the average value of the squared difference between the value of an arbitrary pixel and the image mean, i.e., N M 2 1 2 σ = ( xnm [, ] x) N M (2-6) n= 1 m= 1 COMP9519 Multimedia Systems Lecture 1 Slide 26 R. Mathew & J Zhang

1.5.6 Quality Measurement Image quality measurement (MSE,MAE,SNR and PSNR) Assume symbol xrepresents the original image and ˆx the reconstructed image, M and N the width and the height of respectively Mean squared error (MSE): M 1 N 1 1 MSE = [ xmn (, ) xmn ˆ(, )] MN m= 0 n= 0 Mean absolute error (MAE): M 1 N 1 1 MAE = xmn (, ) xmn ˆ(, ) MN m= 0 n= 0 Peak signal to noise ration (PSNR): PSNR = 10log 10 2 255 db MSE COMP9519 Multimedia Systems Lecture 1 Slide 27 R. Mathew & J Zhang 2 (2-7) (2-8) (2-9) Peak pixel value is assumed 255

1.5.7 Introduction to Entropy Coding Image Histogram Entropy = 7.63 bits/pixel COMP9519 Multimedia Systems Lecture 1 Slide 28 R. Mathew & J Zhang

1.5.7 Introduction to Entropy Coding The number of bits required to represent an image can be made based on the information content using an entropy (variable length coding) approach such as a Huffman code Highly probable symbols are represented by short code-words while less probable symbols are represented by longer code-words The result is a reduction in the average number of bits per symbol COMP9519 Multimedia Systems Lecture 1 Slide 29 R. Mathew & J Zhang

1.5.7 Introduction to Entropy Coding Example Fixed length coding Symbol Probability Codeword A 0.75 00 Codeword Length 2 B 0.125 01 2 C 0.0625 10 2 D 0.0625 11 2 Average bits/symbol = 0.75*2 + 0.125*2 + 0.0625*2 + 0.0625*2 = 2.0 bits/pixel COMP9519 Multimedia Systems Lecture 1 Slide 30 R. Mathew & J Zhang

1.5.7 Introduction to Entropy Coding Example Entropy (Variable Length) Coding Symbol Probability Codeword Codeword Length A 0.75 0 1 B 0.125 10 2 C 0.0625 110 3 D 0.0625 111 3 Average bits/symbol = 0.75*1 + 0.125*2 + 0.0625*3 + 0.0625*3 = 1.375 bits/pixel (A 30% saving with no loss) COMP9519 Multimedia Systems Lecture 1 Slide 31 R. Mathew & J Zhang

1.5.7 Introduction to Entropy Coding Generation of Huffman Codewords If the symbol probabilities are known, Huffman codewords can be automatically generated Details are introduced in next two slides Note on Merging (left to right) 1. Reorder in decreasing order of probability at each step 2. Merge the two lowest probability symbols at each step Note on Splitting (right to left) 1. Split the symbol merged at that step into two symbols COMP9519 Multimedia Systems Lecture 1 Slide 32 R. Mathew & J Zhang

1.5.7 Introduction to Entropy Coding 1. From Right to Left, 2. Two bottom-most branches are formed a node 3. Reorder probabilities into descending order Tree Construction process COMP9519 Multimedia Systems Lecture 1 Slide 33 R. Mathew & J Zhang

1.5.7 Introduction to Entropy Coding 1) Re-arrange the tree to eliminate crossovers, 2) The coding proceeds from left to right, 3) 0 step up and 1 step down. Code generation COMP9519 Multimedia Systems Lecture 1 Slide 34 R. Mathew & J Zhang

1.5.7 Introduction to Entropy Coding Truncated and Modified Huffman Coding Given the size of the code book is L, the longest codeword will reach L bits For a large quantities of symbols, the size of the code book will be restricted Truncated Huffman coding For a suitable selected L1<L, the first L1 symbols are Huffman coded and the remaining symbols are coded by a prefix code, following by a suitable fixed-length code Second Order Entropy Instead of find the entropy of individual symbols, they can be grouped in pairs and the entropy of the symbol pairs calculated. This is called the SECOND ORDER ENTROPY. For correlated data, this will lead to an entropy closer to the source entropy. COMP9519 Multimedia Systems Lecture 1 Slide 35 R. Mathew & J Zhang

1.5.7 Introduction to Entropy Coding Limitations of Huffman Coding Huffman codewords have to be an integer number of bits long. If the probability of a symbol is 1/3, the optimum number of bits to encode that symbol is -log2 (1/3) =1.6. Assigning two bits leads to a longer code message than is the theoretically necessary The symbol probabilities must be known in the decoder size. If not, they must be generated and transmitted to the decoder with the Huffman coded data A larger of number of symbols results in a large codebook Dynamic Huffman coding scheme exists where the code words are adaptively adjusted during encoding and decoding, but it is complex for implementation. COMP9519 Multimedia Systems Lecture 1 Slide 36 R. Mathew & J Zhang