Essence of Image and Video

Similar documents
Essence of Image and Video

Ch. 1: Audio/Image/Video Fundamentals Multimedia Systems. School of Electrical Engineering and Computer Science Oregon State University

Fundamentals of Multimedia. Lecture 3 Color in Image & Video

!"#"$%& Some slides taken shamelessly from Prof. Yao Wang s lecture slides

Chapter 3 Fundamental Concepts in Video. 3.1 Types of Video Signals 3.2 Analog Video 3.3 Digital Video

Understanding Human Color Vision

Processing. Electrical Engineering, Department. IIT Kanpur. NPTEL Online - IIT Kanpur

Lecture 2 Video Formation and Representation

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Chapter 4 Color in Image and Video. 4.1 Color Science 4.2 Color Models in Images 4.3 Color Models in Video

Man-Machine-Interface (Video) Nataliya Nadtoka coach: Jens Bialkowski

[source unknown] Cornell CS465 Fall 2004 Lecture Steve Marschner 1

An Overview of Video Coding Algorithms

Television History. Date / Place E. Nemer - 1

Motion Video Compression

Chrominance Subsampling in Digital Images

Lecture 2 Video Formation and Representation

Video Compression. Representations. Multimedia Systems and Applications. Analog Video Representations. Digitizing. Digital Video Block Structure

Audio and Video II. Video signal +Color systems Motion estimation Video compression standards +H.261 +MPEG-1, MPEG-2, MPEG-4, MPEG- 7, and MPEG-21

Computer and Machine Vision

To discuss. Types of video signals Analog Video Digital Video. Multimedia Computing (CSIT 410) 2

Murdoch redux. Colorimetry as Linear Algebra. Math of additive mixing. Approaching color mathematically. RGB colors add as vectors

Advanced Computer Networks

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

Multimedia. Course Code (Fall 2017) Fundamental Concepts in Video

Introduction & Colour

1. Broadcast television

Colour Reproduction Performance of JPEG and JPEG2000 Codecs

Multimedia Systems Video I (Basics of Analog and Digital Video) Mahdi Amiri April 2011 Sharif University of Technology

Chapter 10 Basic Video Compression Techniques

OVE EDFORS ELECTRICAL AND INFORMATION TECHNOLOGY

COLOR AND COLOR SPACES ABSTRACT

Vannevar Bush: As We May Think

Minimizing the Perception of Chromatic Noise in Digital Images

MPEGTool: An X Window Based MPEG Encoder and Statistics Tool 1

The Development of a Synthetic Colour Test Image for Subjective and Objective Quality Assessment of Digital Codecs

Module 1: Digital Video Signal Processing Lecture 5: Color coordinates and chromonance subsampling. The Lecture Contains:

Midterm Review. Yao Wang Polytechnic University, Brooklyn, NY11201

ECE 634: Digital Video Systems Formats: 1/12/17

A Novel Approach towards Video Compression for Mobile Internet using Transform Domain Technique

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

CHAPTER INTRODUCTION:

Information Transmission Chapter 3, image and video

Video 1 Video October 16, 2001

Basics on Video Communications and Other Video Coding Approaches/Standards

Dan Schuster Arusha Technical College March 4, 2010

Content storage architectures

Multimedia Communications. Image and Video compression

Intra-frame JPEG-2000 vs. Inter-frame Compression Comparison: The benefits and trade-offs for very high quality, high resolution sequences

ELEG5502 Video Coding Technology

Computer Graphics. Raster Scan Display System, Rasterization, Refresh Rate, Video Basics and Scan Conversion

Using the NTSC color space to double the quantity of information in an image

Video coding standards

Digital Media. Daniel Fuller ITEC 2110

5.1 Types of Video Signals. Chapter 5 Fundamental Concepts in Video. Component video

4KScope Software Waveform, Vectorscope, Histogram and Monitor

Chapter 1 INTRODUCTION

High-Definition, Standard-Definition Compatible Color Bar Signal

Lecture 2 Video Formation and Representation

PAL uncompressed. 768x576 pixels per frame. 31 MB per second 1.85 GB per minute. x 3 bytes per pixel (24 bit colour) x 25 frames per second

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Overview: Video Coding Standards

Image and video encoding: A big picture. Predictive. Predictive Coding. Post- Processing (Post-filtering) Lossy. Pre-

sdiscope Signal Analysis Software Version 6

Implementation of an MPEG Codec on the Tilera TM 64 Processor

Chapter 2 Introduction to

Introduction to Video Compression Techniques. Slides courtesy of Tay Vaughan Making Multimedia Work

pdf Why CbCr?

Multimedia Communications. Video compression

Color Spaces in Digital Video

Camera Interface Guide

A look at the MPEG video coding standard for variable bit rate video transmission 1

Lecture 1: Introduction & Image and Video Coding Techniques (I)

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Analog and Digital Video Basics

10 Digital TV Introduction Subsampling

Rounding Considerations SDTV-HDTV YCbCr Transforms 4:4:4 to 4:2:2 YCbCr Conversion

2 Video Formation, Perception, and Representation Chapter 1 color value at any point in a video frame records the emitted or reflected light ata parti

So far. Chapter 4 Color spaces Chapter 3 image representations. Bitmap grayscale. 1/21/09 CSE 40373/60373: Multimedia Systems

MULTIMEDIA TECHNOLOGIES

Beyond the Resolution: How to Achieve 4K Standards

TERMINOLOGY INDEX. DME Down Stream Keyer (DSK) Drop Shadow. A/B Roll Edit Animation Effects Anti-Alias Auto Transition

Understanding Compression Technologies for HD and Megapixel Surveillance

Checkpoint 2 Video Encoder

4kScope Signal Analysis Software Version 6

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

Improved Performance For Color To Gray And Back Using Walsh, Hartley And Kekre Wavelet Transform With Various Color Spaces

VIDEO Muhammad AminulAkbar

Video Compression Basics. Nimrod Peleg Update: Dec. 2003

Video Demystified. A Handbook for the Digital Engineer. Fifth Edition. by Keith Jack

Digital Image Processing

CSE Data Visualization. Color. Jeffrey Heer University of Washington

Wipe Scene Change Detection in Video Sequences

Nintendo. January 21, 2004 Good Emulators I will place links to all of these emulators on the webpage. Mac OSX The latest version of RockNES

Mike Robin MIKE ROBIN S COLUMN SEPTEMBER Introduction. Generation of a color bars signal

2.4.1 Graphics. Graphics Principles: Example Screen Format IMAGE REPRESNTATION

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

Video Signals and Circuits Part 2

Chapter 6 & Chapter 7 Digital Video CS3570

WYNER-ZIV VIDEO CODING WITH LOW ENCODER COMPLEXITY

Transcription:

1 Essence of Image and Video Wei-Ta Chu 2009/9/24

Outline 2 Image Digital Image Fundamentals Representation of Images Video Representation of Videos

3 Essence of Image Wei-Ta Chu 2009/9/24 Chapters 2 and 6 of Digital Image Procesing by R.C. Gonzalez and R.E. Woods, Prentice Hall, 2 nd edition, 2001

Image Sensing and Acquisition 4 Collect the incoming energy and focus it onto an image plane.

A Simple Image Formation Model 5 Denote an image by a 2D function Characterized by two components: Illumination:, determined by the illumination source Reflectance:, determined by the characteristics of the imaged objects.

Image Sampling and Quantization 6 Sampling Quantization Digitizing the coordinate values Digitizing the amplitude values

Image Sampling and Quantization 7 Continuous image projected onto a sensor array Results of image sampling and quantization

Digital Image Representation 8 Dynamic range The number of discrete gray levels allowed for each pixel Due to processing, storage, and sampling hardware considerations, the number of gray levels typically is an integer power of 2: We refer to images whose gray levels span a significant portion of the gray scale as having a high dynamic range.

Digital Image Representation 9 Image size For a square image with width(height) is N, the total number of bits required to represent the image:

Spatial Resolution 10 Sampling is the principal factor determining the spatial resolution of an image. 1024x1024 32x32: Downsampled by a factor of 2

11 Spatial Resolution 1024x1024 Resample 512 x 512 to 1024 x 1024 From 256x256 From 128x128 From 64x64 From 32x32

Gray-Level Resolution(L=256,128,,4,2) 12 256 128 16 8 64 32 2 4

Histogram 13 The histogram of an image with gray level in the range [0,L-1] is a discrete function Normalized histogram

Histogram 14 Useful image statistics Image processing applications Image enhancement Image compression Image segmentation

Color Fundamentals 15 Color spectrum: violet, blue, green, yellow, orange & red Each color in the spectrum blends smoothly into the next The color perceived in an object are determined by the nature of the light reflected from the object

Color Fundamentals 16 Cones can be divided into three principal sensing categories Due to the absorption of the human eyes, colors are seen as variable of three primary colors (red, green, blue) Approximately 65% of all cones are sensitive to red light, 33% to green light, 2% to blue light.

Color Fundamentals 17 Secondary colors of light Magenta (R + B) Cyan (G + B) Yellow (R + G) The primary color of pigments subtract a primary color of light and reflects the other two.

Color Fundamentals 18 Brightness Embodies the chromatic notion of intensity Hue Attribute associated with the dominant wavelength in a mixture of light waves Dominant color as perceived by an observer Saturation The relative purity of the amount of white light mixed with a hue Less saturated: e.g. pink (red+white), lavender (violet+white) Hue and saturation taken together as called chromaticity.

Specifying Colors 19 The amounts of red, green, and blue needed to form any particular color are called the tristimulus values and are denoted X, Y, Z, respectively. A color is then specified by its trichromatic coefficients, defined as Using CIE chromaticity diagram, which shows color composition as a function of x (red) and y(green)

Specifying Colors 20 The point marked green has approximately 63% green and 25% red content. The composition of blue is approximately 13%.

Color Models (Color Spaces) 21 A color model is a specification of a coordinate system and a subspace within that system where each color is represented by a single point. Hardware-oriented & application-oriented RGB color monitor, color video cameras CMY (cyan, magenta, yellow) color printing CMYK (cyan, magenta, yellow, black) color printing HSI (hue, saturation, intensity) closely matching with human perception

The RGB Color Model 22 Based on Cartesian coordinate system Different colors are points on or inside the cube Full color image: 8 bits for each component, total 24 bits

23 The RGB Color Model

The CMY and CMYK Color Models 24 When a surface coated with cyan pigment is illuminated with white light, no red light is reflected from the surface. Cyan subtracts red light Most devices that deposit colored pigments on paper require CMY data input or perform RGB to CMY conversion. Equal amounts of CMY pigments should produce black.

The HSI Color Model 25 RGB/CMY color systems are suited for hardware implementations. RGB system matches nicely with the fact that the human eye is strongly perceptive to red, green, and blue primaries. But RGB and CMY are not well suited for describing colors for human interpretation.

The HSI Color Model 26 We describe a color object by its hue, saturation, and brightness. Hue: color attribute that describes a pure color Saturation: degree of pure color diluted by white light Brightness: measured by intensity HSI color model decouples the intensity component from the color-carrying information

The HSI Color Model 27 Take the RGB cube, stand on the black vertex, with the white vertex above it. The intensity (gray scale) is along The intensity (gray scale) is along the line joining these two vertices.

The HSI Color Model 28 The dot is an arbitrary color point. The angle from the red axis gives the hue, and the length of the vector is the saturation. The intensity of all colors is given by the position of the plane on the vertical intensity axis.

HSI 29 HSI is also known as HSL, HLS HSV color space

Converting colors from RGB to HSI 30 RGB values have been normalized to the range [0,1] The angle θis measured with respect to the red axis of the HSI space.

The LAB (CIELAB) Color Models 31 CIELAB (L * a * b * ) color space L*: lightness dimension a*,b*: two chromatic dimensions that are roughly red-green and blue-yellow. L*a*b* color is designed to approximate human vision http://en.wikipedia.org/wiki/lab_color_space http://coatings.specialchem.com.cn/tc/color/index.aspx?id=cielab

Other Color Models 32 YUV, YIQ, YCbCr color spaces YCbCr is widely used in video/image compression schemes such as MPEG and JPEG Please refer to http://en.wikipedia.org/wiki/color_space

Color Histogram 33 A representation of the distribution of colors in an image. Discretize colors into a number of bins, and counting the number of pixels with colors in each bin. http://rsb.info.nih.gov/ij/plugins/color-inspector.html

Nonuniform Quantization 34 An example in HLS (HSI) space Considering human perception Lee, et al. Spatial color descriptor for image retrieval and Video summarization, IEEE Trans. on Multimedia, 2003.

Characteristics of Histogram 35 The color histogram of an image represents the global statistics (color distribution) of pixels colors Histogram is one of the most useful feature to describe images or be the basis for similarity measure

Histogram-based Difference 36 Bin-wise histogram difference between Image I 1 and I 2

Short Introduction to Image Features 37 Color features Color histogram Color moments Color coherence vectors (CCV) Color correlogram Ma, et al. Benchmarking image features content-based image retrieval, Record of the 32nd Asilomar Conf. on Signals, Systems & Computers, vol 1., 1998.

Short Introduction to Image Features 38 Texture features Tamura features (coarseness, directionality, contrast) Multi-resolution simultaneous auto-regressive model Canny edge histogram Gabor texture feature Pyramid-structured wavelet transform (PWT) feature Tree-structured transform (TWT) feature Ma, et al. Benchmarking image features content-based image retrieval, Record of the 32nd Asilomar Conf. on Signals, Systems & Computers, vol 1., 1998.

Color Moments 39 Containing only the dominant features instead of storing the complete color distributions. Store the first three moments of each color channel of an image in the index. Average Variance Skewness

Color Moments 40

Color Moments 41 Distance between two images I 1 and I 2 Diff. of average Diff. of variance Diff. of skewness

Color Correlogram 42 A color correlogram expresses how the spatial correlation of parts of colors changes with distance. The histogram of an image is defined as The colors in are quantized into The notation is synonymous with and Huang, et al. Image indexing using color correlograms, CVPR, 1997.

Color Correlogram 43 Let a distance be a fixed a priori. Then the correlogram of is defined for, This value gives the probability that a pixel at distance away from the given pixel is of color., The autocorrelogram of captures spatial correlation between identical colors only

44 Essence of Video Wei-Ta Chu 2009/9/24

Constitution of Digital Video Data 45 A natural video stream is continuous in both spatial and temporal domains. In order to represent and process a video stream digitally it is necessary to sample spatially and temporally. Spatial domain Temporal domain

Video Stream 46 Natural scene Camera RGB to YC 1 C 2 Monitor Processing, Storage, Transmission YC 1 C 2 To RGB

Video Data Representation 47 RGB is not very efficient for representing real-world images, since equal bandwidths are required to describe all the three color components. E.g. 8 bits per component, then 24 bits per pixel Human eye is more sensitive to luminance. Many image coding standards and broadcast systems use luminance and color difference signals. YUV and YIQ for analog television standards, YCbCr for their digital version.

Color Models in Video 48 Largely derive from older analog methods for coding color for TV. Luminance is separated from color information. YIQ is the color space used by the NTSC color TV system, employed mainly in North and Central America, and Japan. In Europe, video tape uses the PAL and SECAM codings, which are based on TV that uses a matrix transform called YUV. Digital video mostly uses a matrix transform called YCbCr that is closely related to YUV.

TV Encoding System 49 PAL, short for Phase Alternating Line, is a color encoding system used in broadcast television systems in large parts of the world. SECAM, French for "Sequential Color with Memory"), is an analog color television system first used in France. NTSC is the analog television system in use in the United States, Canada, Japan, South Korea, Taiwan, the Philippines, Mexico, and some other countries

The YUV Color Model 50 The YUV model defines a color space in terms of one luma (brightness) and two chrominance components. The YUV color model is used in the PAL, NTSC, and SECAM composite color video standards. YUV signals are created from an original RGB source. The weighted values of R, G, and B are added together to produce a single Y signal.

The YUV Color Model 51 The U signal is then created by subtracting the Y from the blue signal, and then scaling; V is created by subtracting the Y from the red, and then scaling by a different factor. Y U V

The YCbCr Color Model 52 YCbCr is a family of color spaces used in video and digital photography systems. Y is the luma component and Cb and Cr are the blue and red chroma components. Recommendation 601 specifies 8-bit coding: Y C b C r

Chroma Subsampling 53 4:2:2 indicates horizontal subsampling of the Cb, Cr signals by a factor of 2. Of four pixels labeled as 0 to 3, all four Ys are sent, and every twocb sthe two Cr sare sent. (Y0,Cb0) (Y1,Cr0) (Y2,Cb2) (Y3,Cr2) 4:2:0 subsamples in both the horizontal and vertical dimensions by a factor of 2.

Examples 54 Given image resolution of 720x576 pixels represented with 8 bits each component, the bit rate required is: 4:4:4 resolution: 720x576x8x3 = 10 Mbits/frame 4:2:0 resolution: (720x576x8) + (360x288x8)x2 = 5 Mbits/frame

Motion Estimation 55 Successive video frames may contain the same objects (still or moving). Motion estimation examines the movement of objects in an image sequence to try to obtain vectors representing the estimated motion.

Motion Estimation 56 The Essence of Image and Video Compression, by A.C. Kokaram http://www.mee.tcd.ie/~ack/teaching/1e8/lecture3.pdf

Three Typical Types of Coded Picture 57 I frame (intraframe) Intraframe encoded without any temporal prediction P frame (forward predicted frame) Interframe encoded using motion predition from the previous I or P frame B frame (bidirectionally predicted frame) Interframe encoded using interpolated motion prediction between the previous I or P frames and the next I or P frames.

Motion Prediction 58 A typical Group of Picture (GOP) in MPEG-2

Short Introduction to Video Features 59 Motion-based features Camera motion, object motion Motion activity/magnitude Moving object detection Shot-based features Average shot length/shot change frequency Scene-based features

Motion Type 60 Camera motion (global motion) Zoom-in/Zoom-out Pan Tilt Object motion

Motion Activity/Magnitude 61 Attributes: Intensity of activity Direction of activity Spatial distribution of activity Indication of the number and size of active regions Temporal distribution of activity Variation of activity over the duration of a video segment or shot

62 Average Shot Length/Shot Change Frequency A statistical measurement which divides the total length of the film by the number of shots. Average duration of a shot between cuts Directors often change shots frequently (shorter ASL) to attract the audience E.g. commercials Video segments with longer ASLs usually present peaceful scenes.

Next Week 63 Video syntax analysis