CMPT 365 Multimedia Systems. Mid-Term Review

Similar documents
Multimedia. Course Code (Fall 2017) Fundamental Concepts in Video

To discuss. Types of video signals Analog Video Digital Video. Multimedia Computing (CSIT 410) 2

Chapter 3 Fundamental Concepts in Video. 3.1 Types of Video Signals 3.2 Analog Video 3.3 Digital Video

Ch. 1: Audio/Image/Video Fundamentals Multimedia Systems. School of Electrical Engineering and Computer Science Oregon State University

5.1 Types of Video Signals. Chapter 5 Fundamental Concepts in Video. Component video

An Overview of Video Coding Algorithms

Multimedia Systems Video I (Basics of Analog and Digital Video) Mahdi Amiri April 2011 Sharif University of Technology

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

Fundamentals of Multimedia. Lecture 3 Color in Image & Video

VIDEO Muhammad AminulAkbar

Lecture 2 Video Formation and Representation

Welcome Back to Fundamentals of Multimedia (MR412) Fall, ZHU Yongxin, Winson

!"#"$%& Some slides taken shamelessly from Prof. Yao Wang s lecture slides

1. Broadcast television

Advanced Computer Networks

Multimedia Systems. Part 13. Mahdi Vasighi

Video Compression. Representations. Multimedia Systems and Applications. Analog Video Representations. Digitizing. Digital Video Block Structure

Audio and Video II. Video signal +Color systems Motion estimation Video compression standards +H.261 +MPEG-1, MPEG-2, MPEG-4, MPEG- 7, and MPEG-21

Motion Video Compression

Communication Theory and Engineering

Video coding standards

Part 1: Introduction to Computer Graphics

Digital Television Fundamentals

MULTIMEDIA TECHNOLOGIES

Television History. Date / Place E. Nemer - 1

So far. Chapter 4 Color spaces Chapter 3 image representations. Bitmap grayscale. 1/21/09 CSE 40373/60373: Multimedia Systems

8/30/2010. Chapter 1: Data Storage. Bits and Bit Patterns. Boolean Operations. Gates. The Boolean operations AND, OR, and XOR (exclusive or)

DCI Requirements Image - Dynamics

Objectives: Topics covered: Basic terminology Important Definitions Display Processor Raster and Vector Graphics Coordinate Systems Graphics Standards

Chrominance Subsampling in Digital Images

Mahdi Amiri. April Sharif University of Technology

OVE EDFORS ELECTRICAL AND INFORMATION TECHNOLOGY

Information Transmission Chapter 3, image and video

Rec. ITU-R BT RECOMMENDATION ITU-R BT PARAMETER VALUES FOR THE HDTV STANDARDS FOR PRODUCTION AND INTERNATIONAL PROGRAMME EXCHANGE

Image and video encoding: A big picture. Predictive. Predictive Coding. Post- Processing (Post-filtering) Lossy. Pre-

EMBEDDED ZEROTREE WAVELET CODING WITH JOINT HUFFMAN AND ARITHMETIC CODING

Module 1: Digital Video Signal Processing Lecture 5: Color coordinates and chromonance subsampling. The Lecture Contains:

COMP 249 Advanced Distributed Systems Multimedia Networking. Video Compression Standards

AT65 MULTIMEDIA SYSTEMS DEC 2015

Digital Media. Daniel Fuller ITEC 2110

Video 1 Video October 16, 2001

Downloads from:

Announcements. Project Turn-In Process. and URL for project on a Word doc Upload to Catalyst Collect It

Toward Better Chroma Subsampling By Glenn Chan Recipient of the 2007 SMPTE Student Paper Award

Chapter 4 Color in Image and Video. 4.1 Color Science 4.2 Color Models in Images 4.3 Color Models in Video

Implementation of an MPEG Codec on the Tilera TM 64 Processor

Comp 410/510. Computer Graphics Spring Introduction to Graphics Systems

Digital Representation

XC-77 (EIA), XC-77CE (CCIR)

Overview: Video Coding Standards

Understanding IP Video for

Lesson 2.2: Digitizing and Packetizing Voice. Optimizing Converged Cisco Networks (ONT) Module 2: Cisco VoIP Implementations

A video signal consists of a time sequence of images. Typical frame rates are 24, 25, 30, 50 and 60 images per seconds.

Monitor and Display Adapters UNIT 4

TV Character Generator

Data Manipulation. Audio and Image Representation. -Representation, Compression, and Communication Errors. Audio Representation

Software Analog Video Inputs

TEST PATTERN GENERATOR

Lecture 1: Introduction & Image and Video Coding Techniques (I)

PAL uncompressed. 768x576 pixels per frame. 31 MB per second 1.85 GB per minute. x 3 bytes per pixel (24 bit colour) x 25 frames per second

10 Digital TV Introduction Subsampling

Swept-tuned spectrum analyzer. Gianfranco Miele, Ph.D

Analog to Digital Converter. Last updated 7/27/18

QUALITY OF COMPUTER MUSIC USING MIDI LANGUAGE FOR DIGITAL MUSIC ARRANGEMENT

Lecture 18: Exam Review

CM3106 Solutions. Do not turn this page over until instructed to do so by the Senior Invigilator.

Essentials of the AV Industry Welcome Introduction How to Take This Course Quizzes, Section Tests, and Course Completion A Digital and Analog World

Chapter 14 D-A and A-D Conversion

Data Converter Overview: DACs and ADCs. Dr. Paul Hasler and Dr. Philip Allen

EECS150 - Digital Design Lecture 12 Project Description, Part 2

ZONE PLATE SIGNALS 525 Lines Standard M/NTSC

CS2401-COMPUTER GRAPHICS QUESTION BANK

Electronic Publishing

Content storage architectures

4. ANALOG TV SIGNALS MEASUREMENT

B I O E N / Biological Signals & Data Acquisition

An Alternative Architecture for High Performance Display R. W. Corrigan, B. R. Lang, D.A. LeHoty, P.A. Alioshin Silicon Light Machines, Sunnyvale, CA

Inputs and Outputs. Review. Outline. May 4, Image and video coding: A big picture

Digital Media. Daniel Fuller ITEC 2110

Checkpoint 2 Video Encoder

Essence of Image and Video

Data Storage and Manipulation

Rounding Considerations SDTV-HDTV YCbCr Transforms 4:4:4 to 4:2:2 YCbCr Conversion

Video compression principles. Color Space Conversion. Sub-sampling of Chrominance Information. Video: moving pictures and the terms frame and

Spatio-temporal inaccuracies of video-based ultrasound images of the tongue

Computer Graphics Hardware

Multimedia Systems Video I (Basics of Analog and Digital Video) Mahdi Amiri November 2015 Sharif University of Technology

Multimedia Networking

Part 1: Introduction to computer graphics 1. Describe Each of the following: a. Computer Graphics. b. Computer Graphics API. c. CG s can be used in

How to Match the Color Brightness of Automotive TFT-LCD Panels

Lecture 2 Video Formation and Representation

Kramer Electronics, Ltd. USER MANUAL. Model: FC Analog Video to SDI Converter

Multimedia: is any combination of: text, graphic art, sound, animation, video delivered by computer or electronic means.

Multimedia Communications. Video compression

Digital Video Telemetry System

FAX Image Compression

Analog and Digital Video Basics

Primer. A Guide to Standard and High-Definition Digital Video Measurements. 3G, Dual Link and ANC Data Information

Midterm Review. Yao Wang Polytechnic University, Brooklyn, NY11201

Elements of a Television System

Transcription:

CMPT 365 Multimedia Systems Mid-Term Review Xiaochuan Chen Spring 2017 CMPT365 Multimedia Systems 1

Adminstrative Mid-Term: Feb 22th, In Class, 50mins Still have a course on Monday Feb 20 th!!! Pick up assignment: Today 4:30~5:30 with TA A2 will be released CMPT365 Multimedia Systems 2

Outline Media Representation - Audio Media Representation - Image Media Representation - Video Lossless Compression CMPT365 Multimedia Systems 3

Quantization and Sampling CMPT365 Multimedia Systems 4

Sampling Rate cont d For correct sampling we must use a sampling rate equal to at least twice the maximum frequency content in the signal. This rate is called the Nyquist rate. The relationship among the Sampling Frequency, True Frequency, and the Alias Frequency is as follows: CMPT365 Multimedia Systems 5

Sampling Rate cont d Nyquist frequency: half of the Sampling rate Since it would be impossible to recover frequencies higher than Nyquist frequency in any event, most systems have an antialiasing filter that restricts the frequency content in the input to the sampler to a range at or below Nyquist frequency. Sampling theory Nyquist theorem If a signal is band-limited, i.e., there is a lower limit f1 and an upper limit f2 of frequency components in the signal, then the sampling rate should be at least 2(f2 f1). CMPT365 Multimedia Systems 6

Quantization Noise Quantization noise: the difference between the actual value of the analog signal, for the particular sampling time, and the nearest quantization interval value. At most, this error can be as much as half of the interval. The quality of the quantization is characterized by the Signal to Quantization Noise Ratio (SQNR). CMPT365 Multimedia Systems 7

Signal to Noise Ratio (SNR) Signal to Noise Ratio (SNR): the ratio of the power of the correct signal and the noise A common measure of the quality of the signal The ratio can be huge and often non-linear So practically, SNR is usually measured in logscale: decibels (db), where 1 db is 1/10 Bel. The SNR value, in units of db, is defined in terms of base-10 logarithms of squared voltages, as follows: CMPT365 Multimedia Systems 8

Common sound levels CMPT365 Multimedia Systems 9

Signal-to-Quantization Noise Ratio (SQNR) cont d For a quantization accuracy of N bits per sample, the peak SQNR can be simply expressed: 6.02N is the worst case. Note: We map the maximum signal to 2 N 1 1 ( 2 N 1 ) and the most negative signal to 2 N 1. Dynamic range : the ratio of maximum to minimum absolute values of the signal: V max /V min. The max abs. value V max gets mapped to 2 N 1 1; the min abs. value V min gets mapped to 1. V min is the smallest positive voltage that is not masked by noise. The most negative signal, V max, is mapped to 2 N 1. CMPT365 Multimedia Systems 10

Linear and Non-linear Quantization q Linear format: samples are typically stored as uniformly quantized values. Non-uniform quantization: set up more finely-spaced levels where humans hear with the most acuity. Weber s Law stated formally says that equally perceived differences have values proportional to absolute levels: ΔResponse ΔStimulus/Stimulus (6.5) Inserting a constant of proportionality k, we have a differential equation that states: with response r and stimulus s. dr = k (1/s) ds (6.6) CMPT365 Multimedia Systems 11

Linear and Non-linear Quantization Fig. 6.6: Nonlinear transform for audio signals. The parameter µ is set to µ = 100 or µ = 255; the parameter A for the A-law encoder is usually set to A = 87.6. The µ-law in audio is used to develop a nonuniform quantization rule for sound: uniform quantization of r gives finer resolution in s at the quiet end. CMPT365 Multimedia Systems 12

MIDI: Musical Instrument Digital Interface Use the sound card s defaults for sounds: use a simple scripting language and hardware setup called MIDI. MIDI Overview MIDI is a scripting language it codes events that stand for the production of sounds. E.g., a MIDI event might include values for the pitch of a single note, its duration, and its volume. CMPT365 Multimedia Systems 13

MIDI Concepts MIDI channels are used to separate messages. (a) There are 16 channels numbered from 0 to 15. The channel forms the last 4 bits (the least significant bits) of the message. (b) Usually a channel is associated with a particular instrument: e.g., channel 1 is the piano, channel 10 is the drums, etc. (c) Nevertheless, one can switch instruments midstream, if desired, and associate another instrument with any channel. CMPT365 Multimedia Systems 14

Synthesizer: MIDI Terminology was, and still can be, a stand-alone sound generator that can vary pitch, loudness, and tone color. Units that generate sound are referred to as tone modules or sound modules. Sequencer: started off as a special hardware device for storing and editing a sequence of musical events, in the form of MIDI data. Now it is more often a software music editor on the computer. MIDI Keyboard: produces no sound, instead generating sequences of MIDI in- structions, called MIDI messages MIDI messages are rather like assembler code and usually consist of just a few bytes CMPT365 Multimedia Systems 15

6.2.2 Hardware Aspects of MIDI The MIDI hardware setup consists of a 31.25 kbps serial connection. Usually, MIDI-capable units are either Input devices or Output devices, not both. A traditional synthesizer is shown in Fig. 6.11: Fig. 6.11: A MIDI synthesizer CMPT365 Multimedia Systems 16

The physical MIDI ports consist of 5-pin connectors for IN and OUT, as well as a third connector called THRU. a) MIDI communication is half-duplex. b) MIDI IN is the connector via which the device receives all MIDI data. c) MIDI OUT is the connector through which the device transmits all the MIDI data it generates itself. d) MIDI THRU is the connector by which the device echoes the data it receives from MIDI IN. Note that it is only the MIDI IN data that is echoed by MIDI THRU all the data generated by the device itself is sent via MIDI OUT. CMPT365 Multimedia Systems 17

A typical MIDI sequencer setup is shown in Fig. 6.12: Fig. 6.12: A typical MIDI setup CMPT365 Multimedia Systems 18

Table 6.3: MIDI voice messages Voice Message Status Byte Data Byte1 Data Byte2 Note Off &H8n Key number Note Off velocity Note On &H9n Key number Note On velocity Poly. Key Pressure &HAn Key number Amount Control Change &HBn Controller num. Controller value Program Change &HCn Program number None Channel Pressure &HDn Pressure value None Pitch Bend &HEn MSB LSB (** &H indicates hexadecimal, and n in the status byte hex value stands for a channel number. All values are in 0..127 except Controller number, which is in 0..120) CMPT365 Multimedia Systems 19

Outline Media Representation - Audio Media Representation - Image Media Representation - Video Lossless Compression CMPT365 Multimedia Systems 20

Color Formation R = E(λ) S(λ) q R (λ) dλ G = E(λ) S(λ) q G (λ) dλ B = E(λ) S(λ) q B (λ) dλ CMPT365 Multimedia Systems 21

4.1.6 Gamma Correction The light emitted is in fact roughly proportional to the voltage raised to a power; this power is called gamma, with symbol γ. (a) Thus, if the file value in the red channel is R, the screen emits light proportional to R γ, with SPD equal to that of the red phosphor paint on the screen that is the target of the red channel electron gun. The value of gamma is around 2.2. (b) It is customary to append a prime to signals that are gammacorrected by raising to the power (1/γ) before transmission. Thus we arrive at linear signals: R R = R 1/γ (R ) γ R CMPT365 Multimedia Systems 22

Gamma Correction cont d R g Left: light output from CRT with no gamma-correction applied. -- Darker values are displayed too dark. Right: pre-correcting signals by applying the power law Normalization (0-1)? 1/ R g CMPT365 Multimedia Systems 23

Gamma Correction cont d CMPT365 Multimedia Systems 24

Color Space: RGBà YUV Solution: convert to other spaces Why? Display device, compression (R, G, B) Color Conversion (Y, U, V) Compress (R, G, B) For display Inverse Color Conversion (Y, U, V) Decompress CMPT365 Multimedia Systems 25

Color Space R G B Y Cb Cr Most information is in Y channel (brightness) Cb and Cr are small à easier for compression Human eyes are not sensitive to color error Don t need high resolution for color component CMPT365 Multimedia Systems 26

Color Space: Down-sampling Down-sampling color components to improve compression Luma sample Chroma sample YUV 4:4:4 No downsampling Of Chroma MPEG-1 MPEG-2 YUV 4:2:2 YUV 4:2:0 2:1 horizontal downsampling 2:1 horizontal downsampling of chroma components of chroma components 2 chroma samples for 1 chroma sample for every every 4 luma samples 4 luma samples Widely used CMPT365 Multimedia Systems 27

Raw YUV Data File Format In YUV 4:2:0, number of U and V samples are 1/4 of the Y samples YUV samples are stored separately: Image: YYYY..Y UU U VV V (row by row in each channel) Video: YUV of frame 1, YUV of frame 2, CIF (Common Intermediate format): 352 x 288 pixels for Y, 176 x 144 pixels for U, V U V Y QCIF (Quarter CIF): 176 x 144 pixels for Y, 88 x 72 pixels for U, V CIF, and QCIF formats are widely used for video conference Y: 176 x 144 Sample Matlab code: U: 88 x 72 V: 88 x 72 readyuv('foreman.qcif',176, 144, 1, 1);; CMPT365 Multimedia Systems 28

Dithering Rationale: calculate square patterns of dots such that values from 0 to 255 correspond to patterns that are more and more filled at darker pixel values, for printing on a 1-bit printer. Strategy: Replace a pixel value by a larger pattern, say 2x 2 or 4 x 4, such that the number of printed dots approximates the varying-sized disks of ink used in analog, in halftone printing (e.g., for newspaper photos). 1. Half-tone printing is an analog process that uses smaller or larger filled circles of black ink to represent shading, for newspaper printing. 2. For example, if we use a 2 x 2 dither matrix CMPT365 Multimedia Systems 29

Dithering cont d we can first re-map image values in 0..255 into the new range 0..4 by (integer) dividing by 256/5. Then, e.g., if the pixel value is 0 we print nothing, in a 2 x 2 area of printer output. But if the pixel value is 4 we print all four dots. The rule is: If the intensity is > the dither matrix entry then print an on dot at that entry location: replace each pixel by an n x n matrix of dots. Note that the image size may be much larger, for a dithered image, since replacing each pixel by a 4 x 4 array of dots, makes an image 16 times as large. CMPT365 Multimedia Systems 30

Ordered Dithering A clever trick can get around this problem. Suppose we wish to use a larger, 4 x 4 dither matrix, such as An ordered dither consists of turning on the printer out-put bit for a pixel if the intensity level is greater than the particular matrix element just at that pixel position. Fig. 4 (a) shows a grayscale image of Lena. The ordered-dither version is shown as Fig. 4 (b), with a detail of Lena's right eye in Fig. 4 (c). CMPT365 Multimedia Systems 31

Dithering cont d Algorithm for ordered dither, with n x n dither matrix, is as follows: BEGIN for x = 0 to x max END for y = 0 to y max // columns // rows i = x mod n j = y mod n // I(x, y) is the input, O(x, y) is the output, //D is the dither matrix. if I(x, y) > D(i, j) O(x, y) = 1; else O(x, y) = 0; CMPT365 Multimedia Systems 32

Popular File Formats 8-bit GIF : one of the most important formats because of its historical connection to the WWW and HTML markup language as the first image type recognized by net browsers. JPEG: currently the most important common file format. CMPT365 Multimedia Systems 33

Outline Media Representation - Audio Media Representation - Image Media Representation - Video Lossless Compression CMPT365 Multimedia Systems 34

Analog Video An analog signal f(t) samples a time-varying image Progressive scanning traces through a complete picture (a frame) row-wise for each time interval. Interlaced scanning Odd-numbered lines traced first, and then the evennumbered lines. odd" and even" fields - two fields make up one frame Widely used in traditional (non-digital) TV CMPT365 Multimedia Systems 35

NTSC Video NTSC (National Television System Committee) TV standard is mostly used in North America and Japan YIQ color model 4:3 aspect ratio (i.e., the ratio of picture width to its height) 525 scan lines per frame at 30 frames per second (fps). Interlaced scanning, and each frame is divided into two fields, with 262.5 lines/field horizontal sweep frequency is 525x29.97 = 15,734 lines/sec, each line is swept out in 1/15,734 = 63.6 us the horizontal retrace takes 10.9 sec, this leaves 52.7 sec for the active line signal during which image data is displayed PAL in Asia/Europe, SECAM in Europe All faded out (Canada, Aug 31, 2011) CMPT365 Multimedia Systems 36

Digital Video Why digital video? Advantages Stored on digital device or in memory Faithful duplication in digital domain Good or bad? Direct (random) access, nonlinear video editing achievable as a simple, rather than a complex task Ease of manipulation (noise removal, cut and paste, etc.) Ease of encryption and better tolerance to channel noise Multimedia communications Integration to various multimedia applications CMPT365 Multimedia Systems 37

Analog Video Display Interfaces Component video, Composite video, S-video, VGA CMPT365 Multimedia Systems 38

Entropy Suppose: a data source generates output sequence from a set {A1, A2,, AN} P(Ai): Probability of Ai First-Order Entropy (or simply Entropy): the average self-information of the data set H = å - P A )log P( ) i ( i 2 A i The first-order entropy represents the minimal number of bits needed to losslessly represent one output of the source. CMPT365 Multimedia Systems 39

Shannon-Fano Coding Shannon-Fano Algorithm - a top-down approach Sort the symbols according to the frequency count of their occurrences. Recursively divide the symbols into two parts, each with approximately the same number of counts, until all parts contain only one symbol. Example: coding of HELLO CMPT365 Multimedia Systems 40

Coding Tree CMPT365 Multimedia Systems 41

Huffman Coding Source alphabet A = {a1, a2, a3, a4, a5} Probability distribution: {0.2, 0.4, 0.2, 0.1, 0.1} 1 01 000 0010 0011 Sort combine 1 a2 (0.4) 01 a1(0.2) 000 a3(0.2) 0010 a4(0.1) 0.2 a5(0.1) 0011 Sort 0.4 0.2 0.2 0.2 combine Sort combine Sort combine 1 0.4 01 0.4 000 0.2 0.4 001 Assign code 1 0.6 0 1 00 0.6 0.4 1 01 Note: Huffman codes are not unique! Labels of two branches can be arbitrary. Multiple sorting orders for tied probabilities CMPT365 Multimedia Systems 42

Exam Sample MIDI What is MIDI? How many I/O ports does MIDI support? What are they? We have suddenly invented a new kind of music: 18- tonemusic, that requires a keyboard with 180 keys. How would we have to change the MIDI standard to be able to play this music? CMPT365 Multimedia Systems 43

Exam Sample Color Look up table What is a color look-up table and how is it used to represent color? Give an advantage and a disadvantage of this representation with respect to true color (24-bit) color How do you convert from 24-bit color to an 8-bit color look up table representation? CMPT365 Multimedia Systems 44