Problem. Objective. Presentation Preview. Prior Work in Use of Color Segmentation. Prior Work in Face Detection & Recognition

Similar documents
Smart Traffic Control System Using Image Processing

Minimizing the Perception of Chromatic Noise in Digital Images

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

Chord Classification of an Audio Signal using Artificial Neural Network

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

Paulo V. K. Borges. Flat 1, 50A, Cephas Av. London, UK, E1 4AR (+44) PRESENTATION

DeepID: Deep Learning for Face Recognition. Department of Electronic Engineering,

@ Massachusetts Institute of Technology All rights reserved.

2. Problem formulation

Color Image Compression Using Colorization Based On Coding Technique

Essence of Image and Video

Copy Move Image Forgery Detection Method Using Steerable Pyramid Transform and Texture Descriptor

Essence of Image and Video

Automatic Laughter Detection

CS229 Project Report Polyphonic Piano Transcription

Journal of Field Robotics. Instructions to Authors

Basic Operations App Guide

Subjective Similarity of Music: Data Collection for Individuality Analysis

Audio-Based Video Editing with Two-Channel Microphone

Supplemental Material: Color Compatibility From Large Datasets

Distortion Analysis Of Tamil Language Characters Recognition

Music Recommendation from Song Sets

Improving Frame Based Automatic Laughter Detection

Extraction Methods of Watermarks from Linearly-Distorted Images to Maximize Signal-to-Noise Ratio. Brandon Migdal. Advisors: Carl Salvaggio

Speech Recognition and Signal Processing for Broadcast News Transcription

Detecting Musical Key with Supervised Learning

Man-Machine-Interface (Video) Nataliya Nadtoka coach: Jens Bialkowski

Reducing False Positives in Video Shot Detection

Composer Style Attribution

Topics in Computer Music Instrument Identification. Ioanna Karydi

An Introduction to Deep Image Aesthetics

Neural Network Predicating Movie Box Office Performance

Singer Identification

Speech and Speaker Recognition for the Command of an Industrial Robot

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Music Database Retrieval Based on Spectral Similarity

SMART VEHICLE SCREENING SYSTEM USING ARTIFICIAL INTELLIGENCE METHODS

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

DETECTION OF SLOW-MOTION REPLAY SEGMENTS IN SPORTS VIDEO FOR HIGHLIGHTS GENERATION

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Week 14 Music Understanding and Classification

Release Year Prediction for Songs

Quantitative Evaluation of Pairs and RS Steganalysis

Computational Modelling of Harmony

Document Analysis Support for the Manual Auditing of Elections

MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES

... A Pseudo-Statistical Approach to Commercial Boundary Detection. Prasanna V Rangarajan Dept of Electrical Engineering Columbia University

Hidden Markov Model based dance recognition

Automatic Laughter Detection

Automatic Identification of Instrument Type in Music Signal using Wavelet and MFCC

Comparative Study on Fingerprint Recognition Systems Project BioFinger

Decision-Maker Preference Modeling in Interactive Multiobjective Optimization

Interactive Tic Tac Toe

Automatically Creating Biomedical Bibliographic Records from Printed Volumes of Old Indexes

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

Detection of Panoramic Takes in Soccer Videos Using Phase Correlation and Boosting

Automatic Extraction of Popular Music Ringtones Based on Music Structure Analysis

Non Stationary Signals (Voice) Verification System Using Wavelet Transform

An Image Compression Technique Based on the Novel Approach of Colorization Based Coding

Biomimetic spectro-temporal features for music instrument recognition in isolated notes and solo phrases

Formalizing Irony with Doxastic Logic

About Giovanni De Poli. What is Model. Introduction. di Poli: Methodologies for Expressive Modeling of/for Music Performance

PERCEPTUAL QUALITY OF H.264/AVC DEBLOCKING FILTER

An Iot Based Smart Manifold Attendance System

Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

Research Article 2016

Automatic Piano Music Transcription

High Quality Digital Video Processing: Technology and Methods

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

GRADIENT-BASED MUSICAL FEATURE EXTRACTION BASED ON SCALE-INVARIANT FEATURE TRANSFORM

Joint Optimization of Source-Channel Video Coding Using the H.264/AVC encoder and FEC Codes. Digital Signal and Image Processing Lab

Behavior Forensics for Scalable Multiuser Collusion: Fairness Versus Effectiveness H. Vicky Zhao, Member, IEEE, and K. J. Ray Liu, Fellow, IEEE

Supervised Learning in Genre Classification

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Masking in Chrominance Channels of Natural Images Data, Analysis, and Prediction

LOCOCODE versus PCA and ICA. Jurgen Schmidhuber. IDSIA, Corso Elvezia 36. CH-6900-Lugano, Switzerland. Abstract

EMBEDDED ZEROTREE WAVELET CODING WITH JOINT HUFFMAN AND ARITHMETIC CODING

ISSN ICIRET-2014

POLYPHONIC INSTRUMENT RECOGNITION USING SPECTRAL CLUSTERING

hit), and assume that longer incidental sounds (forest noise, water, wind noise) resemble a Gaussian noise distribution.

Outline. Why do we classify? Audio Classification

Identifying Table Tennis Balls From Real Match Scenes Using Image Processing And Artificial Intelligence Techniques

Universität Bamberg Angewandte Informatik. Seminar KI: gestern, heute, morgen. We are Humor Beings. Understanding and Predicting visual Humor

Using the NTSC color space to double the quantity of information in an image

Lab 6: Edge Detection in Image and Video

On the Characterization of Distributed Virtual Environment Systems

CS 1674: Intro to Computer Vision. Face Detection. Prof. Adriana Kovashka University of Pittsburgh November 7, 2016

IDENTIFYING TABLE TENNIS BALLS FROM REAL MATCH SCENES USING IMAGE PROCESSING AND ARTIFICIAL INTELLIGENCE TECHNIQUES

Reconstruction of Ca 2+ dynamics from low frame rate Ca 2+ imaging data CS229 final project. Submitted by: Limor Bursztyn

LEARNING AUDIO SHEET MUSIC CORRESPONDENCES. Matthias Dorfer Department of Computational Perception

Nearest-neighbor and Bilinear Resampling Factor Estimation to Detect Blockiness or Blurriness of an Image*

Off-line Handwriting Recognition by Recurrent Error Propagation Networks

Neural Network for Music Instrument Identi cation

BUILDING A SYSTEM FOR WRITER IDENTIFICATION ON HANDWRITTEN MUSIC SCORES

A Bayesian Network for Real-Time Musical Accompaniment

WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs

Image Contrast Enhancement (ICE) The Defining Feature. Author: J Schell, Product Manager DRS Technologies, Network and Imaging Systems Group

A Music Retrieval System Using Melody and Lyric

Transcription:

Problem Facing the Truth: Using Color to Improve Facial Feature Extraction Problem: Failed Feature Extraction in OKAO Tracking generally works on Caucasians, but sometimes features are mislabeled or altogether lost/undetected Particularly faulty for dark complexions Maria Jabon Damien Cerbelaud Christopher Tsai March 2, 28 EE 362 Applied Vision and Image Systems no. 2 Objective Presentation Preview Solution: Improving Feature Extraction using Color More steady tracking Fewer false features Robust to perturbations: motion, blinking Fewer empty frames Focus: African-American Faces Subset of dark-skinned population Three subjects with particularly low tracking accuracy Test Data from OKAO Prior Work in Face Tracking Without color With color Our Methods Color edge boosting Windowing Results and Discussion Conclusion Possible Extensions no. 3 no. 4 Prior Work in Face Detection & Recognition Prior Work in Use of Color Segmentation Classic segmentation using luminance, grayscale Keypoint identification Template matching Neural networks train with database, adapt Feature-based registration [Boehnen, Russ, 25] Localized edge detection Threshold face first Stronger threshold for eyes, nose Feature-based approaches Color edge detection using vector gradients Image comparison methods Pairwise feature matching g( (SIFT, SURF) with gargantuan g training set Joint Wavelet Coefficients from RGB channels [Huang, Lai, 24] Skin region isolation Template matching within skin region [Campadelli, Lanzarotti, Lipori, 24] First Principal Component [Dikbas, Arici, Altunbasak, 27] no. 5 no. 6

Experimental Design Benefits of Eigenimages (PCA) Working with OKAO (face in Japanese) Input: Color (RGB) frames of a centered face System: Converts color frames to grayscale System: Pattern matching for eyes and mouth Output: Feature locations and statistics Where other methods fail, the principal component prevails: Grayscaling Pattern Matching OKAO A Different Problem Cannot simply detect color edges outside OKAO Must enhance color images rather than grayscaling them 4 4 Color Grid Color Laplacian 1 st Eigenimage no. 7 no. 8 Benefits of Eigenimages (PCA) Benefits of Eigenimages (PCA) Where other methods fail The principal component usually succeeds: Grayscaling Luminance Color Gradient Skin Color Grid 1 st Principal Component no. 9 no.1 Convert RGB to eigenspace [Dikbas, Arici, Altunbasak] Three principal components How to use the principal eigenimage? In place of the luminance channel? Boosting the luminance channel? Find edges, and boost luminance channel by edges? no.11 Replacing the Luminance Channel no.12

Boosting the Luminance Channel no.13 Edge-Boosting the Luminance Channel no.14 Color gradients How to use the color edges? In place of the luminance channel? (not feasible) Boosting Y (CbCr) RGB (H)S(V) (HS)I channels? Find edges, and boost luminance channel by edges? no.15 Edge-Boosting the Luminance Channel no.16 Edge-Boosting Each RGB Channel no.17 Boosting the Luminance Channel no.18

Histogram Equalization Histogram Equalization Skin color histogram 5 Color (RGB) Equalization.5 5 1 15 2 25 5 1 15 2 25 5 8 LUMINANCE Probability Mass LUMINANCE Probability Mass 6.5.9 4 2.8 5 1 15 2 25.8.7.6.5 5 1 15 2 25.6.4.4.3.2.2 5 1 15 2 25 5 1 15 2 25.1 5 1 15 2 25 5 1 15 2 25 no.19 no.2 Equalizing RGB Histograms Equalizing the Luminance Histogram.1.5.5 LUMINANCE Probability Mass 5 1 15 2 25.4.8.6.4 5 1 15 2 25 5 1 15 2 25.1.5 5 1 15 2 25 Luminance Not Equalized no.21 Luminance Equalized no.22 Other Scaling Methods Other Scaling Methods Scaling grayscale image between and 1 Luminance channel Maximum variance projection (~eigenimages) Project color points onto axis of maximum variance Enhances color contrast Results No noticeable improvement for Scaled Grayscale Small improvement for maximum variance method Reason: Color contrast is local information, so considering the entire image averages out the facial contrast information no.23 no.24

Windowing the Face Windowing the Face Consider local contrast / color information Simulation of adaptive windowing Original video estimates position of mouth and eyes Estimate used to adapt window Position at time n used to generate window at time (n + 1) Localizes relevant color or luminance values for maximal stretching no.25 no.26 Windowing + Scaling Windowing + Scaling Results Scaled Grayscale Maximal variance projection (~eigenimages) Maximal variance projection with emphasized luminance Few improvements using Scaled Grayscale Noticeable improvement for principal component Exceptional boost when luminance of first maximal variance projection is scaled and truncated no.27 no.28 Results and Evaluation Average Mean Confidence Average mean confidence unreliable due to outliers Percentage of frames with higher confidence resistant to outliers Visual evaluation Subject 5 18 Subject 5 18 Subject GrayScaleNW GrayScaleSCNW.4.4 5 18 ColorGrad ColorGrad Boosted RGB Equalize Eigenimages Y Equalize.22.56.5.56 2.5 5.8 8.5 5.14 14.1.1.52.82 PCA NoWin PCA LumScaleW1 PCA LumScaleW2 PCA LumScaleW3.8.37.39.43.7.28.5.7 no.29 no.3

Percentage of Frames with Higher Confidence Graphical Results Subject GrayScaleNW GrayScaleSCNW.316.316 5.5.5 18.44.4 Subject ColorGrad ColorGrad Boosted RGB Equalize Eigenimages YEqualize.315.46 5.13 3.6 18.5.13.38.8.7.6.5.4.3 Percentage of Frames With Higher Confidence for Each Method Subject Subject 5 Subject 18 Subject 5 18 PCA NoWin PCA LumScaleW1 PCA LumScaleW2 PCA LumScaleW3.2755.74.64.46.74.3.2.1 no.31 no.32 Visual Results Presentation Review Subject 5: Windowed Principal Component Luminance Scaled Subject : Color Gradient Edge Boosted Subject : Luminance Equalization no.33 Applicability of previous work OKAO Challenges Input: color image Color contrast boosting Y-Channel Replacement Luminance Boosting Edge-Based Boosting Equalization Advantages of Windowing 5 1 15 2 25 5 1 15 2 25 Metrics for success Windowing + PCA effective no.34 5 1 15 2 25 Facing the Future More extensive testing Other face trackers (we used OKAO) More subjects or various shades (we used three) Longer video sequences (we were limited by memory) Subjects with glasses Choosing the optimal method Using image histogram as a selector Predicting the effectiveness of a method before applying it Selective feature or color boosting Not the generic color edge select color range Accentuate the mouth and eyes no.35 Bibliography Wandell, Brian A. Foundations of Vision. Sunderland, MA: Sinauer & Associates, 1997. Gonzalez, Rafael C., Richard E. Woods, and Steven L. Eddins. Digital Image Processing Using MATLAB. Upper Saddle River, NJ: Pearson Prentice Hall, 24. Dikbas, Salih, Tarik Arici, and Yucel Altunbasak. Chrominance Edge Preserving Grayscale Transformation with Approximate First Principal Component for Color Edge Detection. Georgia Institute of Technology. ICIP 27. Ping, Scott T.Y., Chun Hui Weng, Boonping Lau. Face Detection through Template Matching and Color Segmentation. Stanford University. EE 368, May 23. Huang, Szu-Hao, Shang-Hong Lai. Detecting Faces from Color Video by Using Paired Wavelet Features. National Tsing Hua University. Proceedings of the IEEE Computer Society on Computer Vision and Pattern Recognition, 24. Campadelli, Paola, Rafaella Lanzarotti, Guiseppe Lipori. Face Detection in Color Images of Generic Scenes. IEEE Conference on Computational Intelligence for Homeland Security and Personal Safety. Venice, Italy, 21-22 July 24. Feris, Rogerio F., Teofilo Emidio de Campos, Roberto Marcondes Cesar Junior. Detection and Tracking of Facial Features in Video Sequences. Lecture Notes in Artificial Intelligence, vol. 1793, pp. 197-26, April 2. Boehnen, Chris, Trina Russ. A Fast Multi-Modal Approach to Facial Feature Detection. Proceedings of the Seventh IEEE Workshop on Applications of Computer Vision, 25. Jain, Anil K. Fundamentals of Digital Image Processing. Cambridge: Prentice Hall, 1988. no.36

Acknowledgments Professor Brian Wandell Professor Joyce Farrell Research Assistant, Dr. Manu Parmar Research Assistant, Dr. Peter Catrysse Teaching Assistant, Christopher Anderson OMRON Research & Development (OKAO) Stanford Virtual Reality Lab no.37