Xuelong Li, Thomas Huang. University of Illinois at Urbana-Champaign


Non-Negative Graph Embedding
Jianchao Yang, Shuicheng Yan, Yun Fu, Xuelong Li, Thomas Huang
Department of ECE, Beckman Institute and CSL, University of Illinois at Urbana-Champaign

Outline
Non-negative part-based representation
Non-Negative Matrix Factorization (NMF)
Non-Negative Graph Embedding (NGE): graph embedding framework; our formulation
Experiment results: face recognition; localized basis; robustness to image occlusion
Conclusions

Non-negative Part-based Representation
Why non-negativity? It gives a better physical interpretation of non-negative data, such as absolute temperatures, light intensities, probabilities, and sound spectra.
Why part-based? There is psychological and physiological evidence for part-based representations in the human brain: the whole is perceived through perceptions of its parts.

Non-negative Matrix Factorization: Formulation
Multiplicative update rules guarantee non-negativity.
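The formula on this slide did not survive in the transcript. As a minimal sketch, assuming the standard NMF objective (minimize the squared Frobenius error between a non-negative data matrix and the product of two non-negative factors) and the classic multiplicative updates for it, the idea can be illustrated as follows. The function name `nmf_multiplicative`, the variable names, and the small constant `eps` are illustrative choices, not taken from the slides.

```python
import numpy as np

def nmf_multiplicative(X, rank, n_iter=200, eps=1e-10, seed=0):
    """Factor a non-negative X (m x n) as W (m x rank) @ H (rank x n).

    Assumption: squared Frobenius reconstruction error with the classic
    multiplicative updates; eps only guards against division by zero.
    """
    rng = np.random.default_rng(seed)
    m, n = X.shape
    W = rng.random((m, rank)) + eps   # non-negative initialization
    H = rng.random((rank, n)) + eps
    errors = []
    for _ in range(n_iter):
        # Each update multiplies by a ratio of non-negative terms,
        # so W and H can never become negative.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
        errors.append(np.linalg.norm(X - W @ H, "fro") ** 2)
    return W, H, errors

# Synthetic non-negative data with an exact rank-4 structure.
rng = np.random.default_rng(1)
X = rng.random((20, 4)) @ rng.random((4, 30))
W, H, errors = nmf_multiplicative(X, rank=4)
print("first error:", errors[0], "last error:", errors[-1])
```

Because every step is an element-wise multiplication by a non-negative ratio, no projection or clipping is needed to keep the factors non-negative.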

What Does NMF Learn?
NMF does learn a part-based representation.
Problems: matrix factorization has no control over the properties of the parts. It is used in document clustering, but it is not good for recognition.
How the brain learns the discriminative parts is still unknown.

Non-negative Graph Embedding (NGE)
Motivation: learn a non-negative part-based representation that is also good for classification.
Method: reconstruction for learning the part-based basis; regularization with discriminant analysis.

A Better Scheme
[Diagram: Input, Reconstruction, Discriminant Analysis, Part-Based Basis, Output]
Use all available data for learning the basis, while guided by the labeling information.

Learn the Discriminative Parts
One straightforward solution is an objective defined over the data matrix, the part-based basis matrix, the coefficient matrix, and a function encoding the discriminative power of the coefficients.
The problem is how to choose this function and how to do the optimization.

Graph Embedding
Graph embedding framework [Yan et al., 2007]:
Intrinsic graph: characterizes the favorable relationships among the training data.
Penalty graph: characterizes the unfavorable relationships among the training data.
The objective is defined in terms of these two graphs, which can be unsupervised, supervised, or semi-supervised.
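The objective formula is missing from the transcript. As a hedged illustration, assuming the usual pairwise form of a graph-preserving criterion (a similarity-weighted sum of squared distances between embedded samples), the sketch below checks numerically that this cost equals twice the trace form built from the graph Laplacian. The names `laplacian`, `pairwise_cost`, `S`, and `Y` are illustrative, and the random similarity matrix stands in for an intrinsic graph.

```python
import numpy as np

def laplacian(S):
    """Graph Laplacian L = D - S for a symmetric similarity matrix S."""
    return np.diag(S.sum(axis=1)) - S

def pairwise_cost(Y, S):
    """sum_ij S_ij * ||y_i - y_j||^2, with samples stored as columns of Y."""
    diff = Y[:, :, None] - Y[:, None, :]      # shape (d, n, n)
    sq_dist = (diff ** 2).sum(axis=0)         # shape (n, n)
    return float((S * sq_dist).sum())

rng = np.random.default_rng(0)
n, d = 6, 2
S = rng.random((n, n))
S = (S + S.T) / 2.0            # symmetric weights (assumed stand-in graph)
np.fill_diagonal(S, 0.0)       # no self-loops
Y = rng.standard_normal((d, n))  # a candidate low-dimensional embedding

lhs = pairwise_cost(Y, S)
rhs = 2.0 * np.trace(Y @ laplacian(S) @ Y.T)
print(lhs, rhs)  # the two values agree up to floating-point error
```

The same identity applies to a penalty graph, which is why both graphs can enter an objective through their Laplacian matrices.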

NGE Formulation
Divide the feature space into two parts: the discriminant space and the complementary space for reconstruction. The objective is then defined on this split.

NGE Formulation
To make the problem solvable, the objective is changed to use the complementary space. Given the intrinsic graph and the penalty graph, the optimization problem can then be formulated.

Preliminaries
Definition 1: A matrix $B$ is called an M-matrix if (1) its off-diagonal entries are less than or equal to zero, and (2) the real parts of all its eigenvalues are positive.
Lemma 1: If $B$ is an M-matrix, its inverse is non-negative, that is, $B^{-1}(i,j) \ge 0$.
Definition 2: A function $G(A, A')$ is an auxiliary function for $F(A)$ if $G(A, A') \ge F(A)$ and $G(A, A) = F(A)$.
Lemma 2: If $G$ is an auxiliary function of $F$, then $F$ is non-increasing under the update rule $A^{t+1} = \arg\min_A G(A, A^t)$.
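Definition 1 and Lemma 1 can be checked numerically on a small example. The sketch below is illustrative only: the helper `is_m_matrix` and the 3 x 3 test matrix are assumptions chosen for the demonstration and do not come from the slides.

```python
import numpy as np

def is_m_matrix(B, tol=1e-12):
    """Check Definition 1: non-positive off-diagonal entries and
    eigenvalues whose real parts are all positive."""
    off_diag = B - np.diag(np.diag(B))
    eigenvalues = np.linalg.eigvals(B)
    return bool((off_diag <= tol).all() and (eigenvalues.real > tol).all())

# A symmetric tridiagonal example with 2 on the diagonal and -1 beside it.
B = np.array([[ 2.0, -1.0,  0.0],
              [-1.0,  2.0, -1.0],
              [ 0.0, -1.0,  2.0]])

B_inv = np.linalg.inv(B)
print("M-matrix:", is_m_matrix(B))
print("inverse is element-wise non-negative:", bool((B_inv >= -1e-12).all()))
```

This is the property the optimization relies on: solving a linear system with an M-matrix and a non-negative right-hand side yields a non-negative solution.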

Optimization Procedure
Initialize W and H with non-negative values; the optimization then alternates between W and H.
Optimize W, fixing H: an auxiliary function is defined, from which the update rule for W follows. The update involves a diagonal matrix with positive entries, which guarantees the non-negativity of W.

Optimization Procedure
Optimize H, fixing W: an auxiliary function is defined, and two quantities are optimized in turn. The two matrices that arise are M-matrices, whose inverses are element-wise non-negative, which guarantees the non-negativity of H.

General Framework
[Figure: intrinsic and penalty graphs for Marginal Fisher Analysis]
Our algorithm is a general framework, given the intrinsic and penalty graphs. These graphs can be unsupervised, supervised, or semi-supervised. We used the supervised Marginal Fisher Analysis (MFA) graph to demonstrate the framework.
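To make the two graphs concrete, here is a simplified sketch of MFA-style graph construction. It assumes that the intrinsic graph links each sample to its nearest same-class neighbors and that the penalty graph links each sample to its nearest neighbors from other classes; the per-sample penalty rule is a simplification of the marginal-pair selection used in MFA. The function `mfa_graphs` and the parameters `k1` and `k2` are hypothetical names for this illustration.

```python
import numpy as np

def mfa_graphs(X, labels, k1=2, k2=2):
    """Build symmetric 0/1 intrinsic and penalty graphs.

    X: (n_samples, n_features). Simplified, assumed construction:
    k1 nearest same-class neighbors -> intrinsic edges,
    k2 nearest other-class neighbors -> penalty edges.
    """
    labels = np.asarray(labels)
    n = X.shape[0]
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    intrinsic = np.zeros((n, n))
    penalty = np.zeros((n, n))
    for i in range(n):
        same = np.flatnonzero((labels == labels[i]) & (np.arange(n) != i))
        other = np.flatnonzero(labels != labels[i])
        for j in same[np.argsort(dist[i, same])[:k1]]:
            intrinsic[i, j] = intrinsic[j, i] = 1.0   # favorable pair
        for j in other[np.argsort(dist[i, other])[:k2]]:
            penalty[i, j] = penalty[j, i] = 1.0       # unfavorable pair
    return intrinsic, penalty

# Two synthetic classes of five 2-D points each.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (5, 2)), rng.normal(4.0, 1.0, (5, 2))])
labels = np.array([0] * 5 + [1] * 5)
intrinsic, penalty = mfa_graphs(X, labels)
print("intrinsic edges:", int(intrinsic.sum() // 2))
print("penalty edges:", int(penalty.sum() // 2))
```

Swapping in a different rule for either graph (for example, one that ignores labels) changes the supervision level without changing the rest of the pipeline.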

Face Recognition Experiments
Tested on three databases: CMU PIE, ORL, and FERET.
Compared with the unsupervised algorithms PCA, NMF, and LNMF (S. Li, CVPR 2001) and the supervised algorithms LDA and MFA.

Experiments
[Figure: learned non-negative part-based bases for NMF, LNMF, and NGE]

Robust to Occlusion
[Figures: occlusion examples; experiments]

Conclusions
Contributions: proposed a general framework called Non-Negative Graph Embedding (NGE). The supervised MFA graph is used to demonstrate the effectiveness of the algorithm.
Limitation: like other graph-based methods, NGE suffers from speed and scalability problems during off-line training.
Extension: unlabeled data can be incorporated into the basis learning, guided by the available label information.

Thank you!