Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution

Similar documents
Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

Research Article Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps

MUSICAL INSTRUMENT IDENTIFICATION BASED ON HARMONIC TEMPORAL TIMBRE FEATURES

Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons

An Accurate Timbre Model for Musical Instruments and its Application to Classification

Automatic Identification of Instrument Type in Music Signal using Wavelet and MFCC

Musical Instrument Recognizer Instrogram and Its Application to Music Retrieval based on Instrumentation Similarity

Music Information Retrieval with Temporal Features and Timbre

Topics in Computer Music Instrument Identification. Ioanna Karydi

Application Of Missing Feature Theory To The Recognition Of Musical Instruments In Polyphonic Audio

WE ADDRESS the development of a novel computational

MUSI-6201 Computational Music Analysis

Musical instrument identification in continuous recordings

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

Semi-supervised Musical Instrument Recognition

THE importance of music content analysis for musical

Supervised Musical Source Separation from Mono and Stereo Mixtures based on Sinusoidal Modeling

The tempo MUSICAL APPRECIATIONS MUSICAL APPRECIATION SHEET 1. slow. Can you hear which is which? Write a tick ( ) in the PIECES OF MUSIC

POLYPHONIC INSTRUMENT RECOGNITION USING SPECTRAL CLUSTERING

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

TABLE OF CONTENTS CHAPTER 1 PREREQUISITES FOR WRITING AN ARRANGEMENT... 1

Cross-Dataset Validation of Feature Sets in Musical Instrument Classification

MUSICAL NOTE AND INSTRUMENT CLASSIFICATION WITH LIKELIHOOD-FREQUENCY-TIME ANALYSIS AND SUPPORT VECTOR MACHINES

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

DELAWARE MUSIC EDUCATORS ASSOCIATION ALL-STATE ENSEMBLES GENERAL GUIDELINES

MOTIVATION AGENDA MUSIC, EMOTION, AND TIMBRE CHARACTERIZING THE EMOTION OF INDIVIDUAL PIANO AND OTHER MUSICAL INSTRUMENT SOUNDS

638 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 3, MARCH 2010

Recognising Cello Performers using Timbre Models

AUTOM AT I C DRUM SOUND DE SCRI PT I ON FOR RE AL - WORL D M USI C USING TEMPLATE ADAPTATION AND MATCHING METHODS

Norman Public Schools MUSIC ASSESSMENT GUIDE FOR GRADE 8

Recognising Cello Performers Using Timbre Models

Convention Paper Presented at the 115th Convention 2003 October New York, NY, USA

Music Standard 1. Standard 2. Standard 3. Standard 4.

How to Use This Book and CD

Prelude. Name Class School

Singer Identification

Step I - Online Instrumental Music Registration May 21, 2018 through June 5, 2018

FILE # HAPPY BIRTHDAY EUPHONIUM SHEET MUSIC

Instrument Timbre Transformation using Gaussian Mixture Models

Pop Quartets For All: Trombone, Baritone B.C., Bassoon, Tuba (Pop Instrumental Ensembles For All) By Story;Michael

Neural Network for Music Instrument Identi cation

The Conservatory School Middle Grades Audition Guidelines

IFB 16-24, Musical Instruments Repair Service Tabsheet

Dr. Rob McWilliams ~ Education Outreach Clinician, Yamaha Music Australia Dr. Heather McWilliams ~ Instrumental Music Teacher, Education Queensland

Topic 10. Multi-pitch Analysis

We Can. DESIGN. BUILD. DELIVER Catalog wibenchmfg.com. Industrial Education Office Health

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

How Deep The Father s Love For Us

Greater Cleveland Instrumental Solo and Ensemble Contest Association. RULES AND REGULATIONS (revised September 2016)

FLIGHT OF THE BUMBLEBEE CLARINET DOWNLOAD

Workshop Friday, June 27th 1:00 PM

GENERAL PRICE LIST February 2018

GENERAL PRICE LIST December 2017

Analysis, Synthesis, and Perception of Musical Sounds

AUTOREGRESSIVE MFCC MODELS FOR GENRE CLASSIFICATION IMPROVED BY HARMONIC-PERCUSSION SEPARATION

Enhancing Ensemble Balance by: William W. Gourley

Music Study Guide. Moore Public Schools. Definitions of Musical Terms

Wes-Boland Eisteddfod

Subjective Similarity of Music: Data Collection for Individuality Analysis

Instrumental & Vocal Music Program

Christ The Lord Is Risen Today (#2)

CHAPTER 14 INSTRUMENTS

BOPLICITY / MARK SCHEME

CHAPTER THIRTEEN FINGERING CHARTS

The Elements of Music

The Elements of Music

Alta High School Instrumental Music Audition Packet

A FUNCTIONAL CLASSIFICATION OF ONE INSTRUMENT S TIMBRES

Classification of Timbre Similarity

Data-Driven Solo Voice Enhancement for Jazz Music Retrieval

FREE SHEET MUSIC SAXOPHONE - DOWNLOAD PDF, MP3 & MIDI PLAY SMART-PRACTICE SERIES FREE MUSIC LESSONS FROM JAZZ

INSTRUMENTAL TEACHING PROGRAMME

Automatic Rhythmic Notation from Single Voice Audio Sources

1 Hour IAI F Hours

CMEA Eastern Region Middle School Audition Repertoire ERMS Brass/Woodwind/Percussion

Ultimate Christmas Instrumental Solos For Strings: Violin, Book & CD (Ultimate Instrumental Solos Series) By Alfred Music

List of Original Compositions, by Genre

The Story of the Woodwind Family. STUDY GUIDE Provided by jewel winds

B Flat Clarinet Solos With Piano - Nocturne READ ONLINE

Ultimate Movie Instrumental Solos: Clarinet (Book & CD) (Pop Instrumental Solo) By Alfred Publishing Staff READ ONLINE

Music Genre Classification and Variance Comparison on Number of Genres

Welcome to the West Babylon Musical Instrument Program!

WMEA WIAA State Solo and Ensemble Contest 2012

On Human Capability and Acoustic Cues for Discriminating Singing and Speaking Voices

INSTRUMENTAL MUSIC PROGRAM

Guide to Band Instruments

Improving Frame Based Automatic Laughter Detection

GOOD-SOUNDS.ORG: A FRAMEWORK TO EXPLORE GOODNESS IN INSTRUMENTAL SOUNDS

MUSIC. Make a musical instrument of your choice out of household items. 5. Attend a music (instrumental or vocal) concert.

Danville Public Schools Music Curriculum Preschool & Kindergarten

INSTRUMENTAL TUITION BURSARY

Oak Bay Band MUSIC THEORY LEARNING GUIDE LEVEL IA

Parameter Estimation of Virtual Musical Instrument Synthesizers

SECTION E - INSTRUMENTAL

Video-based Vibrato Detection and Analysis for Polyphonic String Music

Chord Classification of an Audio Signal using Artificial Neural Network

Today's Pop & Rock Hits Instrumental Solos: Tenor Sax (Book & CD) (Alfred's Instrumental Play-Along) By Alfred Publishing Staff

first year charts Preview Only Legal Use Requires Purchase Pacific Attitude for jazz ensemble JAZZ VINCE GASSI INSTRUMENTATION

Transcription:

Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution Tetsuro Kitahara* Masataka Goto** Hiroshi G. Okuno* *Grad. Sch l of Informatics, Kyoto Univ. **PRESTO JST / Nat l Inst. Adv. Ind. Sci. & Tech. ICASSP 03 (6-10 th Apr. 2003 in Hong Kong)

Today s talk 1. What is musical instrument identification? 2. What is difficult in musical instrument identification? The pitch dependency of timbre 3. How is the pitch dependency coped with? Approximate it as a function of F0 4. Musical instrument identification using F0- dependent multivariate normal distribution 5. Experimental results 6. Conclusions

1. What is musical instrument identification? It is to obtain the name of musical instruments from sounds (acoustical signals). It is useful for music automatic transcription, music information retrieval, etc. Its research began recently (since 1990s). p(x wpiano) Feature Extraction (e.g. Decay speed, Spectral centroid) p(x w flute ) w = argmax p(w X) = argmax p(x w) p(w) <inst>piano</inst>

2. What is difficult in musical instrument identification? The pitch dependency of timbre e.g. Low-pitch piano sound = Slow decay High-pitch piano sound = Fast decay 0.5 (a) Pitch = C2 (65.5Hz) 0.5 (b) Pitch = C6 (1048Hz) 0 0 0.5 0 1 2 3 time [s] -0.5 0 1 2 3 time [s]

3. How is the pitch dependency coped with? Most previous studies have not dealt with the pitch dependency. Example: [Martin99] used hierarchical classification. [Brown99] used cepstral coefficients. [Eronen00] used both techniques. [Kashino98] developed a system for computational music scene analysis. [Kashino00] introduced template adaptation and musical contexts

3. How is the pitch dependency coped with? Proposal: Approximate the pitch dependency of each feature as a function of fundamental frequency (F0)

3. How is the pitch dependency coped with? An F0-dependent multivariate normal distribution has following two parameters: F0-dependent mean function which captures the pitch dependency (i.e. the position of distributions of each F0) F0-normalized covariance which captures the non-pitch dependency

4. Musical instrument identification using F0-dependent multivariate normal distribution 1 st step: Feature extraction 129 features defined based on consulting literatures are extracted. e.g. Spectral centroid (which captures brightness of tones) Piano Spectral centroid Spectral centroid Flute

4. Musical instrument identification using F0-dependent multivariate normal distribution 1 st step: Feature extraction 129 features defined based on consulting literatures are extracted. e.g. Decay speed of power Piano decayed not decayed Flute

4. Musical instrument identification using F0-dependent multivariate normal distribution 2 nd step: Dimensionality reduction First, the 129-dimensional feature space is transformed to a 79-dimensional space by PCA (principal component analysis) (with the proportion value of 99%) Second, the 79-dimensional feature space is transformed to an 18-dimensional space by LDA (linear discriminant analysis)

4. Musical instrument identification using F0-dependent multivariate normal distribution 3 rd step: Parameter estimation First, the F0-dependent mean function is approximated as a cubic polynomial.

4. Musical instrument identification using F0-dependent multivariate normal distribution 3 rd step: Parameter estimation Second, the F0-normalized covariance is obtained by subtracting the F0-dependent mean from each feature. eliminating the pitch dependency

4. Musical instrument identification using F0-dependent multivariate normal distribution Final step: Using the Bayes decision rule The instrument w satisfying w = argmax [log p(x w; f) + log p(w; f)] is determined as the result. p(x w; f) - A probability density function of the F0- dependent multivariate normal distribution. - Defined using the F0-dependent mean function and the F0-normalized covariance.

5. Experiments (Conditions) Database: A subset of RWC-MDB-I-2001 Consists of solo tones of 19 real instruments with all pitch range. Contains 3 individuals and 3 intensities for each instrument. Contains normal articulation only. The number of all sounds is 6,247. Using the 10-fold cross validation. Evaluate the performance both at individualinstrument level and at category level.

Piano Guitars Strings Brass Saxophones Double Reeds Clarinet Air Reeds Piano Classical Guitar Ukulele Violin Viola Trumpet Soprano Sax Alto Sax Oboe Clarinet Piccolo Flute Acoustic Guitar Cello Trombone Tenor Sax Baritone Sax Faggoto Recorder

5. Experiments (Results) Recognition rate[%] 100 80 60 40 20 0 Baseline Proposed Individual level (19 classes) Category level (8 classes) Recognition rates: 79.73% (at individual level) 90.65% (at category level) Improvement: 4.00% (at individual level) 2.45% (at category level) Error reduction (relative): 16.48% (at individual level) 20.67% (at category level)

5. Experiments (Results) The recognition rates of following 6 instruments were improved by more than 7%. Recognition rates[%] 100 80 60 40 20 0 Piano Trumpet Trombone Soprano Sax Baritone Sax Baseline Proposed Faggoto Piano: The best improved (74.21% 83.27%) Because the piano has the wide pitch range.

6. Conclusions To cope with the pitch dependency of timbre in musical instrument identification, F0-dependent multivariate normal distribution is proposed. Experimental results: Recognition rate: 75.73% 79.73% (Using 6,247 solo tones of 19 instruments) Future works: Evaluation against mixture of sounds Development of application systems using the proposed method.

Recognition rates[%] 100 80 60 40 20 0 Piano Recognition rates at category level Guitar Strings Brass Sax Baseline Proposed Dbl Rd. ClarinetAir Rd. Err Rdct 35% 8% 23% 33% 20% 13% 15% 8% Recognition rates for all categories were improved. Recognition rates for Piano, Guitar, Strings: 96.7%

Bayes (18 dim; PCA+LDA) Bayes (79 dim; PCA only) Bayes (18 dim; PCA only) 3-NN (18 dim; PCA+LDA) 3-NN (79 dim; PCA only) 3-NN (18 dim; PCA only) Bayes vs k-nn We adopt PCA+LDA+Bayes achieved the best performance. 18-dimension is better than 79-dimension. # of training data is not enough for 79-dim. The use of LDA improved the performance. LDA considers separation between classes.

Bayes (18 dim; PCA+LDA) Bayes (79 dim; PCA only) Bayes (18 dim; PCA only) 3-NN (18 dim; PCA+LDA) 3-NN (79 dim; PCA only) Bayes vs k-nn We adopt Jain s guideline (1982): 3-NN (18 dim; PCA only) Having 5 to 10 times as many training data as # of dimensions seems to be a good practice. PCA+LDA+Bayes achieved the best performance. 18-dimension is better than 79-dimension. # of training data is not enough for 79-dim. The use of LDA improved the performance. LDA considers separation between classes.

Relationship between training data and dimension 14 dim. (85%) 18 dim. (88%) 20 dim. (89%) 23 dim. (90%) 32 dim. (93%) 41 dim. (95%) 52 dim. (97%) 79 dim. (99%) Hughes s peaking phenomenon At 23-dimension, the performance peaked. Any results without LDA are worse than that with LDA.