Semi-supervised Musical Instrument Recognition

Similar documents
Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Automatic Piano Music Transcription

MUSICAL INSTRUMENT IDENTIFICATION BASED ON HARMONIC TEMPORAL TIMBRE FEATURES

Singer Traits Identification using Deep Neural Network

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

Topics in Computer Music Instrument Identification. Ioanna Karydi

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

Chord Classification of an Audio Signal using Artificial Neural Network

Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Recognising Cello Performers using Timbre Models

CS229 Project Report Polyphonic Piano Transcription

Transcription of the Singing Melody in Polyphonic Music

Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution

Outline. Why do we classify? Audio Classification

Improving Frame Based Automatic Laughter Detection

Subjective Similarity of Music: Data Collection for Individuality Analysis

Automatic Commercial Monitoring for TV Broadcasting Using Audio Fingerprinting

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

Retrieval of textual song lyrics from sung inputs

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam

Audio-Based Video Editing with Two-Channel Microphone

Krzysztof Rychlicki-Kicior, Bartlomiej Stasiak and Mykhaylo Yatsymirskyy Lodz University of Technology

POLYPHONIC INSTRUMENT RECOGNITION USING SPECTRAL CLUSTERING

Effects of acoustic degradations on cover song recognition

hit), and assume that longer incidental sounds (forest noise, water, wind noise) resemble a Gaussian noise distribution.

Neural Network for Music Instrument Identi cation

ON FINDING MELODIC LINES IN AUDIO RECORDINGS. Matija Marolt

Recognising Cello Performers Using Timbre Models

Data-Driven Solo Voice Enhancement for Jazz Music Retrieval

Topic 10. Multi-pitch Analysis

AUTOREGRESSIVE MFCC MODELS FOR GENRE CLASSIFICATION IMPROVED BY HARMONIC-PERCUSSION SEPARATION

Automatic Music Clustering using Audio Attributes

The Human Features of Music.

MUSI-6201 Computational Music Analysis

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

Supervised Learning in Genre Classification

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

Week 14 Music Understanding and Classification

Automatic Laughter Detection

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Automatic Rhythmic Notation from Single Voice Audio Sources

MODELS of music begin with a representation of the

Algorithms for melody search and transcription. Antti Laaksonen

THE importance of music content analysis for musical

TOWARD UNDERSTANDING EXPRESSIVE PERCUSSION THROUGH CONTENT BASED ANALYSIS

Experiments on musical instrument separation using multiplecause

Lecture 9 Source Separation

An Efficient Low Bit-Rate Video-Coding Algorithm Focusing on Moving Regions

Classification of Musical Instruments sounds by Using MFCC and Timbral Audio Descriptors

Music Information Retrieval Community

FULL-AUTOMATIC DJ MIXING SYSTEM WITH OPTIMAL TEMPO ADJUSTMENT BASED ON MEASUREMENT FUNCTION OF USER DISCOMFORT

An Accurate Timbre Model for Musical Instruments and its Application to Classification

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

Music Genre Classification and Variance Comparison on Number of Genres

Enhancing Music Maps

A CHROMA-BASED SALIENCE FUNCTION FOR MELODY AND BASS LINE ESTIMATION FROM MUSIC AUDIO SIGNALS

Automatic Labelling of tabla signals

Cross-Dataset Validation of Feature Sets in Musical Instrument Classification

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

A Survey on: Sound Source Separation Methods

Query By Humming: Finding Songs in a Polyphonic Database

Hidden Markov Model based dance recognition

ISSN ICIRET-2014

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

MUSICAL INSTRUMENT RECOGNITION USING BIOLOGICALLY INSPIRED FILTERING OF TEMPORAL DICTIONARY ATOMS

Acoustic Scene Classification

AMusical Instrument Sample Database of Isolated Notes

Application Of Missing Feature Theory To The Recognition Of Musical Instruments In Polyphonic Audio

Methods for the automatic structural analysis of music. Jordan B. L. Smith CIRMMT Workshop on Structural Analysis of Music 26 March 2010

ON THE USE OF PERCEPTUAL PROPERTIES FOR MELODY ESTIMATION

Speech Recognition Combining MFCCs and Image Features

Sarcasm Detection in Text: Design Document

Music Radar: A Web-based Query by Humming System

Music Information Retrieval with Temporal Features and Timbre

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

Automatic Music Genre Classification

Melody Retrieval On The Web

Predicting Time-Varying Musical Emotion Distributions from Multi-Track Audio

Introductions to Music Information Retrieval

Recommending Music for Language Learning: The Problem of Singing Voice Intelligibility

Musical instrument identification in continuous recordings

A CLASSIFICATION APPROACH TO MELODY TRANSCRIPTION

Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation

PHYSICS OF MUSIC. 1.) Charles Taylor, Exploring Music (Music Library ML3805 T )

Popular Song Summarization Using Chorus Section Detection from Audio Signal

Efficient Vocal Melody Extraction from Polyphonic Music Signals

Multipitch estimation by joint modeling of harmonic and transient sounds

ABSOLUTE OR RELATIVE? A NEW APPROACH TO BUILDING FEATURE VECTORS FOR EMOTION TRACKING IN MUSIC

Measurement of overtone frequencies of a toy piano and perception of its pitch

Singer Recognition and Modeling Singer Error

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

Automatic music transcription

PRODUCTION MACHINERY UTILIZATION MONITORING BASED ON ACOUSTIC AND VIBRATION SIGNAL ANALYSIS

Pitch correction on the human voice

Music Genre Classification

Transcription:

Semi-supervised Musical Instrument Recognition Master s Thesis Presentation Aleksandr Diment 1 1 Tampere niversity of Technology, Finland Supervisors: Adj.Prof. Tuomas Virtanen, MSc Toni Heittola 17 May 2013

Outline Semi-supervised instrument recognition A. Diment 17.5.2013 2/25

Musical instrument recognition Semi-supervised learning Objectives and main results Semi-supervised instrument recognition A. Diment 17.5.2013 3/25

Introduction Musical instrument recognition Musical instrument recognition Semi-supervised learning Objectives and main results Music information retrieval: obtaining information of various kinds from music. Situationally tailored playlisting, personalised radio etc. Instrument recognition automatic music database annotation; automatic music transcription; musical genre classification. Semi-supervised instrument recognition A. Diment 17.5.2013 4/25

Introduction Semi-supervised learning Musical instrument recognition Semi-supervised learning Objectives and main results Traditional learning paradigms: unsupervised: no additional knowledge about the data samples supervised: a label is assigned to each data sample Supervised: large amounts of annotated training data needed. SSL: only part of training data needs to be annotated. Not yet applied for instrument recognition. Semi-supervised instrument recognition A. Diment 17.5.2013 5/25

Introduction Objectives and main results Musical instrument recognition Semi-supervised learning Objectives and main results Objectives: studying techniques for musical instrument recognition; studying various SSL schemes; Main results: developed pattern recognition system for instrument recognition with SSL; two algorithms are implemented. Semi-supervised instrument recognition A. Diment 17.5.2013 6/25

Building blocks Feature extraction Training algorithms Labelled data weighting One-class-at-a-time training Semi-supervised instrument recognition A. Diment 17.5.2013 7/25

System description Building blocks Input data Building blocks Feature extraction Training algorithms Labelled data weighting One-class-at-a-time training Preprocessing Feature extraction Training set Features Testing set Training Models Classification Decision Figure 1: A block diagram of a typical pattern classification system. Semi-supervised instrument recognition A. Diment 17.5.2013 8/25

System description Feature extraction Building blocks Feature extraction Training algorithms Labelled data weighting One-class-at-a-time training Relevant information is within the timbre, unique to a particular group of instruments. Set of tonal qualities which characterise a particular musical sound. Everything except pitch, loudness and duration. Semi-supervised instrument recognition A. Diment 17.5.2013 9/25

System description Feature extraction Input signal Pre-emphasis Building blocks Feature extraction Training algorithms Labelled data weighting One-class-at-a-time training Frame blocking and windowing FFT 2 Log DCT Static coefficients d dt Delta coefficients Figure 2: A block diagram of MFCCs calculation. Semi-supervised instrument recognition A. Diment 17.5.2013 10/25

System description Training algorithms Building blocks Feature extraction Training algorithms Labelled data weighting One-class-at-a-time training The EM algorithm MLEs of model parameters when treating observations as incomplete data. sed when obtaining direct equations for the model is impossible. Extending EM for SSL: the iterative and incremental EM-based algorithms 1. 1 P. J. Moreno and S. Agarwal, An experimental study of EM-based algorithms for semi-supervised learning in audio classification, in Proc. of the ICML-2003 Workshop on the Continuum from Labeled to nlabeled Data, 2003. Semi-supervised instrument recognition A. Diment 17.5.2013 11/25

training sample from the unlabelled dataset L training sample from the labelled dataset Initial iteration Iteration t = 1 Iteration t = 2... Classification Training dataset Piano L Guitar L Piano L Guitar L Piano L Guitar L Training... Models nlabelled samples Labelled samples, piano Labelled samples, guitar nlabelled samples classified as piano nlabelled samples classified as guitar Figure 3: Schematic and feature space representation of the training stage with three iterations of the iterative EM-based algorithm.

training sample from the unlabelled dataset L training sample from the labelled dataset Initial iteration Iteration t = 1 Iteration t = 2... Classification Training dataset Piano L Guitar L Piano L Guitar L Piano L Guitar L Training... Models nlabelled samples Labelled samples, piano Labelled samples, guitar nlabelled samples classified as piano nlabelled samples classified as guitar Figure 4: Schematic and feature space representation of the training stage with three iterations of the incremental EM-based algorithm.

System description Labelled data weighting The algorithms improve the performance in case the initial labelled data size is relatively low. Building blocks Feature extraction Training algorithms Labelled data weighting One-class-at-a-time training Not much difference when adding large or small amount of unlabelled data. de-weight the contribution of the unlabelled data. Simply: S l = ω(t) S l. Semi-supervised instrument recognition A. Diment 17.5.2013 14/25

System description One-class-at-a-time training Another issue of the iterative algorithm: resulting classification accuracy is oscillating along the iteration axis. Building blocks Feature extraction Training algorithms Labelled data weighting One-class-at-a-time training 100 Accuracy, % 90 80 0 5 10 15 20 Iteration Semi-supervised instrument recognition A. Diment 17.5.2013 15/25

System description One-class-at-a-time training One-class-at-a time approach: at an iteration only one class is retrained. Building blocks Feature extraction Training algorithms Labelled data weighting One-class-at-a-time training previous models of one class are unaffected by training another one fewer peaks safer to chose an arbitrary iteration index for termination. Semi-supervised instrument recognition A. Diment 17.5.2013 16/25

Datasets Baseline results Results with the iterative EM-based algorithm Semi-supervised instrument recognition A. Diment 17.5.2013 17/25

Evaluation Datasets Datasets Baseline results Results with the iterative EM-based algorithm RWC Music Database 2. Three variations per instrument. Separate notes within the whole range with a step of a semitone. 44.1 khz, 16 bit. 2 M. Goto et al., RWC music database: music genre database and musical instrument sound database, in Proc. of the 4th Int. Conf. on Music Information Retrieval (ISMIR), 2003, pp. 229 230. Semi-supervised instrument recognition A. Diment 17.5.2013 18/25

Evaluation Datasets Table 1: List of instruments and number of recordings of the notes used in the smaller and larger sets, respectively. Datasets Baseline results Results with the iterative EM-based algorithm Instrument # notes Instrument # notes Acoustic Guitar 702 Electric Guitar 702 Tuba 270 Bassoon 360 Total 2 212 Pianoforte 792 Classic Guitar 702 Electric Guitar 702 Electric Bass 507 Trombone 278 Tuba 270 Horn 288 Bassoon 360 Clarinet 360 Banjo 941 Total 5 200 Semi-supervised instrument recognition A. Diment 17.5.2013 19/25

Evaluation Baseline results Datasets Baseline results Results with the iterative EM-based algorithm Baseline scenario: treating all available data as labelled. pper bound for possible SSL performance. Table 2: Average classification accuracy across all classes in fully-supervised case. Instrument set size Recognition accuracy, % 4 instruments 92.1 10 instruments 82.9 Semi-supervised instrument recognition A. Diment 17.5.2013 20/25

Evaluation Results with the iterative EM-based algorithm Datasets Baseline results Results with the iterative EM-based algorithm Table 3: Classification results with the iterative EM-based SSL algorithm when incorporating both modifications. Instrument set size Recognition accuracy, % initial maximum absolute gain relative gain 4 instruments 89.19 95.30 6.11 6.85 10 instruments 58.73 68.43 9.70 14.18 Semi-supervised instrument recognition A. Diment 17.5.2013 21/25

100 90 Accuracy, % Datasets Baseline results Results with the iterative EM-based algorithm 80 Initial approach One class at a time Labelled data weighting Both approaches 0 2 4 6 8 10 12 14 16 18 20 (Macro)iteration Figure 5: Comparison of the modifications to the iterative algorithm with their combination and the initial version, smaller instrument set. Semi-supervised instrument recognition A. Diment 17.5.2013 22/25

Accuracy, % 90 Datasets Baseline results Results with the iterative EM-based algorithm 80 Initial accuracy Final accuracy 5 6 8 10 15 20 30 100 labelled/total dataset sizes ratio, % Figure 6: Comparison of the accuracies of the initial models and the final iteration of the incremental EM-based SSL algorithm as a function of the relative labelled dataset size with the smaller instrument set. Semi-supervised instrument recognition A. Diment 17.5.2013 23/25

Semi-supervised instrument recognition A. Diment 17.5.2013 24/25

Conclusions SSL for instrument recognition works. Gain 9.7%; as little as 7% data needs to be annotated; proposed extensions simplify termination and increase accuracy. Future work: more complex scenarios (more instruments, noise, reverberation... ); neighbouring problems. Semi-supervised instrument recognition A. Diment 17.5.2013 25/25