MUSI-6201 Computational Music Analysis

1 MUSI-6201 Computational Music Analysis, Part 9.1: Genre Classification. Alexander Lerch, November 4, 2015

2 overview
  text book: Chapter 8: Musical Genre, Similarity, and Mood
  sources: slides (latex) & Matlab github repository
  lecture content:
  - definition of musical genre
  - typical features and feature categories
  - simple classifiers and basic classifier properties

6 introduction
  - one of the oldest research topics in MIR
  - classic machine learning task
  - related fields: speech-music classification, instrument recognition, artist identification, music emotion recognition

9 applications
  - large music databases: annotation, sorting, browsing, retrieving
  - recommendation systems
  - automatic playlist generation
  - mashup generation

17 genre: definition
  what is musical genre? clusters of musical similarity?
  hard to answer in general; there are many systematic problems:
  1 non-agreement on taxonomies
  2 genre label scope: song, album, artist, piece of a song
  3 ill-defined genre labels: geographic (Indian music), historic (baroque), technical (barbershop), instrumentation (symphonic music), usage (Christmas songs)
  4 taxonomy scalability: genres and subgenres evolve over time
  5 non-orthogonality: several genres for one piece of music

18 genre: taxonomy examples (two tree diagrams)
  example 1:
  - Speech: Male, Female, Sports
  - Music: Disco, Country, Hip Hop, Rock, Blues, Reggae, Pop, Metal, Classical (Choir, Orchestra, Piano, String Quartet), Jazz (Big Band, Cool, Fusion, Piano, Quartet, Swing)
  example 2:
  - Background
  - Speech: Male, Female, +Background
  - Music: Classical (Chamber: Piano Solo, String Quartet, Other; Orchestra: Symphonic, +Choir, +Soloist), Non-Classical (Rock: Soft Rock, Hard Rock; Electro/Pop: Hip Hop, Techno/Dance, Pop; Jazz/Blues)

19 observations with humans
  1 human classification far from perfect: % for limited set of classes
  2 for many genres, humans need only a fraction of a second to classify
    -> short time timbre features sufficient?
  plots from [1], [2]
  [1] S. Lippens, J.-P. Martens, T. D. Mulder, et al., A Comparison of Human and Automatic Musical Genre Classification, in Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Montreal.
  [2] R. O. Gjerdingen and D. Perrott, Scanning the Dial: The Rapid Recognition of Music Genres, Journal of New Music Research, vol. 37, no. 2, Jun. 2008.

21 overview
  Audio Signal -> Feature Extraction -> Classification -> Genre Label
  1 feature extraction: dimensionality reduction, meaningful representation
  2 classification: map or convert feature to comprehensible domain

23 feature categories
  high level similarities?
  - melody, hook lines, bass lines, harmony progression
  - rhythm & tempo
  - structure
  - instrumentation & timbre
  - ...
  technical feature categories: tonal, technical, timbral, temporal, intensity
  extracted features should
  - be extractable (not: time envelope in polyphonic signals)
  - be relevant (not: pitch chroma for instrument ID)
  - be non-redundant
  - have discriminative power (robust to noise)

26 instantaneous features
  - spectral features (timbre): Spectral Centroid, MFCCs, Spectral Flux, ...
  - pitch features (tonal): pitch chroma distribution/change, ...
  - rhythm features (temporal): onset density, beat histogram features, ...
  - statistical features (technical): standard deviation, skewness, zero crossings, ...
  - intensity features: level variation, number of pauses, ...

37 feature extraction
  1 extract instantaneous features
  2 compute derived features (derivative, filtered)
  3 compute long term features & subfeatures per texture window
  4 compute subfeatures per file
  5 normalize subfeatures
  6 (select or) transform subfeatures
  7 feature vector -> classifier input
  (plot: music and speech files; axes: mean spectral centroid, std rms)
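
Steps 3 to 5 of the feature extraction chain can be sketched in Python as follows. This is only an illustration, not the course's Matlab code: the function names, the choice of mean and standard deviation as subfeatures, the averaging over texture windows, and the z-score normalization are all assumptions made for the example.

```python
import statistics

def texture_window_subfeatures(series, window, hop):
    """Step 3: mean and standard deviation of a feature series per texture window."""
    out = []
    for start in range(0, len(series) - window + 1, hop):
        chunk = series[start:start + window]
        out.append((statistics.fmean(chunk), statistics.pstdev(chunk)))
    return out

def file_subfeatures(window_subfeatures):
    """Step 4: average each per-window subfeature over the whole file (assumed aggregation)."""
    columns = list(zip(*window_subfeatures))
    return [statistics.fmean(col) for col in columns]

def zscore_normalize(vectors):
    """Step 5: zero mean and unit standard deviation per feature dimension across files."""
    columns = list(zip(*vectors))
    means = [statistics.fmean(c) for c in columns]
    # a constant feature has zero deviation; divide by 1.0 instead to avoid a division by zero
    stds = [statistics.pstdev(c) or 1.0 for c in columns]
    return [[(x - m) / s for x, m, s in zip(v, means, stds)] for v in vectors]
```

In practice the normalization constants would be estimated on the training set only and then reused for the test set.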

38 long term features 1/2
  derived from beat histogram [3]
  [3] G. Tzanetakis and P. Cook, Musical genre classification of audio signals, Transactions on Speech and Audio Processing, vol. 10, no. 5, Jul. 2002.

39 long term features 2/2
  derived from pitch histogram or pitch chroma [4]
  [4] G. Tzanetakis, A. Ermolinskyi, and P. Cook, Pitch Histograms in Audio and Symbolic Music Information Retrieval, in Proceedings of the 3rd International Conference on Music Information Retrieval (ISMIR), Paris, 2002.

40 additional feature examples
  - stereo features: mid channel energy vs. side channel energy, spectral channel differences
  - features at higher semantic levels: tempo, structure, harmonic complexity, instrumentation
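
As a rough illustration of the first stereo feature, the sketch below computes mid and side channel energies from two sample lists. The function name is hypothetical, and the definitions mid = (L + R) / 2 and side = (L - R) / 2 are one common convention assumed here; the slide does not specify them.

```python
def mid_side_energies(left, right):
    """Return (mid energy, side energy) for two equally long channels."""
    mid = [(l + r) / 2 for l, r in zip(left, right)]    # assumed mid definition
    side = [(l - r) / 2 for l, r in zip(left, right)]   # assumed side definition
    mid_energy = sum(m * m for m in mid)
    side_energy = sum(s * s for s in side)
    return mid_energy, side_energy
```

A mono recording played on both channels has zero side energy, while a wide stereo mix carries a larger share of its energy in the side channel.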

42 classification: general steps
  1 define training set: annotated results
  2 normalize training set
  3 train classifier
  4 evaluate classifier with test set
  5 (adjust classifier settings, return to 4.)

47 classifier: rules of thumb
  training set
  - training set size vs. number of features:
    training set too small -> overfitting
    feature number too large -> overfitting
  - training set too noisy -> underfitting
  - training set not representative -> bad classification performance
  classifier
  - poor classifier -> bad classification performance -> different classifier
  features
  - poor features -> bad classification performance -> feature selection; new, better features
  - features not normalized -> possibly bad classification performance -> feature range, feature mean, feature distribution

53 classifier: evaluation
  define test set for evaluation
  - test set different from training set
  - otherwise, same requirements
  example: N-fold cross validation
  1 split training set into N parts (randomly, but preferably identical number per class)
  2 select one part as test set
  3 train the classifier with all observations from remaining N-1 parts
  4 compute the classification rate for the test set
  5 repeat until all N parts have been tested
  6 overall result: average classification rate
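
The six cross validation steps can be sketched in Python as below. The function names and the `train_and_predict` callback are hypothetical and stand in for any classifier; the sketch also assumes that every class has at least N observations so that no part is empty.

```python
import random
from collections import defaultdict

def stratified_folds(labels, n_folds, seed=0):
    """Step 1: split indices into n_folds parts with similar class counts per part."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    folds = [[] for _ in range(n_folds)]
    for indices in by_class.values():
        rng.shuffle(indices)
        for pos, idx in enumerate(indices):
            folds[pos % n_folds].append(idx)
    return folds

def cross_validate(features, labels, n_folds, train_and_predict):
    """Steps 2 to 6: average classification rate over all parts.

    train_and_predict(train_x, train_y, test_x) is a hypothetical callback that
    trains on the first two arguments and returns predicted labels for test_x.
    """
    rates = []
    for test_idx in stratified_folds(labels, n_folds):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(labels)) if i not in held_out]
        predicted = train_and_predict(
            [features[i] for i in train_idx],
            [labels[i] for i in train_idx],
            [features[i] for i in test_idx],
        )
        correct = sum(p == labels[i] for p, i in zip(predicted, test_idx))
        rates.append(correct / len(test_idx))
    return sum(rates) / len(rates)
```

Splitting class by class is one way to get the "identical number per class" that step 1 asks for.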

65 classifier: kNN
  training: extract reference vectors from training set (keep class labels)
  classification: extract test vector and set class to majority of k nearest reference vectors
  classifier data: all training vectors
  (plots for k = 3, k = 5, k = 7)
  matlab source: matlab/displayknn.m
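
A minimal Python sketch of the kNN rule described above. It is not a port of the referenced Matlab script; the function name and the use of Euclidean distance are assumptions.

```python
import math
from collections import Counter

def knn_classify(reference_vectors, reference_labels, test_vector, k=3):
    """Return the majority label among the k reference vectors nearest to test_vector."""
    # "training" is only storing the labeled reference vectors; all work happens here
    distances = sorted(
        (math.dist(ref, test_vector), label)
        for ref, label in zip(reference_vectors, reference_labels)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]
```

An odd k avoids tied votes in a two-class problem such as speech vs. music.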

68 classifier: GMM
  training: build model of each class distribution as superposition of Gaussian distributions
  classification: compute output of each Gaussian and select class with highest probability
  classifier data: per class per Gaussian: µ and covariance, mixture weight
  matlab source: matlab/displaygmm.m
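
The classification step of a GMM classifier might look like the following Python sketch. It assumes the mixture parameters are already trained (training, typically done with expectation maximization, is not shown), restricts the covariance to a diagonal for brevity, and treats all classes as equally likely a priori. All names are hypothetical.

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of a Gaussian with diagonal covariance (assumption: no cross terms)."""
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def gmm_log_likelihood(x, components):
    """Log of sum_k w_k * N(x; mean_k, var_k); components is a list of (w, mean, var)."""
    logs = [math.log(w) + log_gaussian_diag(x, mean, var) for w, mean, var in components]
    peak = max(logs)  # log-sum-exp for numerical stability
    return peak + math.log(sum(math.exp(value - peak) for value in logs))

def gmm_classify(x, class_models):
    """Select the class whose mixture model gives x the highest likelihood."""
    return max(class_models, key=lambda c: gmm_log_likelihood(x, class_models[c]))
```

The data stored per class is exactly the list named on the slide: one mean, one covariance, and one mixture weight per Gaussian.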

69 classifier: SVM (support vector machine)
  training:
  - map features to high dimensional space
  - find separating hyperplane (linear classification) through maximum distance of support vectors (data points)
  classification: apply feature transform and proceed with linear classification
  classifier data: support vectors, kernel, kernel parameters
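
To make the "classifier data" concrete, the sketch below evaluates a two-class SVM decision for one test vector. It assumes an RBF kernel and already trained parameters (support vectors, per-vector coefficients, bias); the training optimization itself is not shown, and all names are hypothetical.

```python
import math

def rbf_kernel(a, b, gamma):
    """RBF kernel exp(-gamma * ||a - b||^2); the kernel choice is an assumption."""
    return math.exp(-gamma * sum((x - y) ** 2 for x, y in zip(a, b)))

def svm_decision(test_vector, support_vectors, coefficients, bias, gamma):
    """Return +1 or -1 from sum_i c_i * K(s_i, v) + bias.

    coefficients[i] is the product of the trained weight and the class label
    (+1 or -1) of support vector i.
    """
    score = bias + sum(
        c * rbf_kernel(s, test_vector, gamma)
        for s, c in zip(support_vectors, coefficients)
    )
    return 1 if score >= 0 else -1
```

The kernel replaces the explicit mapping to the high dimensional space: the decision stays linear in that space but only needs kernel evaluations against the stored support vectors.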

72 results
  classification results depend on training set, test set, and number of classes
  typical ranges: 10 classes: 50-80%
  note: results vary largely between datasets
  - ill-defined genre boundaries
  - non-uniformly distributed classes
  - overfitting through songs from same album or artist
  - ...

75 speech/music classification
  baseline example
  1 extract features
  2 represent each file with its 2-dimensional feature vector
  3 kNN to classify unknown audio files
  4 evaluate classification performance

76 speech/music classification example: features 1/2
  for each audio file
  1 split input signal into (overlapping) blocks
  2 compute 2 feature series (spectral centroid, RMS)
  3 aggregate feature series to one value each
    mean of Spectral Centroid:
      \mu_{SC} = \frac{1}{N} \sum_{n} v_{SC}(n)
    standard deviation of RMS:
      \sigma_{RMS} = \sqrt{\frac{1}{N} \sum_{n} \left( v_{RMS}(n) - \mu_{RMS} \right)^2}
  4 represent each file as 2-dimensional vector (\mu_{SC}, \sigma_{RMS})^T
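
The four steps can be sketched in dependency-free Python as below. The block and hop sizes, the function names, and the magnitude-weighted centroid definition are assumptions for illustration (other definitions weight by the power spectrum), and the naive DFT is far too slow for real audio files.

```python
import cmath
import math
import statistics

def split_blocks(signal, block_size, hop_size):
    """Step 1: overlapping blocks (overlap whenever hop_size < block_size)."""
    return [signal[i:i + block_size]
            for i in range(0, len(signal) - block_size + 1, hop_size)]

def rms(block):
    return math.sqrt(sum(x * x for x in block) / len(block))

def spectral_centroid(block, sample_rate):
    """Magnitude-weighted mean frequency in Hz, computed with a naive DFT."""
    n = len(block)
    mags = [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(block)))
        for k in range(n // 2 + 1)
    ]
    total = sum(mags)
    if total == 0:
        return 0.0  # silent block: centroid undefined, 0 Hz chosen by convention here
    return sum(k * sample_rate / n * m for k, m in enumerate(mags)) / total

def file_feature_vector(signal, sample_rate, block_size=64, hop_size=32):
    """Steps 2 to 4: (mean spectral centroid, standard deviation of RMS)."""
    blocks = split_blocks(signal, block_size, hop_size)
    v_sc = [spectral_centroid(b, sample_rate) for b in blocks]
    v_rms = [rms(b) for b in blocks]
    # pstdev divides by N, matching the 1/N in the formula for sigma_RMS
    return statistics.fmean(v_sc), statistics.pstdev(v_rms)
```

For a stationary sine tone the RMS barely changes from block to block, so \sigma_{RMS} is close to zero; speech, with its pauses between words and syllables, tends to produce a larger value.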

79 speech/music classification example: features 2/2
  (scatter plot of music and speech files; axes: mean spectral centroid, std rms)
  matlab source: matlab/displayscatter.m

80 speech/music classification example: training set
  use dataset annotated as speech and music
  requirements:
  - large compared to number of features
  - representative for use case (diverse)
  here: 110 speech files, 119 music files
  extract the features for the dataset

81 speech/music classification example: results (kNN)
  confusion matrix (speech vs. music, # files)
  classification rate: 84.2%
  single feature classification results:
  - Spectral Centroid: 56.7%
  - RMS: 85.1%
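
How a classification rate follows from a confusion matrix can be shown with a short Python sketch. The counts below are made up purely for illustration and are not the values from the slide; the function name is hypothetical.

```python
def classification_rate(confusion):
    """confusion[i][j]: number of files of true class i classified as class j."""
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    total = sum(sum(row) for row in confusion)
    return correct / total

# hypothetical counts for illustration only (not the values from the slide)
toy_confusion = [
    [8, 2],  # true speech: 8 classified as speech, 2 as music
    [1, 9],  # true music: 1 classified as speech, 9 as music
]
print(classification_rate(toy_confusion))
```

The rate is the sum of the main diagonal divided by the total number of files, so it hides which class is confused more often; the off-diagonal entries carry that information.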

84 summary: lecture content
  1 name three possible problems in the definition of the ground truth for genre classification
  2 is it possible for genre classifiers to yield better accuracy than human experts?
  3 list the feature processing steps from audio to the input of the classifier


More information

Topics in Computer Music Instrument Identification. Ioanna Karydi

Topics in Computer Music Instrument Identification. Ioanna Karydi Topics in Computer Music Instrument Identification Ioanna Karydi Presentation overview What is instrument identification? Sound attributes & Timbre Human performance The ideal algorithm Selected approaches

More information

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes hello Jay Biernat Third author University of Rochester University of Rochester Affiliation3 words jbiernat@ur.rochester.edu author3@ismir.edu

More information
