Melody classification using patterns

Similar documents
STRING QUARTET CLASSIFICATION WITH MONOPHONIC MODELS

CHORD SEQUENCE PATTERNS IN OWL

A Pattern Recognition Approach for Melody Track Selection in MIDI Files

Outline. Why do we classify? Audio Classification

Automatic Polyphonic Music Composition Using the EMILE and ABL Grammar Inductors *

Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset

Content-based Indexing of Musical Scores

Characteristics of Polyphonic Music Style and Markov Model of Pitch-Class Intervals

Chapter 15 Contrast Pattern Mining in Folk Music Analysis

CALCULATING SIMILARITY OF FOLK SONG VARIANTS WITH MELODY-BASED FEATURES

Creating a Feature Vector to Identify Similarity between MIDI Files

Music Information Retrieval Using Audio Input

Detecting Musical Key with Supervised Learning

BayesianBand: Jam Session System based on Mutual Prediction by User and System

Automated extraction of motivic patterns and application to the analysis of Debussy s Syrinx

ABSOLUTE OR RELATIVE? A NEW APPROACH TO BUILDING FEATURE VECTORS FOR EMOTION TRACKING IN MUSIC

Singer Traits Identification using Deep Neural Network

Melody Retrieval On The Web

Harmonising Melodies: Why Do We Add the Bass Line First?

Tool-based Identification of Melodic Patterns in MusicXML Documents

GENRE CLASSIFICATION USING HARMONY RULES INDUCED FROM AUTOMATIC CHORD TRANSCRIPTIONS

CPU Bach: An Automatic Chorale Harmonization System

Arts Education Essential Standards Crosswalk: MUSIC A Document to Assist With the Transition From the 2005 Standard Course of Study

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

Music Information Retrieval with Temporal Features and Timbre

Pattern Based Melody Matching Approach to Music Information Retrieval

A COMPARATIVE EVALUATION OF ALGORITHMS FOR DISCOVERING TRANSLATIONAL PATTERNS IN BAROQUE KEYBOARD WORKS

A wavelet-based approach to the discovery of themes and sections in monophonic melodies Velarde, Gissel; Meredith, David

Searching digital music libraries

A COMPARISON OF STATISTICAL AND RULE-BASED MODELS OF MELODIC SEGMENTATION

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

A probabilistic approach to determining bass voice leading in melodic harmonisation

Statistical Modeling and Retrieval of Polyphonic Music

Probabilist modeling of musical chord sequences for music analysis

Actas del WSRFAI 2013

Take a Break, Bach! Let Machine Learning Harmonize That Chorale For You. Chris Lewis Stanford University

Predicting Time-Varying Musical Emotion Distributions from Multi-Track Audio

Towards the Generation of Melodic Structure

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Lyrics Classification using Naive Bayes

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

TREE MODEL OF SYMBOLIC MUSIC FOR TONALITY GUESSING

Subjective Similarity of Music: Data Collection for Individuality Analysis

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Empirical Musicology Review Vol. 11, No. 1, 2016

Some Experiments in Humour Recognition Using the Italian Wikiquote Collection

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

A Music Retrieval System Using Melody and Lyric

STYLE RECOGNITION THROUGH STATISTICAL EVENT MODELS

Comparison of Dictionary-Based Approaches to Automatic Repeating Melody Extraction

Automatic Rhythmic Notation from Single Voice Audio Sources

Mood Tracking of Radio Station Broadcasts

Analysis of local and global timing and pitch change in ordinary

Audio Feature Extraction for Corpus Analysis

Multimodal Music Mood Classification Framework for Christian Kokborok Music

Music Composition with RNN

Probabilistic and Logic-Based Modelling of Harmony

Music Genre Classification and Variance Comparison on Number of Genres

MUSIR A RETRIEVAL MODEL FOR MUSIC

A CLASSIFICATION APPROACH TO MELODY TRANSCRIPTION

Semi-supervised Musical Instrument Recognition

arxiv: v1 [cs.ir] 20 Mar 2019

LSTM Neural Style Transfer in Music Using Computational Musicology

jsymbolic 2: New Developments and Research Opportunities

PITCH CLASS SET CATEGORIES AS ANALYSIS TOOLS FOR DEGREES OF TONALITY

AN EMOTION MODEL FOR MUSIC USING BRAIN WAVES

MUSICAL STRUCTURAL ANALYSIS DATABASE BASED ON GTTM

Extracting Significant Patterns from Musical Strings: Some Interesting Problems.

SIMSSA DB: A Database for Computational Musicological Research

Feature-Based Analysis of Haydn String Quartets

Music Radar: A Web-based Query by Humming System

Exploring the Design Space of Symbolic Music Genre Classification Using Data Mining Techniques Ortiz-Arroyo, Daniel; Kofod, Christian

Harmonic Generation based on Harmonicity Weightings

Can Song Lyrics Predict Genre? Danny Diekroeger Stanford University

Ensemble of state-of-the-art methods for polyphonic music comparison

N-GRAM-BASED APPROACH TO COMPOSER RECOGNITION

MELODY CLASSIFICATION USING A SIMILARITY METRIC BASED ON KOLMOGOROV COMPLEXITY

Pattern Recognition Approach for Music Style Identification Using Shallow Statistical Descriptors

Perceptual Evaluation of Automatically Extracted Musical Motives

An Introduction to Deep Image Aesthetics

Release Year Prediction for Songs

Modeling memory for melodies

CS229 Project Report Polyphonic Piano Transcription

A Model of Musical Motifs

Curriculum Standard One: The student will listen to and analyze music critically, using the vocabulary and language of music.

A Model of Musical Motifs

arxiv: v1 [cs.ir] 16 Jan 2019

MODELING HARMONY WITH SKIP-GRAMS

IMPROVING GENRE CLASSIFICATION BY COMBINATION OF AUDIO AND SYMBOLIC DESCRIPTORS USING A TRANSCRIPTION SYSTEM

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

CLASSIFICATION OF MUSICAL METRE WITH AUTOCORRELATION AND DISCRIMINANT FUNCTIONS

A Basis for Characterizing Musical Genres

Automatic Reduction of MIDI Files Preserving Relevant Musical Content

Discovering Sequential Association Rules with Constraints and Time Lags in Multiple Sequences

NEW QUERY-BY-HUMMING MUSIC RETRIEVAL SYSTEM CONCEPTION AND EVALUATION BASED ON A QUERY NATURE STUDY

Separating Voices in Polyphonic Music: A Contig Mapping Approach

Krzysztof Rychlicki-Kicior, Bartlomiej Stasiak and Mykhaylo Yatsymirskyy Lodz University of Technology

Exploring Relationships between Audio Features and Emotion in Music

ASSOCIATIONS BETWEEN MUSICOLOGY AND MUSIC INFORMATION RETRIEVAL

Transcription:

Melody classification using patterns Darrell Conklin Department of Computing City University London United Kingdom conklin@city.ac.uk Abstract. A new method for symbolic music classification is proposed, based on a feature set pattern representation. An ensemble of sequential patterns is used to classify unseen pieces using a decision list method. On a small corpus of 195 folk song melodies this method achieves a good classification accuracy of 77%, though with only a 43% recall rate. A competing n-gram model achieves a slightly lower accuracy of 75% though with the advantange of 100% recall rate. The new proposed method may be applicable to a wide variety of music classification problems. 1 Introduction Melody classification is an important task in music informatics. The ability to assign unseen melodies to existing classes by composer, region, genre, period, etc. is a powerful inference capability. Methods can be divided into two main types [1]. The global feature approach encapsulates information about a melody into a feature vector, where standard attribute-value data mining methods can be employed. The event feature approach views a piece as a sequence of events, each with its own features. The standard method to deal with such sequential data is the n-gram model and its extension, the multiple viewpoint model [2]. Bridging the global and event methods is the sequential pattern approach, which uses short patterns to classify an unseen melody. Patterns on one hand can be viewed as global boolean features of pieces, but on the other hand they retain some sequential structure of events. A few preliminary studies exist on the use of short sequential patterns for music classification [3 6]. All of these methods use a pattern vocabulary comprising fixed-length n-grams, and most do some selection to reduce the exhaustive pattern set to those patterns that are predictive of the class. The problem with relying on fixed n is that short patterns may not be predictive of any class (may occur with relatively equal frequency in all classes), and long patterns have the danger of overfitting the training corpus, leading to limited matches of the pattern collection to unseen pieces. The use of a collection of (non-sequential) patterns for classification is an ongoing area of research in class association rule mining [7]. Class association rules are simply association rules that are restricted to contain a class attribute in the right-hand side of the rule. Of general interest is how an exhaustive collection

of association rules can be pruned to an ensemble of rules that are together useful for future classification. Also of interest is the development of theory and methods for pruning, ranking and weighting rules to attain a final class prediction from the collection [8]. Ranking of rules is usually based on rule confidence: the probability of the class given the pattern. This paper describes and evaluates a novel classification scheme for music based on feature set patterns. These are sequential patterns, of arbitrary length with arbitrary feature conjunctions within their components, thereby permitting a high degree of abstraction and flexibility. Extending the work presented in [9], a collection of mined feature set patterns is used to classify folk song melodies. For a small trial corpus of Swiss and Austrian folk songs, the proposed method attains 77% classification accuracy, though with only a 43% recall rate, as evaluated by 10-fold cross-validation. 2 Methods A pattern is a sequence of feature sets. Patterns can be viewed as unary predicates: functions from pieces to boolean values. A piece w instantiates a pattern P, written P (w), if the pattern occurs (possibly multiple times) in the sequence: if the components of the pattern are instantiated by successive events in the sequence. For example, the pattern [{pcint:2, dur:3}, {pcint:3}] has two components and occurs at position 5 of the melodic fragment in Figure 1. 4 2 1 2 3 4 5 6 7 8 9 10 11 12 1 2 3 4 5 6 7 8 9 10 11 12 contour(pitch) = = + + = + contour(dur) + = + + = = = + rest 0 0 0 0 0 0 0 0 0 0 0 pcint 0 0 5 2 3 0 9 3 9 10 7 diaintc P1 P1 P4 M2 m3 P1 M6 m3 M6 m7 P5 Fig. 1. Example of a fragment of a Shanxi folk song (Essen folk song database code J0002), and a viewpoint matrix. A pattern P subsumes (is more general than) a pattern Q if all instances of Q are also instances of P :if w Q(w) P (w) is valid. This subsumption relation can be determined by searching for a contiguous, subsumption-preserving mapping from the components of P to the components of Q. For example, the pattern [{pcint : 3}] subsumes the pattern [{pcint : 3, dur : 1}], which in turn subsumes, for example, the pattern [{pcint : 2}, {pcint : 3, dur : 1}, {}] (note the empty feature set in the last component, which is instantiated by any event).

The piece count of a pattern P in a class is the number of pieces containing the pattern and in the class, notated by c (P ). A class association rule is a structure P where P is a pattern and is a class. The confidence of a rule is the quantity p( P ), estimated from counts in the training corpus. For classification tasks, we seek out patterns that have high confidence: p( P ) greater than a specified threshold in the training corpus. The mining algorithm performs a general-to-specific search of a virtual subsumption taxonomy of patterns, terminating a branch with success when a confident rule is reached. The algorithm therefore finds a decision list of patterns that are both confident and maximally general (not subsumed by any other confident pattern). To classify an unseen query piece w, it is probed against each rule in a decision list, looking for the first rule P such that P (w), then predicting the class for w. To mine for a pattern decision list, the following method is used. For each class, the corpus is mined with the specified minimum confidence p( ) and piece count c ( ), using all other classes as the anticorpus. The collection of all results for each class is used to build a final decision list, sorted from high to low confidence. 2.1 Folk song dataset Two geographic regions represented in the Essen folk song database [10] were chosen to illustrate the method: Austrian (102); and Swiss (93). These were chosen as a pair due to their geographic proximity and presumed suitability as anticorpora for one another. The 5 viewpoints presented in Figure 1 were used in the experiment. 3 Results To evaluate the classification method, a 10-fold cross-validation is performed. The order of pieces in the complete corpus is scrambled, then for each fold, 90% of the dataset is used to build a pattern decision list: for each class, split into a corpus and anticorpus and mined for patterns. The remaining 10% of the pieces are then classified using the resulting decision list. In this initial evaluation of the method, a pattern P is retained for the decision list if p(p ) 0.1 (P occurs in at least 10% of the corpus) and p( P )=1 (the rule is maximally confident). The pattern classification method achieves 77% classification accuracy on the complete corpus, though 111 pieces are not classified by a rule, giving a recall rate of only 43%. Only short patterns were necessary to correctly classify instances: length 1 (2 patterns); 2 (34); 3 (10); 4 (16), and 5 (3). To put the results into some context, a zero-rule classifier (always predicting the most frequent class) achieves 102/195 = 52% classification accuracy. A trigram model of linked interval/duration pairs achieves a slightly lower 75% classification accuracy in a 10-fold cross-validation on the complete corpus. A classifier using

99 global features [11] with the Weka [12] logistic regression implementation, a configuration performing well in a recent study of a larger folk song corpus [1], achieves 66% 10-fold cross-validation accuracy. Corpus Pattern diaintc : M3 diaintc : P1 diaintc : P1 restv : 0 diaintc : m7 diaintc : M3 diaintc : P1 diaintc : P1 restv : 0 contour(dur) : contour(pitch) : contour(pitch) : diaintc : M2 contour(pitch) : pcint : 11 diaintc : M7 pcint : 4 diaintc : M3 diaintc : P1 diaintc : P1 diaintc : P1 pcint : 6 diaintc : P1 restv : 0 pcint : 2 diaintc : M2 diaintc : m2 Table 1. The patterns of the decision list trained on the complete corpus. Table 1 shows the pattern of the decision list trained on the complete corpus. 4 Discussion This paper has presented a method for melody classification based on feature set patterns. As reported earlier [9], patterns can have musicological interest in isolation, and this study showed that they can also be viewed as boolean global features for describing music for supervised learning. In this sense, pattern discovery in music can be viewed as a feature selection task. Though initial results are encouraging, the low recall rate of the method does not yet provide a viable classifier. Further work is necessary to increase the recall rate of the method, and also to apply the pattern classification algorithm to several other and larger music corpora.

References 1. Hillewaere, R., Manderick, B., Conklin, D.: Global feature versus event models for folk song classification. In: ISMIR 2009: 10th International Society for Music Information Retrieval Conference, Kobe, Japan (2009) 2. Conklin, D., Witten, I.: Multiple viewpoint systems for music prediction. Journal of New Music Research 24(1) (1995) 51 73 3. Pérez-Sancho, C., Rizo, D., Iñesta, J.M.: Genre classification using chords and stochastic language models. Connection Science 20(2&3) (2009) 145 159 4. Westhead, M., Smaill, A.: Automatic characterisation of musical style. In: Music Education: An AI Approach. Springer-Verlag (1993) 157 170 5. Lin, C.R., Liu, N.H., Wu, Y.H., Chen, A.: Music classification using significant repeating patterns. In: Database Systems for Advanced Applications. Volume 2973. Springer-Verlag (2004) 506 518 6. Sawada, T., Satoh, K.: Composer classification based on patterns of short note sequences. In: Proc AAAI-2000 Workshop on AI and Music, Austin, Texas (2000) 24 27 7. Liu, B., Hsu, W., Ma, Y.: Integrating classification and association rule mining. In: KDD-98. (1998) 80 86 8. Sulzmann, J.N., Fürnkranz, J.: An empirical comparison of techniques for selecting and combining local patterns into a global model. Technical Report Technical Report TUD-KE-2008-03, Technische Universität Darmstadt, Knowledge Engineering Group (2008) 9. Conklin, D.: Discovery of distinctive patterns in music. In: MML08: Machine Learning and Music, Helsinki (2008) to appear in Intelligent Data Analysis. 10. Schaffrath, H.: The Essen Associative Code: A code for folksong analysis. In Selfridge-Field, E., ed.: Beyond MIDI: The Handbook of Musical Codes. The MIT Press (1997) 343 361 11. http://jmir.sourceforge.net/jsymbolic.html 12. http://www.cs.waikato.ac.nz/ml/weka/