A wavelet-based approach to the discovery of themes and sections in monophonic melodies Velarde, Gissel; Meredith, David

Similar documents
Aalborg Universitet. Composition: 3 Piano Pieces. Bergstrøm-Nielsen, Carl. Creative Commons License CC BY-NC 4.0. Publication date: 2017

Perceptual Evaluation of Automatically Extracted Musical Motives

Aalborg Universitet. The Dimension of Seriousness in Moral Education Wiberg, Merete. Publication date: 2007

Music Structure Analysis

Music Radar: A Web-based Query by Humming System

Open Research Online The Open University s repository of research publications and other research outputs

A COMPARISON OF SYMBOLIC SIMILARITY MEASURES FOR FINDING OCCURRENCES OF MELODIC SEGMENTS

Homework 2 Key-finding algorithm

Automated extraction of motivic patterns and application to the analysis of Debussy s Syrinx

Transcription of the Singing Melody in Polyphonic Music

CALCULATING SIMILARITY OF FOLK SONG VARIANTS WITH MELODY-BASED FEATURES

LEARNING AUDIO SHEET MUSIC CORRESPONDENCES. Matthias Dorfer Department of Computational Perception

Aalborg Universitet. Publication date: Document Version Early version, also known as pre-print. Link to publication from Aalborg University

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Chords not required: Incorporating horizontal and vertical aspects independently in a computer improvisation algorithm

CHAPTER 3. Melody Style Mining

Pitch Spelling Algorithms

Citation for published version (APA): Knakkergård, M. (2010). Michel Chion: Film, a sound art. MedieKultur, 48,

jsymbolic and ELVIS Cory McKay Marianopolis College Montreal, Canada

A MULTI-PARAMETRIC AND REDUNDANCY-FILTERING APPROACH TO PATTERN IDENTIFICATION

Pattern Discovery and Matching in Polyphonic Music and Other Multidimensional Datasets

CLASSIFICATION OF MUSICAL METRE WITH AUTOCORRELATION AND DISCRIMINANT FUNCTIONS

Melody classification using patterns

NOTE-LEVEL MUSIC TRANSCRIPTION BY MAXIMUM LIKELIHOOD SAMPLING

From Idea to Realization - Understanding the Compositional Processes of Electronic Musicians Gelineck, Steven; Serafin, Stefania

10 Visualization of Tonal Content in the Symbolic and Audio Domains

Evaluating Melodic Encodings for Use in Cover Song Identification

METHOD TO DETECT GTTM LOCAL GROUPING BOUNDARIES BASED ON CLUSTERING AND STATISTICAL LEARNING

Evaluation of Melody Similarity Measures

Predicting Variation of Folk Songs: A Corpus Analysis Study on the Memorability of Melodies Janssen, B.D.; Burgoyne, J.A.; Honing, H.J.

Reduced complexity MPEG2 video post-processing for HD display

Syddansk Universitet. Rejoinder Noble Prize effects in citation networks Frandsen, Tove Faber ; Nicolaisen, Jeppe

Music Processing Introduction Meinard Müller

Classification of Timbre Similarity

Music Segmentation Using Markov Chain Methods

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

Robert Alexandru Dobre, Cristian Negrescu

A Comparison of Methods to Construct an Optimal Membership Function in a Fuzzy Database System

Subjective Similarity of Music: Data Collection for Individuality Analysis

Toward Evaluation Techniques for Music Similarity

MUSICAL STRUCTURAL ANALYSIS DATABASE BASED ON GTTM

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Kodak Ektapro HS Motion Analyser

Danish independent film, or how to make films without public funding Hansen, Kim Toft

MTO 22.1 Examples: Carter-Ényì, Contour Recursion and Auto-Segmentation

Aalborg Universitet. The Usability Laboratory at Cassiopeia Kjeldskov, Jesper; Skov, Mikael; Stage, Jan. Publication date: 2008

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

jsymbolic 2: New Developments and Research Opportunities

Music Genre Classification and Variance Comparison on Number of Genres

CSC475 Music Information Retrieval

Syddansk Universitet. The data sharing advantage in astrophysics Dorch, Bertil F.; Drachen, Thea Marie; Ellegaard, Ole

Aalborg Universitet. Scaling Analysis of Author Level Bibliometric Indicators Wildgaard, Lorna; Larsen, Birger. Published in: STI 2014 Leiden

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

Content-based Indexing of Musical Scores

CALL FOR PAPERS. standards. To ensure this, the University has put in place an editorial board of repute made up of

Bach-Prop: Modeling Bach s Harmonization Style with a Back- Propagation Network

2 The Tonal Properties of Pitch-Class Sets: Tonal Implication, Tonal Ambiguity, and Tonalness

Singer Traits Identification using Deep Neural Network

Autocorrelation in meter induction: The role of accent structure a)

MTO 18.1 Examples: Ohriner, Grouping Hierarchy and Trajectories of Pacing

Beethoven, Bach, and Billions of Bytes

Algorithms for melody search and transcription. Antti Laaksonen

Perception-Based Musical Pattern Discovery

Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons

CS229 Project Report Polyphonic Piano Transcription

ECG SIGNAL COMPRESSION BASED ON FRACTALS AND RLE

Optical shift register based on an optical flip-flop memory with a single active element Zhang, S.; Li, Z.; Liu, Y.; Khoe, G.D.; Dorren, H.J.S.

2 2. Melody description The MPEG-7 standard distinguishes three types of attributes related to melody: the fundamental frequency LLD associated to a t

Motivic matching strategies for automated pattern extraction

Characteristics of Polyphonic Music Style and Markov Model of Pitch-Class Intervals

mir_eval: A TRANSPARENT IMPLEMENTATION OF COMMON MIR METRICS

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Consonance perception of complex-tone dyads and chords

Multiple instrument tracking based on reconstruction error, pitch continuity and instrument activity

Audio Structure Analysis

SINGING PITCH EXTRACTION BY VOICE VIBRATO/TREMOLO ESTIMATION AND INSTRUMENT PARTIAL DELETION

arxiv: v1 [cs.sd] 8 Jun 2016

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

IMPROVED MELODIC SEQUENCE MATCHING FOR QUERY BASED SEARCHING IN INDIAN CLASSICAL MUSIC

Efficient Vocal Melody Extraction from Polyphonic Music Signals

Extracting Significant Patterns from Musical Strings: Some Interesting Problems.

A repetition-based framework for lyric alignment in popular songs

Exploring the Design Space of Symbolic Music Genre Classification Using Data Mining Techniques Ortiz-Arroyo, Daniel; Kofod, Christian

MidiFind: Fast and Effec/ve Similarity Searching in Large MIDI Databases

Detecting Changes in Music Using Compression

An Integrated Music Chromaticism Model

DISCOVERY OF REPEATED VOCAL PATTERNS IN POLYPHONIC AUDIO: A CASE STUDY ON FLAMENCO MUSIC. Univ. of Piraeus, Greece

Sequential Association Rules in Atonal Music

Sequential Association Rules in Atonal Music

ANALYSIS BY COMPRESSION: AUTOMATIC GENERATION OF COMPACT GEOMETRIC ENCODINGS OF MUSICAL OBJECTS

FINDING COMMUNITY STRUCTURE IN MUSIC GENRES NETWORKS

EIGENVECTOR-BASED RELATIONAL MOTIF DISCOVERY

AN INTEGRATED FRAMEWORK FOR TRANSCRIPTION, MODAL AND MOTIVIC ANALYSES OF MAQAM IMPROVISATION

Representing, comparing and evaluating of music files

CHAPTER 6. Music Retrieval by Melody Style

UWaterloo at SemEval-2017 Task 7: Locating the Pun Using Syntactic Characteristics and Corpus-based Metrics

Matching Components (minidsp_a) Description. 4x Decimation (Stereo) 4x Decimation (Mono) MonoDec4xIn. 2x Decimation (Stereo) 2x Decimation (Mono)

ANNOTATING MUSICAL SCORES IN ENP

ISMIR 2008 Session 2a Music Recommendation and Organization

Transcription:

Aalborg Universitet A wavelet-based approach to the discovery of themes and sections in monophonic melodies Velarde, Gissel; Meredith, David Publication date: 2014 Document Version Accepted author manuscript, peer reviewed version Link to publication from Aalborg University Citation for published version (APA): Velarde, G., & Meredith, D. (2014). A wavelet-based approach to the discovery of themes and sections in monophonic melodies. Abstract from International Symposium on Music Information Retrieval, Taipei, Taiwan, Province of China. General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.? Users may download and print one copy of any publication from the public portal for the purpose of private study or research.? You may not further distribute the material or use it for any profit-making activity or commercial gain? You may freely distribute the URL identifying the publication in the public portal? Take down policy If you believe that this document breaches copyright please contact us at vbn@aub.aau.dk providing details, and we will remove access to the work immediately and investigate your claim. Downloaded from vbn.aau.dk on: maj 03, 2018

A WAVELET-BASED APPROACH TO THE DISCOVERY OF THEMES AND SECTIONS IN MONOPHONIC MELODIES Gissel Velarde Aalborg University gv@create.aau.dk David Meredith Aalborg University dave@create.aau.dk ABSTRACT We present the computational method submitted to the MIREX 2014 Discovery of Repeated Themes & Sections task, and the results on the monophonic version of the JKU Patterns Development Database. In the context of pattern discovery in monophonic music, the idea behind our method is that, with a good melodic structure in terms of segments, it should be possible to gather similar segments into clusters and rank their salience within the piece. We present an approach to this problem and how we address it. In general terms, we represent melodies either as raw 1D pitch signals or as these signals filtered with the continuous wavelet transform (CWT) using the Haar wavelet. We then segment the signal either into constant duration segments or at the resulting coefficients modulus local maxima. Segments are concatenated based on their contiguous city-block distance. The concatenated segments are compared using city-block distance and clustered using an agglomerative hierarchical cluster tree. Finally, clusters are ranked according the sum of the length of segments occurrences. We present the results of our method on the JKU Patterns Development Database. 1. INTRODUCTION We present the computational method 1 submitted to the MIREX 2014 Discovery of Repeated Themes & Sections task, and the results on the monophonic version of the JKU Patterns Development Database 2. In the context of pattern discovery of monophonic pieces, the idea behind our method is that, with a good melodic structure in terms of segments, it should be possible to gather together similar segments to rank their salience within the piece (See paradigmatic analysis [3]). We also consider other aspects of the problem, in particular, representation, segmentation, measuring similarity, clustering of segments and ranking segments according to salience. This document is licensed under the Creative Commons Attribution-Noncommercial-Share Alike 3.0 License. http://creativecommons.org/licenses/by-nc-sa/3.0/ 2014 The Authors 1 The algorithm is implemented in MATLAB (R2013b, The Mathworks,Inc), using the following toolboxes: Signal Processing, Statistics, Symbolic Math, Wavelet, and the MIDI Toolbox (Eerola & Toiviainen, 2004). 2 https://dl.dropbox.com/u/11997856/jku/jkupdd-noaudio- Aug2013.zip. Accessed 12 May 2014 In the context of this MIREX task, a good melodic structure is considered to be one that is closer to the ground truth analysis, which specifies certain patterns identified by expert analysts as being important or noticeable. These patterns may be nested or hierarchically related (see [1]). We use an agglomerative technique to cluster segments by similarity. Clusters are then ranked according to a perceptually motivated criterion. 2. METHOD The method follows and extends our previously reported approach to melodic segmentation and classification based on filtering with the Haar wavelet [4] and uses some ideas from a generic motif discovery algorithm for sequential data [2]. It follows [4] in terms of representation and segmentation, extending the segmentation method. As [2] is very generic, we use the idea of computing a similarity matrix for window connectivity information as described in section 2.2.4. 2.1 Representation As in [4], we represent melodies either as raw 1D pitch signals or as these signals filtered with the continuous wavelet transform (CWT) using the Haar wavelet at a single time scale. The melodic contour of a melody is sampled using chromatic MIDI pitch information at a defined sampling rate. In the case of pitch signal representation, after segmentation, melodic segments are normalized by subtracting the average pitch. 2.2 Segmentation 2.2.1 First stage segmentation We use some of the segmentation methods described in [4] and additionally use modulus maxima segmentation. The segmentation methods are: - constant segmentation, i.e., segmentation into segments of constant length, or - modulus maxima, where segmentation points are set at local modulus maxima of the wavelet coefficients. 2.2.2 Segment length normalization The segments obtained using these methods generally have different lengths. In order to normalize their length for the purpose of measuring their city-block distances,

and therefore have segments of equal length we define a maximal length for all segments and pad shorter segments as necessary with zeros at the end. 2.2.3 Comparison Segments are compared by building a distance matrix giving all pair-wise distances between segments in terms of normalized city-block distance. The normalization consists of dividing the pairwise distance by the length of the smallest segment before segment length normalization by zero padding. 2.2.4 Concatenation of segments We binarize the distance matrix setting a threshold: values lower than or equal to the threshold take the value of 1 or true, otherwise the value is 0 or false. We concatenate segments of contiguous true values of the diagonals, to form longer segments. 2.3 Comparison This time we use the segments that have been concatenated as described in 2.2.4. The comparison is the same as in 2.2.3. 2.4 Clustering The distance matrix obtained in 2.3 is used for clustering. We use agglomerative clusters from an agglomerative hierarchical cluster tree. Finally, clusters are ranked according to the sum of the length of segments occurrences. 3. EXPERIMENTS We tested the following parameter combinations: - Melodies sampled at 16 samples per quarter note (qn) - Representation: normalized pitch signal or wavelet coefficients filtered at the scale of 1 qn - Segmentation: constant segmentation or modulus maxima - Scale segmentation at 1 or 4 qn - Threshold for concatenating segments: 0.1 or 1 - Distance for both comparisons: city-block - Number of clusters: 7 - Ranking criterion: Sum of the length of occurrences 4. RESULTS We used the evaluation metrics defined by Collins and Meredith in [1] and Collins Matlab implementation to compute the results. The results are obtained applying our method on the JKU Patterns Development Database monophonic version, which contains five melodies for training: Bach's Fugue BWV 889, Beethoven's Sonata Op. 2, No. 1, Movement 3, Chopin's Mazurka Op. 24, No. 4, Gibbons's Silver Swan, and Mozart's Sonata K.282, Movement 2. Table 1 and Table 2 present the results of our two submissions VM1 and VM2 respectively. In our experiments we have tested all combinations mentioned in section 3, and selected two configurations to submit to MIREX. VM1 differs from VM2 in the following parameters: - Normalized pitch signal representation, - Constant segmentation at the scale of 1 qn, - Threshold for concatenation 0.1. VM2 differs from VM1 in the following parameters: - Wavelet coefficients representation filtered at the scale of 1 qn - Modulus maxima segmentation at the scale of 4 qn - Threshold for concatenation 1 According to Friedman s test (χ 2 (1)=1.8, p=0.1797) VM1 and VM2 show no significant difference in the results of the three-layer F1 score. However, for discovering exact occurrences, VM1 outperforms VM2, (χ 2 (1)=4, p=0.045). On the other hand, there is a statistically significant difference in the runtime, suggesting that VM2 should be preferable for fast computation, (χ 2 (1)=5, p=0.0253). In general, recall values are slightly higher than precision values, and the standard deviation of the recall values are slightly lower than the standard deviation of the precision values. For standard precision, recall and F1 score, the standard deviation is highest, compared to the standard deviation of establishment and occurrence measures. These results suggest that VM1 and VM2 perform consistent on the training dataset over establishment and occurrence values, and VM1 performs less consistent on the standard measures. 5. CONCLUSIONS We present a novel computational method for the discovery of repeated themes and sections in monophonic melodies and the results of our two submissions on the same task, considering that VM1 and VM2 perform similarly on the three-layer measures, but VM1 should be preferable for standard measures and VM2 should be preferable for runtime computation. 6. ACKNOWLEDGEMENT Gissel Velarde is supported by the Department of Architecture, Design and Media Technology at Aalborg University. The contribution of David Meredith to the work reported here was made as part of the Learning to Create project (Lrn2Cre8) which acknowledges the financial support of the Future and Emerging Technologies (FET) programme within the Seventh Framework Programme for Research of the European Commission, under FET grant number 610859.

7. REFERENCES [1] T. Collins. Mirex 2014 competition: Discovery of repeated themes and sections, 2014. http://www.musicir.org/mirex/wiki/2014:discovery_of_repeated_the mes_%26_sections. Accessed on 12 May 2014. [2] K. Jensen, M. Styczynski, I. Rigoutsos and G. Stephanopoulos: A generic motif discovery algorithm for sequential data, Bioinformatics, 22:1, pp. 21-28, 2006. [3] R. Monelle: Linguistics and Semiotics in Music, Harwood Academic Publishers, Chur, 1992. [4] G. Velarde, T. Weyde and D. Meredith: An approach to melodic segmentation and classification based on filtering with the Haar-wavelet, Journal of New Music Research, 42:4, 325-345, 2013.

Piece n_p n_q P_est R_est F1_est P_occ P_3 R_3 F1_3 Runtime FFTP_ FFP P_occ P R F1 (s) est Bach 3 7 0.87 0.95 0.91 0.63 0.72 0.67 0.51 0.65 0.57 8.50 0.95 0.60 0.63 0.72 0.67 0.14 0.33 0.20 Beethoven 7 7 0.92 0.92 0.92 0.98 0.98 0.98 0.86 0.91 0.88 31.00 0.76 0.80 0.89 0.93 0.91 0.57 0.57 0.57 Chopin 4 7 0.53 0.86 0.66 0.66 0.86 0.75 0.48 0.70 0.57 34.20 0.68 0.47 0.46 0.83 0.60 0.00 0.00 0.00 Gibbons 8 7 0.95 0.95 0.95 0.66 0.93 0.77 0.85 0.79 0.82 17.76 0.77 0.79 0.66 0.93 0.77 0.29 0.25 0.27 Mozart 9 7 0.92 0.79 0.85 0.82 0.96 0.88 0.79 0.69 0.73 23.61 0.67 0.73 0.72 0.92 0.81 0.57 0.44 0.50 mean 6.2 7 0.84 0.89 0.86 0.75 0.89 0.81 0.70 0.75 0.71 23.01 0.77 0.68 0.67 0.87 0.75 0.31 0.32 0.31 SD 2.59 0 0.17 0.07 0.12 0.15 0.11 0.12 0.19 0.10 0.14 10.34 0.11 0.14 0.15 0.09 0.12 0.26 0.22 0.23 Table 1. Results of VM1 on the JKU Patterns Development Database. Piece n_p n_q P_est R_est F1_est P_occ P_3 R_3 F1_3 Runtime FFTP_ FFP P_occ P R F1 (s) est Bach 3 7 0.56 0.65 0.60 0.89 0.43 0.58 0.39 0.41 0.40 5.07 0.59 0.37 0.56 0.46 0.50 0.00 0.00 0.00 Beethoven 7 7 0.90 0.90 0.90 0.79 0.89 0.84 0.82 0.86 0.84 5.54 0.67 0.75 0.83 0.90 0.86 0.00 0.00 0.00 Chopin 4 7 0.58 0.86 0.69 0.69 0.83 0.75 0.53 0.78 0.64 5.83 0.65 0.44 0.67 0.65 0.66 0.00 0.00 0.00 Gibbons 8 7 0.92 0.88 0.90 0.79 0.84 0.82 0.81 0.73 0.77 2.22 0.70 0.76 0.72 0.69 0.71 0.14 0.13 0.13 Mozart 9 7 0.83 0.71 0.77 0.93 0.93 0.93 0.77 0.63 0.69 5.70 0.56 0.68 0.84 0.88 0.86 0.00 0.00 0.00 mean 6.2 7 0.76 0.80 0.77 0.82 0.78 0.78 0.66 0.68 0.67 4.87 0.63 0.60 0.72 0.71 0.72 0.03 0.03 0.03 SD 2.59 0 0.17 0.11 0.13 0.09 0.20 0.13 0.19 0.17 0.17 1.51 0.06 0.18 0.12 0.18 0.15 0.06 0.06 0.06 Table 2. Results of VM2 on the JKU Patterns Development Database.