Grouping Recorded Music by Structural Similarity
Juan Pablo Bello, New York University
ISMIR 09, Kobe, October 2009
MARL: Music and Audio Research Lab



Sequence-based analysis

Structure discovery
Cooper, M. & Foote, J. (2002), Automatic Music Summarization via Similarity Analysis
Foote, J. et al. (2002), Audio Retrieval by Rhythmic Similarity
Peeters, G. (2007), Sequence Representation of Music Structure Using Higher-Order Similarity Matrix and Maximum-Likelihood Approach
Bartsch, M. & Wakefield, G. H. (2001), To Catch a Chorus: Using Chroma-Based Representations for Audio Thumbnailing
Dannenberg, R. & Hu, N. (2002), Pattern Discovery Techniques for Music Audio
Peeters, G., La Burthe, A. & Rodet, X. (2002), Toward Automatic Music Audio Summary Generation from Signal Analysis
Wellhausen, J. & Crysandt, H. (2003), Temporal Audio Segmentation Using MPEG-7 Descriptors
Goto, M. (2003), A Chorus-Section Detecting Method for Musical Audio Signals
Chai, W. & Vercoe, B. (2003), Structural Analysis of Musical Signals for Indexing and Thumbnailing
Lu, L., Li, S., Liu, W.-Y. & Zhang, H.-J. (2002), Audio Textures
Casey, M.: musicstructure.com
Lukashevich, H. (2008), Towards Quantitative Measures of Evaluating Song Segmentation
Jensen, K. et al. (2005), Rhythm-Based Segmentation of Popular Chinese Music
Müller, M. & Clausen, M. (2007), Transposition-Invariant Self-Similarity Matrices
Paulus, J. & Klapuri, A. (2008), Music Structure Analysis Using a Probabilistic Fitness Measure and an Integrated Musicological Model
and others

Self-similarity
Self-similarity for music retrieval (Izumitani and Kashino, ICME-07 / ISMIR-08; Martin et al., ISMIR-09)

Structural similarity
Music signals -> trajectories in an N-dimensional feature space
Trajectories are fully characterized by the self-similarity matrix (rotation and time-invariance)
Structural similarity -> distances between matrices
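The rotation invariance mentioned above can be illustrated with a minimal sketch. It assumes Euclidean distance between frames and uses a made-up 2-D toy trajectory in place of real audio features; the helper names `self_similarity` and `rotate2d` are hypothetical and do not come from the talk.

```python
import math

def self_similarity(traj):
    """Pairwise Euclidean distances between all frames of a trajectory."""
    return [[math.dist(a, b) for b in traj] for a in traj]

def rotate2d(traj, angle):
    """Rotate every 2-D point of the trajectory by the same angle."""
    c, s = math.cos(angle), math.sin(angle)
    return [(c * x - s * y, s * x + c * y) for x, y in traj]

# Toy 2-D "feature trajectory" (illustrative values, not real audio features).
trajectory = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 0.0), (1.0, 0.0)]

ssm = self_similarity(trajectory)
ssm_rotated = self_similarity(rotate2d(trajectory, math.pi / 3))

# Largest entry-wise difference between the two matrices.
max_diff = max(
    abs(a - b)
    for row_a, row_b in zip(ssm, ssm_rotated)
    for a, b in zip(row_a, row_b)
)
print(max_diff)
```

Because the matrix stores only pairwise distances, rotating the whole trajectory leaves it unchanged up to floating-point error, while repeated frames (here frames 0 and 3) show up as zero-distance entries.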

Representation
track i -> feature extraction -> self-similarity -> quantization -> object i
Feature extraction: MFCC or chroma, averaged between beats, LPF and standardized
Self-similarity: Euclidean or cosine distance, normalized
Quantization: uniformly quantized, 1, 2, 3 or 4 bits
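The last two stages of this pipeline might look roughly like the sketch below. The slides do not say how the matrix is normalized or how the quantized matrix is serialized, so dividing by the largest entry and writing one byte per entry in row order are both assumptions, and the function names are hypothetical.

```python
def normalize(matrix):
    """Scale entries to [0, 1] by dividing by the largest entry (assumed scheme)."""
    peak = max(max(row) for row in matrix)
    if peak == 0:
        return [[0.0 for _ in row] for row in matrix]
    return [[v / peak for v in row] for row in matrix]

def quantize(matrix, bits):
    """Uniformly quantize entries in [0, 1] to 2**bits integer levels."""
    levels = 2 ** bits
    return [[min(int(v * levels), levels - 1) for v in row] for row in matrix]

def to_object(qmatrix):
    """Serialize row by row, one byte per entry (assumed serialization)."""
    return bytes(v for row in qmatrix for v in row)

# Toy distance matrix (illustrative values only).
distances = [
    [0.0, 2.0, 4.0],
    [2.0, 0.0, 1.0],
    [4.0, 1.0, 0.0],
]

q = quantize(normalize(distances), bits=2)
obj = to_object(q)
print(q, obj)
```

Coarser quantization maps nearby distance values to the same symbol, which is what gives a general-purpose compressor repeated patterns to exploit.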

Similarity
We use the Normalized Compression Distance (NCD; Li and Vitanyi 1997; Cilibrasi, Vitanyi and de Wolf 2004; Cilibrasi and Vitanyi 2005):
$$\mathrm{NCD}(o_1, o_2) = \frac{C(o_1 o_2) - \min\{C(o_1), C(o_2)\}}{\max\{C(o_1), C(o_2)\}}$$
C(.) is the size of a compressed object using a standard algorithm.
o_1 o_2 is the concatenation of objects o_1 and o_2.
Why NCD?
It is (quasi) universal
It is topic/parameter free (other than the choice of compressor: gzip, bzip2, PPMz)
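As a minimal sketch of the NCD definition above, the snippet below uses Python's `zlib` (DEFLATE, the algorithm behind gzip) as the compressor C. The choice of `zlib`, the compression level, and the random test objects are assumptions made for illustration; the talk itself lists gzip, bzip2 and PPMz as options.

```python
import random
import zlib

def compressed_size(data: bytes) -> int:
    """C(.): size in bytes of the zlib-compressed object."""
    return len(zlib.compress(data, 9))

def ncd(o1: bytes, o2: bytes) -> float:
    """Normalized Compression Distance between two byte strings."""
    c1, c2 = compressed_size(o1), compressed_size(o2)
    c12 = compressed_size(o1 + o2)  # o1 + o2 is the concatenation o_1 o_2
    return (c12 - min(c1, c2)) / max(c1, c2)

# Two unrelated pseudo-random objects over a 4-symbol (2-bit) alphabet.
rng = random.Random(0)
a = bytes(rng.randrange(4) for _ in range(4000))
b = bytes(rng.randrange(4) for _ in range(4000))

d_same = ncd(a, a)  # an object compared with itself: close to 0
d_diff = ncd(a, b)  # unrelated objects: close to 1
print(d_same, d_diff)
```

With a real compressor the distance of an object to itself is small but not exactly zero, which is one reason NCD is only an approximation of the ideal, incomputable distance.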

Experimental setup
P56 (Widmer et al. 2003): 56 piano music recordings, 25 pianists (1946-1998); 8 works, 3 composers (Beethoven, Mozart, Chopin); each work has 3-13 renditions
S67: 67 recordings of symphonic music, 34 conductors (1948-2008); 11 works, 7 composers (Beethoven, Berlioz, Brahms, Mahler, Mendelssohn, Mozart, Tchaikovsky); each work has 6-7 renditions
Goal: cluster performances of a given work together

Example 1: beats

Example 2: length
Mozart_kv282_1 (7'16" to 8'04"); Shortest (2'41"); 2nd shortest (5'42"); 3rd shortest (5'51")

Example 2: length
Mozart_kv279_3 (3'14" to 3'32"); Shortest (1'54"); 2nd longest (4'35"); Longest (5'37")

Example 2: length
Chopin_op15_1: 4'07", 4'14", 4'31", 5'07", 5'41"; Shortest (3'39")

Best results
Eliminate beat-tracking and re-sample feature matrix to a fixed length
Moving average filter across diagonals (Müller and Kurth 07) and binary encoding
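The three steps named on this slide can be sketched as follows. The slides give no details, so nearest-neighbour resampling, the window length, the threshold value, and all function names are assumptions chosen only to make the idea concrete; this is not the exact filter of Müller and Kurth.

```python
def resample(frames, target_len):
    """Pick target_len frames at evenly spaced positions (nearest-neighbour, assumed)."""
    n = len(frames)
    return [frames[min(n - 1, (i * n) // target_len)] for i in range(target_len)]

def smooth_diagonals(ssm, length):
    """Average each entry with its successors along the main-diagonal direction."""
    n = len(ssm)
    out = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            vals = [ssm[i + k][j + k] for k in range(length)
                    if i + k < n and j + k < n]
            out[i][j] = sum(vals) / len(vals)
    return out

def binarize(matrix, threshold):
    """Binary encoding: 1 where the distance is below the threshold, else 0."""
    return [[1 if v < threshold else 0 for v in row] for row in matrix]

# Fixed-length resampling of a toy sequence of frame indices.
fixed = resample(list(range(10)), 5)

# Toy distance matrix (illustrative values only).
ssm = [
    [0.0, 0.9, 0.1, 0.8],
    [0.9, 0.0, 0.9, 0.5],
    [0.1, 0.9, 0.0, 0.9],
    [0.8, 0.5, 0.9, 0.0],
]
smoothed = smooth_diagonals(ssm, length=2)
binary = binarize(smoothed, threshold=0.4)
print(fixed, binary)
```

Resampling to a fixed length removes the dependence on a beat tracker, and averaging along diagonals emphasizes repeated passages (which appear as diagonal stripes) before the matrix is reduced to one bit per entry.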

Works in S67: Beethoven_op67_1, Beethoven_op68_4, Berlioz_op14_4, Brahms_op68_3, Brahms_op98_3, Mahler_sym1_2, Mahler_sym4_2, Mendelssohn_op90_4, Mozart_k385_4, Mozart_k550_3, Tchaikovsky_op74_2

NCD limitations


Concluding remarks
Contributions:
Rotation and time-invariant representation of music structure
Parameterization, including a quantization strategy to aid generalization
A simple method for computing similarity
Proof-of-concept (small-scale) evaluation on expressive music
Results suggest that:
Intermediate processes (e.g. beat tracking) degrade performance (Serra et al., 2008)
Global similarity limits robustness to structural changes (e.g. Berlioz)
Current and future work:
Scalability and noise-robustness
Characterization of local structures is desirable (LSH, DP, MCMO solutions)

Thanks a lot!
E-mail: jpbello@nyu.edu
Web: homepages.nyu.edu/~jb2843
This work is made possible by grants from the U.S. Institute of Museum and Library Services and the National Science Foundation.
Also many thanks to: Ernest Li for his ideas; Gerhard Widmer and Werner Goebl for the P56 dataset; Craig Sapp for the CHARM dataset; Dan Ellis and the CompLearn team for making their code available.