Music Structure Analysis

Similar documents
Music Structure Analysis

Audio Structure Analysis

Music Structure Analysis

Methods for the automatic structural analysis of music. Jordan B. L. Smith CIRMMT Workshop on Structural Analysis of Music 26 March 2010

Audio Structure Analysis

Further Topics in MIR

Audio Structure Analysis

AUTOMATED METHODS FOR ANALYZING MUSIC RECORDINGS IN SONATA FORM

Tempo and Beat Tracking

Grouping Recorded Music by Structural Similarity Juan Pablo Bello New York University ISMIR 09, Kobe October 2009 marl music and audio research lab

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Data-Driven Solo Voice Enhancement for Jazz Music Retrieval

Music Information Retrieval

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Music Processing Introduction Meinard Müller

CS 591 S1 Computational Audio

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

Informed Feature Representations for Music and Motion

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

Popular Song Summarization Using Chorus Section Detection from Audio Signal

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

The song remains the same: identifying versions of the same piece using tonal descriptors

New Developments in Music Information Retrieval

Tempo and Beat Analysis

Chord Classification of an Audio Signal using Artificial Neural Network

MODELS of music begin with a representation of the

Subjective Similarity of Music: Data Collection for Individuality Analysis

An Examination of Foote s Self-Similarity Method

Meinard Müller. Beethoven, Bach, und Billionen Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

MUSI-6201 Computational Music Analysis

/$ IEEE

Music Processing Audio Retrieval Meinard Müller

Computational Modelling of Harmony

FREISCHÜTZ DIGITAL: A CASE STUDY FOR REFERENCE-BASED AUDIO SEGMENTATION OF OPERAS

A Study of Synchronization of Audio Data with Symbolic Data. Music254 Project Report Spring 2007 SongHui Chon

Music Information Retrieval (MIR)

ANALYZING MEASURE ANNOTATIONS FOR WESTERN CLASSICAL MUSIC RECORDINGS

The Effect of DJs Social Network on Music Popularity

IMPROVING MARKOV MODEL-BASED MUSIC PIECE STRUCTURE LABELLING WITH ACOUSTIC INFORMATION

GRADIENT-BASED MUSICAL FEATURE EXTRACTION BASED ON SCALE-INVARIANT FEATURE TRANSFORM

TOWARDS EVALUATING MULTIPLE PREDOMINANT MELODY ANNOTATIONS IN JAZZ RECORDINGS

Music Representations

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

Effects of acoustic degradations on cover song recognition

Music Information Retrieval

Music Information Retrieval (MIR)

Content-based music retrieval

Towards Supervised Music Structure Annotation: A Case-based Fusion Approach.

Polyphonic Audio Matching for Score Following and Intelligent Audio Editors

STRUCTURAL CHANGE ON MULTIPLE TIME SCALES AS A CORRELATE OF MUSICAL COMPLEXITY

AUTOMASHUPPER: AN AUTOMATIC MULTI-SONG MASHUP SYSTEM

Robert Alexandru Dobre, Cristian Negrescu

Beethoven, Bach und Billionen Bytes

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

DESIGN AND CREATION OF A LARGE-SCALE DATABASE OF STRUCTURAL ANNOTATIONS

Musical Examination to Bridge Audio Data and Sheet Music

A FORMALIZATION OF RELATIVE LOCAL TEMPO VARIATIONS IN COLLECTIONS OF PERFORMANCES

Research Article Multiple Scale Music Segmentation Using Rhythm, Timbre, and Harmony

Citation for published version (APA): Jensen, K. K. (2005). A Causal Rhythm Grouping. Lecture Notes in Computer Science, 3310,

Statistical Modeling and Retrieval of Polyphonic Music

Content-based Music Structure Analysis with Applications to Music Semantics Understanding

SHEET MUSIC-AUDIO IDENTIFICATION

A repetition-based framework for lyric alignment in popular songs

Toward Automatic Music Audio Summary Generation from Signal Analysis

Audio Feature Extraction for Corpus Analysis

A MID-LEVEL REPRESENTATION FOR CAPTURING DOMINANT TEMPO AND PULSE INFORMATION IN MUSIC RECORDINGS

DOWNBEAT TRACKING WITH MULTIPLE FEATURES AND DEEP NEURAL NETWORKS

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

Music Genre Classification and Variance Comparison on Number of Genres

SINGING EXPRESSION TRANSFER FROM ONE VOICE TO ANOTHER FOR A GIVEN SONG. Sangeon Yong, Juhan Nam

Beethoven, Bach, and Billions of Bytes

Lecture 9 Source Separation

Predicting Time-Varying Musical Emotion Distributions from Multi-Track Audio

A CHROMA-BASED SALIENCE FUNCTION FOR MELODY AND BASS LINE ESTIMATION FROM MUSIC AUDIO SIGNALS

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

Paulo V. K. Borges. Flat 1, 50A, Cephas Av. London, UK, E1 4AR (+44) PRESENTATION

Appendix A Types of Recorded Chords

Aspects of Music. Chord Recognition. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Piece of music. Rhythm.

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

Repeating Pattern Discovery and Structure Analysis from Acoustic Music Data

Analysing Musical Pieces Using harmony-analyser.org Tools

AUTOMATIC MAPPING OF SCANNED SHEET MUSIC TO AUDIO RECORDINGS

Automatic Extraction of Popular Music Ringtones Based on Music Structure Analysis

IMPROVING RHYTHMIC SIMILARITY COMPUTATION BY BEAT HISTOGRAM TRANSFORMATIONS

Rhythm related MIR tasks

Retrieval of textual song lyrics from sung inputs

Automatic Rhythmic Notation from Single Voice Audio Sources

SIMULTANEOUS SEPARATION AND SEGMENTATION IN LAYERED MUSIC

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

Music Segmentation Using Markov Chain Methods

RETRIEVING AUDIO RECORDINGS USING MUSICAL THEMES

LyricAlly: Automatic Synchronization of Acoustic Musical Signals and Textual Lyrics

Music structure information is

Is Music Structure Annotation Multi-Dimensional? A Proposal for Robust Local Music Annotation.

TOWARDS AUTOMATED EXTRACTION OF TEMPO PARAMETERS FROM EXPRESSIVE MUSIC RECORDINGS

Music Information Retrieval

Towards Automated Processing of Folk Song Recordings

mir_eval: A TRANSPARENT IMPLEMENTATION OF COMMON MIR METRICS

USING MUSICAL STRUCTURE TO ENHANCE AUTOMATIC CHORD TRANSCRIPTION

Music Information Retrieval. Juan Pablo Bello MPATE-GE 2623 Music Information Retrieval New York University

Transcription:

Tutorial Automatisierte Methoden der Musikverarbeitung 47. Jahrestagung der Gesellschaft für Informatik Music Structure Analysis Meinard Müller, Christof Weiss, Stefan Balke International Audio Laboratories Erlangen {meinard.mueller, christof.weiss, stefan.balke}@audiolabs-erlangen.de

Book: Fundamentals of Music Processing Meinard Müller Fundamentals of Music Processing Audio, Analysis, Algorithms, Applications 483 p., 249 illus., hardcover ISBN: 978-3-319-21944-8 Springer, 2015 Accompanying website: www.music-processing.de 2

Book: Fundamentals of Music Processing Meinard Müller Fundamentals of Music Processing Audio, Analysis, Algorithms, Applications 483 p., 249 illus., hardcover ISBN: 978-3-319-21944-8 Springer, 2015 Accompanying website: www.music-processing.de 3

Motivation T T 4

Music Structure Analysis General Goal: Divide an audio recording into temporal segments corresponding to musical parts and group these segments into musically meaningful categories. Examples: Stanzas of a folk song Intro, verse, chorus, bridge, outro sections of a pop song Exposition, development, recapitulation, coda of a sonata Musical form ABACADA of a rondo Solo parts in a jazz recording 5

Example: Folk Song Nederlandse Liederenbank Time (seconds) 6

Example: Opera Weber, Song (No. 4) from Der Freischütz Introduction Stanzas Dialogues....... Kleiber 0 50 100 150 200 Time (seconds) Ackermann 20 40 60 80 100 120 Time (seconds) 7

Example: Jazz Recording Clifford Brown - Jordu T T Repetitions Head-In Head-Out Instrument Comics by H. Grohganz: https://mir.sechsachtel.de/orchpics/ 8

Weimar Jazz Database (WJD) http://jazzomat.hfm-weimar.de 456 transcribed jazz solos of monophonic instruments. Transcription Beats E 7 A 7 D 7 G 7 Chords Transcriptions specify a musical pitch for physical time instances. 810 min. of audio recordings. Soon available: Track structure Thanks to the Jazzomat research team: M. Pfleiderer, K. Frieler, J. Abeßer, W.-G. Zaddach 9

Annotation Example from the WJD Clifford Brown - Jordu Time (s) Song-Centric Annotations: Chorus boundaries Solo choruses Theme repetitions Many Thanks to Moritz Berendes and Julian Reck! 10

Music Structure Analysis Main principles: Repetition-based Structure Analysis Homogeneity-based Structure Analysis Novelty-based Structure Analysis 11

Music Structure Analysis Image Analogy Novelty Homogeneity Repetition

Self-Similarity Matrix 1. Step: Extract Audio Features MFCC Time (s) Mel Frequency Cepstral Coefficients (MFCC) correlate to the timbre. 13

Self-Similarity Matrix 2. Step: Calculate Pairwise Similarity Time (s) Time (s) 14

Repetitions: Path-like structures T O Self-Similarity Matrix Analysis Homogeneity: Block-like structures T Novelty: Corners T T 15 O

Self-Similarity Matrix Novelty Detection T O Idea (Foote): Use checkerboard-like kernel function to detect corner points T on main diagonal of SSM. 16

Self-Similarity Matrix Chroma Features B Chroma F# C# C Time (s) Chroma Feature correlate to harmonic and melodic progressions. 17

Self-Similarity Matrix Chroma Features Time (s) Time (s) 18

Self-Similarity Matrix Chroma Features Chroma instead of MFCC Repetitions result in path-like structures Head-In and Head-Out T T O Drum Solo T T O 19

Self-Similarity Matrix Recap Repetitions: Path-like structures Homogeneity: Block-like structures Novelty: Corners Features are important! T T O 20

Audio Examples Clifford Brown Jordu Chroma T MFCC T O 21

Audio Examples John Coltrane Blue Trane Chroma T T 22 MFCC

Audio Examples Herbie Hancock Maiden Voyage Chroma I T T O 23 MFCC

Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy) A1 A2 B1 B2 C A3 B3 B4 Time (seconds) 24

Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy) G major G major

Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy)

Faster Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy) Slower

Faster Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy) Slower

Self-Similarity Matrix Example: Brahms Hungarian Dance No. 5 (Ormandy) Idealized SSM

Similarity Matrix Toolbox Meinard Müller, Nanzhu Jiang, Harald Grohganz SM Toolbox: MATLAB Implementations for Computing and Enhancing Similarity Matrices http://www.audiolabs-erlangen.de/resources/mir/smtoolbox/ 35

Demo Code: https://github.com/stefan-balke/mpa-exc

Conclusions Structure Analysis

Conclusions Score Audio MIDI Representations Structure Analysis

Conclusions Score Audio MIDI Representations Harmony Musical Aspects Timbre Structure Analysis Tempo

Conclusions Score Audio MIDI Representations Musical Aspects Structure Analysis Segmentation Principles Harmony Timbre Tempo Repetition Homogeneity Novelty

Conclusions Temporal and Hierarchical Context Score Audio MIDI Representations Musical Aspects Structure Analysis Segmentation Principles Harmony Timbre Tempo Repetition Homogeneity Novelty

References W. CHAI AND B. VERCOE, Music thumbnailing via structural analysis, in Proceedings of the ACM International Conference on Multimedia, Berkeley, CA, USA, 2003, pp. 223 226. M. COOPER AND J. FOOTE, Automatic music summarization via similarity analysis, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Paris, France, 2002, pp. 81 85. R. B. DANNENBERG AND M. GOTO, Music structure analysis from acoustic signals, in Handbook of Signal Processing in Acoustics, D. Havelock, S. J. FOOTE, Visualizing music and audio using self-similarity, in Proceedings of the ACM International Conference on Multimedia, Orlando, FL, USA, 1999, pp. 77 80. J. FOOTE, Automatic audio segmentation using a measure of audio novelty, in Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), New York, NY, USA, 2000, pp. 452 455. M. GOTO, A chorus section detection method for musical audio signals and its application to a music listening station, IEEE Transactions on Audio, Speech and Language Processing, 14 (2006), pp. 1783 1794 H. GROHGANZ, M. CLAUSEN, N. JIANG, AND M. MÜLLER, Converting path structures into block structures using eigenvalue decompositions of self-similarity matrices, in Proceedings of the 14th International Conference on Music Information Retrieval (ISMIR), Curitiba, Brazil, 2013, pp. 209 214. K. JENSEN, Multiple scale music segmentation using rhythm, timbre, and harmony, EURASIP Journal on Advances in Signal Processing, 2007 (2007). F. KAISER AND T. SIKORA, Music structure discovery in popular music using non-negative matrix factorization, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Utrecht, The Netherlands, 2010, pp. 429 434.

References M. LEVY, M. SANDLER, AND M. A. CASEY, Extraction of high-level musical structure from audio data and its application to thumbnail generation, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Toulouse, France, 2006, pp. 13 16. H. LUKASHEVICH, Towards quantitative measures of evaluating song segmentation, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Philadelphia, USA, 2008, pp. 375 380. M. MÜLLER AND M. CLAUSEN, Transposition-invariant self-similarity matrices, in Proceedings of the 8th International Conference on Music Information Retrieval (ISMIR), Vienna, Austria, 2007, pp. 47 50. M. MÜLLER AND N. JIANG, A scape plot representation for visualizing repetitive structures of music recordings, in Proceedings of the 13th International Conference on Music Information Retrieval (ISMIR), Porto, Portugal, 2012, pp. 97 102. M. MÜLLER, N. JIANG, AND H. GROHGANZ, SM Toolbox: MATLAB implementations for computing and enhancing similiarty matrices, in Proceedings of the 53rd AES Conference on Semantic Audio, London, GB, 2014. M. MÜLLER, N. JIANG, AND P. GROSCHE, A robust fitness measure for capturing repetitions in music recordings with applications to audio thumbnailing, IEEE Transactions on Audio, Speech & Language Processing, 21 (2013), pp. 531 543. M. MÜLLER AND F. KURTH, Enhancing similarity matrices for music audio analysis, in Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toulouse, France, 2006, pp. 437 440. M. MÜLLER AND F. KURTH, Towards structural analysis of audio recordings in the presence of musical variations, EURASIP Journal on Advances in Signal Processing, 2007 (2007).

References J. PAULUS AND A. P. KLAPURI, Music structure analysis using a probabilistic fitness measure and a greedy search algorithm, IEEE Transactions on Audio, Speech, and Language Processing, 17 (2009), pp. 1159 1170. J. PAULUS, M. MÜLLER, AND A. P. KLAPURI, Audio-based music structure analysis, in Proceedings of the 11th International Conference on Music Information Retrieval (ISMIR), Utrecht, The Netherlands, 2010, pp. 625 636. G. PEETERS, Deriving musical structure from signal analysis for music audio summary generation: sequence and state approach, in Computer Music Modeling and Retrieval, vol. 2771 of Lecture Notes in Computer Science, Springer Berlin / Heidelberg, 2004, pp. 143 166. G. PEETERS, Sequence representation of music structure using higher-order similarity matrix and maximum-likelihood approach, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Vienna, Austria, 2007, pp. 35 40. C. RHODES AND M. A. CASEY, Algorithms for determining and labelling approximate hierarchical self-similarity, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Vienna, Austria, 2007, pp. 41 46. J. SERRÀ, M. MÜLLER, P. GROSCHE, AND J. L. ARCOS, Unsupervised detection of music boundaries by time series structure features, in Proceedings of the AAAI International Conference on Artificial Intelligence, Toronto, Ontario, Canada, 2012, pp. 1613 1619. J. B. L. SMITH, J. A. BURGOYNE, I. FUJINAGA, D. D. ROURE, AND J. S. DOWNIE, Design and creation of a large-scale database of structural annotations, in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), Miami, FL, USA, 2011, pp. 555 560. J. B. L. SMITH AND E. CHEW, Using quadratic programming to estimate feature relevance in structural analyses of music, in Proceedings of the ACM International Conference on Multimedia, 2013, pp. 113 122.

References M. SUNKEL, S. JANSEN, M. WAND, E. EISEMANN, H.-P. SEIDEL, Learning Line Features in 3D Geometry, in Computer Graphics Forum (Proc. Eurographics), 2011. D. TURNBULL, G. LANCKRIET, E. PAMPALK, AND M. GOTO, A supervised approach for detecting boundaries in music using difference features and boosting, in Proceedings of the International Conference on Music Information Retrieval (ISMIR), Vienna, Austria, 2007, pp. 51 54. G. TZANETAKIS AND P. COOK, Multifeature audio segmentation for browsing and annotation, in Proceedings of the IEEEWorkshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Platz, NY, USA, 1999, pp. 103 106.