Music Structure Analysis

Similar documents
Audio Structure Analysis

Music Structure Analysis

Audio Structure Analysis

Music Structure Analysis

Audio Structure Analysis

AUTOMATED METHODS FOR ANALYZING MUSIC RECORDINGS IN SONATA FORM

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Music Processing Introduction Meinard Müller

Tempo and Beat Tracking

CS 591 S1 Computational Audio

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

Tempo and Beat Analysis

Informed Feature Representations for Music and Motion

Music Processing Audio Retrieval Meinard Müller

Further Topics in MIR

Meinard Müller. Beethoven, Bach, und Billionen Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

Music Information Retrieval

Music Information Retrieval (MIR)

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Methods for the automatic structural analysis of music. Jordan B. L. Smith CIRMMT Workshop on Structural Analysis of Music 26 March 2010

Grouping Recorded Music by Structural Similarity Juan Pablo Bello New York University ISMIR 09, Kobe October 2009 marl music and audio research lab

Music Representations

A repetition-based framework for lyric alignment in popular songs

Music Radar: A Web-based Query by Humming System

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Music Segmentation Using Markov Chain Methods

Towards Supervised Music Structure Annotation: A Case-based Fusion Approach.

Music Information Retrieval (MIR)

MUSI-6201 Computational Music Analysis

Beethoven, Bach, and Billions of Bytes

Effects of acoustic degradations on cover song recognition

A FORMALIZATION OF RELATIVE LOCAL TEMPO VARIATIONS IN COLLECTIONS OF PERFORMANCES

A MID-LEVEL REPRESENTATION FOR CAPTURING DOMINANT TEMPO AND PULSE INFORMATION IN MUSIC RECORDINGS

FREISCHÜTZ DIGITAL: A CASE STUDY FOR REFERENCE-BASED AUDIO SEGMENTATION OF OPERAS

AUTOMATIC ACCOMPANIMENT OF VOCAL MELODIES IN THE CONTEXT OF POPULAR MUSIC

New Developments in Music Information Retrieval

Music Alignment and Applications. Introduction

Beethoven, Bach und Billionen Bytes

AUTOMATIC MAPPING OF SCANNED SHEET MUSIC TO AUDIO RECORDINGS

Popular Song Summarization Using Chorus Section Detection from Audio Signal

Audio Feature Extraction for Corpus Analysis

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

ARECENT emerging area of activity within the music information

AUTOMASHUPPER: AN AUTOMATIC MULTI-SONG MASHUP SYSTEM

A wavelet-based approach to the discovery of themes and sections in monophonic melodies Velarde, Gissel; Meredith, David

MUSIC is a ubiquitous and vital part of the lives of billions

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

SHEET MUSIC-AUDIO IDENTIFICATION

10 Visualization of Tonal Content in the Symbolic and Audio Domains

Aspects of Music. Chord Recognition. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Piece of music. Rhythm.

The song remains the same: identifying versions of the same piece using tonal descriptors

The Effect of DJs Social Network on Music Popularity

Music Synchronization. Music Synchronization. Music Data. Music Data. General Goals. Music Information Retrieval (MIR)

Shades of Music. Projektarbeit

MATCHING MUSICAL THEMES BASED ON NOISY OCR AND OMR INPUT. Stefan Balke, Sanu Pulimootil Achankunju, Meinard Müller

TOWARDS AUTOMATED EXTRACTION OF TEMPO PARAMETERS FROM EXPRESSIVE MUSIC RECORDINGS

Detecting Musical Key with Supervised Learning

MODELS of music begin with a representation of the

MUSIC SHAPELETS FOR FAST COVER SONG RECOGNITION

USING MUSICAL STRUCTURE TO ENHANCE AUTOMATIC CHORD TRANSCRIPTION

Homework 2 Key-finding algorithm

arxiv: v1 [cs.ir] 2 Aug 2017

Citation for published version (APA): Jensen, K. K. (2005). A Causal Rhythm Grouping. Lecture Notes in Computer Science, 3310,

Lecture 9 Source Separation

CSC475 Music Information Retrieval

A CHROMA-BASED SALIENCE FUNCTION FOR MELODY AND BASS LINE ESTIMATION FROM MUSIC AUDIO SIGNALS

Robert Alexandru Dobre, Cristian Negrescu

THE importance of music content analysis for musical

Krzysztof Rychlicki-Kicior, Bartlomiej Stasiak and Mykhaylo Yatsymirskyy Lodz University of Technology

DISCOVERY OF REPEATED VOCAL PATTERNS IN POLYPHONIC AUDIO: A CASE STUDY ON FLAMENCO MUSIC. Univ. of Piraeus, Greece

A comparison and evaluation of approaches to the automatic formal analysis of musical audio

Rhythm related MIR tasks

Topic 10. Multi-pitch Analysis

Chord Recognition. Aspects of Music. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Music Processing.

Analysing Musical Pieces Using harmony-analyser.org Tools

AUDIO MATCHING VIA CHROMA-BASED STATISTICAL FEATURES

hit), and assume that longer incidental sounds (forest noise, water, wind noise) resemble a Gaussian noise distribution.

SINGING EXPRESSION TRANSFER FROM ONE VOICE TO ANOTHER FOR A GIVEN SONG. Sangeon Yong, Juhan Nam

An Examination of Foote s Self-Similarity Method

Computational Modelling of Harmony

Music Similarity and Cover Song Identification: The Case of Jazz

Music Database Retrieval Based on Spectral Similarity

REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

CSC475 Music Information Retrieval

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Research Article Multiple Scale Music Segmentation Using Rhythm, Timbre, and Harmony

Interacting with a Virtual Conductor

Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification

Repeating Pattern Discovery and Structure Analysis from Acoustic Music Data

/$ IEEE

Automatic Identification of Samples in Hip Hop Music

Subjective Similarity of Music: Data Collection for Individuality Analysis

Audio Cover Song Identification using Convolutional Neural Network

Machine Learning Term Project Write-up Creating Models of Performers of Chopin Mazurkas

STRUCTURAL CHANGE ON MULTIPLE TIME SCALES AS A CORRELATE OF MUSICAL COMPLEXITY

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

Transcription:

Lecture Music Processing Music Structure Analysis Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de

Book: Fundamentals of Music Processing Meinard Müller Fundamentals of Music Processing Audio, Analysis, Algorithms, Applications 483 p., 249 illus., hardcover ISBN: 978-3-319-21944-8 Springer, 2015 Accompanying website: www.music-processing.de

Book: Fundamentals of Music Processing Meinard Müller Fundamentals of Music Processing Audio, Analysis, Algorithms, Applications 483 p., 249 illus., hardcover ISBN: 978-3-319-21944-8 Springer, 2015 Accompanying website: www.music-processing.de

Book: Fundamentals of Music Processing Meinard Müller Fundamentals of Music Processing Audio, Analysis, Algorithms, Applications 483 p., 249 illus., hardcover ISBN: 978-3-319-21944-8 Springer, 2015 Accompanying website: www.music-processing.de

Chapter 4: Music Structure Analysis 4.1 General Principles 4.2 Self-Similarity Matrices 4.3 Audio Thumbnailing 4.4 Novelty-Based Segmentation 4.5 Evaluation 4.6 Further Notes In Chapter 4, we address a central and well-researched area within MIR known as music structure analysis. Given a music recording, the objective is to identify important structural elements and to temporally segment the recording according to these elements. Within this scenario, we discuss fundamental segmentation principles based on repetitions, homogeneity, and novelty principles that also apply to other types of multimedia beyond music. As an important technical tool, we study in detail the concept of self-similarity matrices and discuss their structural properties. Finally, we briefly touch the topic of evaluation, introducing the notions of precision, recall, and F-measure.

Music Structure Analysis Example: Zager & Evans In The Year 2525 Time (seconds)

Music Structure Analysis Example: Zager & Evans In The Year 2525 Time (seconds)

Music Structure Analysis Example: Zager & Evans In The Year 2525 I V1 V2 V3 V4 V5 V6 V7 B V8 O

Music Structure Analysis Example: Brahms Hungarian Dance No. 5 (Ormandy) A1 A2 B1 B2 C A3 B3 B4 Time (seconds)

Music Structure Analysis Example: Folk Song Field Recording (Nederlandse Liederenbank) Time (seconds)

Music Structure Analysis Example: Weber, Song (No. 4) from Der Freischütz Introduction Stanzas Dialogues....... Kleiber 0 50 100 150 200 Time (seconds) Ackermann 20 40 60 80 100 120 Time (seconds)

Music Structure Analysis General goal: Divide an audio recording into temporal segments corresponding to musical parts and group these segments into musically meaningful categories. Examples: Stanzas of a folk song Intro, verse, chorus, bridge, outro sections of a pop song Exposition, development, recapitulation, coda of a sonata Musical form ABACADA of a rondo

Music Structure Analysis General goal: Divide an audio recording into temporal segments corresponding to musical parts and group these segments into musically meaningful categories. Challenge: There are many different principles for creating relationships that form the basis for the musical structure. Homogeneity: Novelty: Repetition: Consistency in tempo, instrumentation, key, Sudden changes, surprising elements Repeating themes, motives, rhythmic patterns,

Music Structure Analysis Novelty Homogeneity Repetition

Overview Introduction Feature Representations Self-Similarity Matrices Audio Thumbnailing Novelty-based Segmentation Thanks: Clausen, Ewert, Kurth, Grohganz, Dannenberg, Goto Grosche, Jiang Paulus, Klapuri Peeters, Kaiser, Serra, Gómez, Smith, Fujinaga, Wiering, Wand, Sunkel, Jansen

Overview Introduction Feature Representations Self-Similarity Matrices Audio Thumbnailing Novelty-based Segmentation Thanks: Clausen, Ewert, Kurth, Grohganz, Dannenberg, Goto Grosche, Jiang Paulus, Klapuri Peeters, Kaiser, Serra, Gómez, Smith, Fujinaga, Wiering, Wand, Sunkel, Jansen

Feature Representation General goal: Convert an audio recording into a mid-level representation that captures certain musical properties while supressing other properties. Timbre / Instrumentation Tempo / Rhythm Pitch / Harmony

Feature Representation General goal: Convert an audio recording into a mid-level representation that captures certain musical properties while supressing other properties. Timbre / Instrumentation Tempo / Rhythm Pitch / Harmony

Feature Representation Example: Chromatic scale C1 24 C2 36 C3 48 C4 60 C5 72 C6 84 C7 96 C8 108 Waveform Amplitude Time (seconds)

Feature Representation Frequency (Hz) Frequency (Hz) Intensity (db) Intensity (db) Example: Chromatic scale C1 24 Spectrogram C2 36 C3 48 C4 60 C5 72 Time (seconds) C6 84 C7 96 C8 108

Feature Representation Frequency (Hz) Frequency (Hz) Intensity (db) Intensity (db) Example: Chromatic scale C1 24 Spectrogram C2 36 C3 48 C4 60 C5 72 Time (seconds) C6 84 C7 96 C8 108

Feature Representation Example: Chromatic scale C1 24 Spectrogram C2 36 C3 48 C4 60 C5 72 C6 84 C7 96 C8 108 C8: 4186 Hz C7: 2093 Hz C6: 1046 Hz C5: 523 Hz C4: 261 Hz C3: 131 Hz Time (seconds) Intensity (db)

Feature Representation Example: Chromatic scale C1 24 Log-frequency spectrogram C2 36 C3 48 C4 60 C5 72 C6 84 C7 96 C8 108 C8: 4186 Hz C7: 2093 Hz C6: 1046 Hz C5: 523 Hz C4: 261 Hz C3: 131 Hz Intensity (db) Time (seconds)

Feature Representation Example: Chromatic scale C1 24 Log-frequency spectrogram C2 36 C3 48 C4 60 C5 72 C6 84 C7 96 C8 108 Pitch (MIDI note number) Intensity (db) Time (seconds)

Feature Representation Example: Chromatic scale C1 24 Log-frequency spectrogram C2 36 C3 48 C4 60 C5 72 C6 84 C7 96 C8 108 Pitch (MIDI note number) Intensity (db) Chroma C Time (seconds)

Feature Representation Example: Chromatic scale C1 24 Log-frequency spectrogram C2 36 C3 48 C4 60 C5 72 C6 84 C7 96 C8 108 Pitch (MIDI note number) Intensity (db) Chroma C # Time (seconds)

Feature Representation Example: Chromatic scale C1 24 C2 36 C3 48 C4 60 C5 72 C6 84 C7 96 C8 108 Chroma representation Chroma Intensity (db) Time (seconds)

Feature Representation Example: Brahms Hungarian Dance No. 5 (Ormandy) A1 A2 B1 B2 C A3 B3 B4 Time (seconds)

Feature Representation Example: Brahms Hungarian Dance No. 5 (Ormandy) Chroma (Harmony) Feature extraction A1 A2 B1 B2 C A3 B3 B4 Time (seconds)

Feature Representation Example: Brahms Hungarian Dance No. 5 (Ormandy) B b G D Chroma (Harmony) G minor Feature extraction G minor A1 A2 B1 B2 C A3 B3 B4 Time (seconds)

Feature Representation Example: Brahms Hungarian Dance No. 5 (Ormandy) B G B b G Chroma (Harmony) Feature extraction D D G minor G major G minor A1 A2 B1 B2 C A3 B3 B4 Time (seconds)

Overview Introduction Feature Representations Self-Similarity Matrices Audio Thumbnailing Novelty-based Segmentation

Self-Similarity Matrix (SSM) General idea: Compare each element of the feature sequence with each other element of the feature sequence based on a suitable similarity measure. Quadratic self-similarity matrix

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy) G major G major

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy) Faster Slower

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy) Faster Slower

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy) Idealized SSM

Self-Similarity Matrix (SSM) Example: Brahms Hungarian Dance No. 5 (Ormandy) Idealized SSM Blocks: Homogeneity Paths: Repetition Corners: Novelty

SSM Enhancement Block Enhancement Feature smoothing Coarsening Time (samples) Time (samples)

SSM Enhancement Block Enhancement Feature smoothing Coarsening Time (samples) Time (samples)

SSM Enhancement Block Enhancement Feature smoothing Coarsening Time (samples) Time (samples)

SSM Enhancement Challenge: Presence of musical variations Fragmented paths and gaps Paths of poor quality Regions of constant (high) similarity Curved paths Idea: Enhancement of path structure

SSM Enhancement Shostakovich Waltz 2, Jazz Suite No. 2 (Chailly) SSM

SSM Enhancement Shostakovich Waltz 2, Jazz Suite No. 2 (Chailly) SSM

SSM Enhancement Shostakovich Waltz 2, Jazz Suite No. 2 (Chailly) SSM

SSM Enhancement Shostakovich Waltz 2, Jazz Suite No. 2 (Chailly) Enhanced SSM Filtering along main diagonal

SSM Enhancement Idea: Usage of contextual information (Foote 1999) Comparison of entire sequences = length of sequences = enhanced SSM smoothing effect

SSM Enhancement SSM

SSM Enhancement Enhanced SSM with Filtering along main diagonal

SSM Enhancement Enhanced SSM with Filtering along 8 different directions and minimizing

SSM Enhancement Idea: Smoothing along various directions and minimizing over all directions Tempo changes of -50 to +50 percent

SSM Enhancement Path Enhancement Time (samples) Time (samples)

SSM Enhancement Path Enhancement Diagonal smoothing Time (samples) Time (samples)

SSM Enhancement Path Enhancement Diagonal smoothing Multiple filtering Time (samples) Time (samples)

SSM Enhancement Path Enhancement Diagonal smoothing Multiple filtering Thresholding (relative) Scaling & penalty Time (samples) Time (samples)

SSM Enhancement Further Processing Path extraction Time (samples) Time (samples)

SSM Enhancement Further Processing Path extraction Pairwise relations Time (samples) 1 2 3 4 5 6 7 100 200 300 400 Time (samples) Time (samples)

SSM Enhancement Further Processing Path extraction Pairwise relations Grouping (transitivity) Time (samples) 1 2 3 4 5 6 7 100 200 300 400 Time (samples) Time (samples)

SSM Enhancement Further Processing Path extraction Pairwise relations Grouping (transitivity) Time (samples) 1 2 3 4 5 6 7 100 200 300 400 Time (samples) Time (samples) 100 200 300 400 Time (samples)

SSM Enhancement Example: Zager & Evans In The Year 2525 I V1 V2 V3 V4 V5 V6 V7 B V8 O

SSM Enhancement Example: Zager & Evans In The Year 2525

SSM Enhancement Example: Zager & Evans In The Year 2525 Missing relations because of transposed sections

SSM Enhancement Example: Zager & Evans In The Year 2525 Idea: Cyclic shift of one of the chroma sequences One semitone up

SSM Enhancement Example: Zager & Evans In The Year 2525 Idea: Cyclic shift of one of the chroma sequences Two semitones up

SSM Enhancement Example: Zager & Evans In The Year 2525 Idea: Overlay & Maximize Transposition-invariant SSM

SSM Enhancement Example: Zager & Evans In The Year 2525 Note: Order of enhancement steps important! Maximization Smoothing & Maximization

Similarity Matrix Toolbox Meinard Müller, Nanzhu Jiang, Harald Grohganz SM Toolbox: MATLAB Implementations for Computing and Enhancing Similarity Matrices http://www.audiolabs-erlangen.de/resources/mir/smtoolbox/

Overview Introduction Feature Representations Self-Similarity Matrices Audio Thumbnailing Thanks: Jiang, Grosche Peeters Cooper, Foote Goto Levy, Sandler Mauch Sapp Novelty-based Segmentation

Audio Thumbnailing General goal: Determine the most representative section ( Thumbnail ) of a given music recording. Example: Zager & Evans In The Year 2525 I V1 V2 V3 V4 V5 V6 V7 B V8 O Example: Brahms Hungarian Dance No. 5 (Ormandy) A1 A2 B1 B2 C A3 B3 B4 Thumbnail is often assumed to be the most repetitive segment

Audio Thumbnailing Two steps 1. Path extraction Both steps are problematic! Paths of poor quality (fragmented, gaps) Block-like structures Curved paths 2. Grouping Noisy relations (missing, distorted, overlapping) Transitivity computation difficult Main idea: Do both, path extraction and grouping, jointly One optimization scheme for both steps Stabilizing effect Efficient

Audio Thumbnailing Main idea: Do both path extraction and grouping jointly For each audio segment we define a fitness value This fitness value expresses how well the segment explains the entire audio recording The segment with the highest fitness value is considered to be the thumbnail As main technical concept we introduce the notion of a path family

Fitness Measure 200 180 160 140 120 100 80 60 40 20 0 0 50 100 150 200 1 0.5 0 0.5 1 1.5 2 Enhanced SSM

Fitness Measure Path over segment Consider a fixed segment

Fitness Measure Path over segment Consider a fixed segment Path over segment Induced segment Score is high

Fitness Measure Path over segment Consider a fixed segment Path over segment Induced segment Score is high A second path over segment Induced segment Score is not so high

Fitness Measure Path over segment Consider a fixed segment Path over segment Induced segment Score is high A second path over segment Induced segment Score is not so high A third path over segment Induced segment Score is very low

Fitness Measure Path family Consider a fixed segment A path family over a segment is a family of paths such that the induced segments do not overlap.

Fitness Measure Path family Consider a fixed segment A path family over a segment is a family of paths such that the induced segments do not overlap. This is not a path family!

Fitness Measure Path family Consider a fixed segment A path family over a segment is a family of paths such that the induced segments do not overlap. This is a path family! (Even though not a good one)

Fitness Measure Optimal path family Consider a fixed segment

Fitness Measure Optimal path family Consider a fixed segment Consider over the segment the optimal path family, i.e., the path family having maximal overall score. Call this value: Score(segment) Note: This optimal path family can be computed using dynamic programming.

Fitness Measure Optimal path family Consider a fixed segment Consider over the segment the optimal path family, i.e., the path family having maximal overall score. Call this value: Score(segment) Furthermore consider the amount covered by the induced segments. Call this value: Coverage(segment)

Fitness Measure Fitness Consider a fixed segment P := R := Score(segment) Coverage(segment)

Fitness Measure Fitness Consider a fixed segment Self-explanation are trivial! P := R := Score(segment) Coverage(segment)

Fitness Measure Fitness Consider a fixed segment Self-explanation are trivial! Subtract length of segment P := R := Score(segment) Coverage(segment) - length(segment) - length(segment)

Fitness Measure Fitness Consider a fixed segment Self-explanation are trivial! Subtract length of segment Normalization P := Normalize( Score(segment) - length(segment) ) R := Normalize( Coverage(segment) - length(segment) ) [0,1] [0,1]

Fitness Measure Fitness Consider a fixed segment Fitness(segment) F := 2 P R / (P + R) P := Normalize( Score(segment) - length(segment) ) R := Normalize( Coverage(segment) - length(segment) ) [0,1] [0,1]

Thumbnail Fitness Scape Plot Fitness Segment length Segment length Segment center Segment center

Thumbnail Fitness Scape Plot Fitness Segment length Fitness(segment) Segment length Segment center Segment center

Thumbnail Fitness Scape Plot Fitness Segment length Segment center

Thumbnail Fitness Scape Plot Fitness Segment length Segment center Note: Self-explanations are ignored fitness is zero

Thumbnail Fitness Scape Plot Fitness Segment length Segment center Thumbnail := segment having the highest fitness

Thumbnail Fitness Scape Plot Fitness A1 A2 B1 B2 C A3 B3 B4 Example: Brahms Hungarian Dance No. 5 (Ormandy)

Thumbnail Fitness Scape Plot Fitness A1 A2 B1 B2 C A3 B3 B4 Example: Brahms Hungarian Dance No. 5 (Ormandy)

Thumbnail Fitness Scape Plot Fitness A1 A2 B1 B2 C A3 B3 B4 Example: Brahms Hungarian Dance No. 5 (Ormandy)

Thumbnail Fitness Scape Plot Fitness A1 A2 B1 B2 C A3 B3 B4 Example: Brahms Hungarian Dance No. 5 (Ormandy)

Scape Plot Example: Brahms Hungarian Dance No. 5 (Ormandy)

Scape Plot Coloring according to clustering result (grouping) Example: Brahms Hungarian Dance No. 5 (Ormandy)

Scape Plot Coloring according to clustering result (grouping) A1 A2 B1 B2 C A3 B3 B4 Example: Brahms Hungarian Dance No. 5 (Ormandy)

Thumbnail Fitness Scape Plot Fitness I V1 V2 V3 V4 V5 V6 V7 B V8 O Example: Zager & Evans In The Year 2525

Thumbnail Fitness Scape Plot Fitness I V1 V2 V3 V4 V5 V6 V7 B V8 O Example: Zager & Evans In The Year 2525

Overview Introduction Feature Representations Self-Similarity Matrices Thanks: Foote Serra, Grosche, Arcos Goto Tzanetakis, Cook Audio Thumbnailing Novelty-based Segmentation

Novelty-based Segmentation General goals: Find instances where musical changes occur. Find transition between subsequent musical parts. Idea (Foote): Use checkerboard-like kernel function to detect corner points on main diagonal of SSM.

Novelty-based Segmentation Idea (Foote): Use checkerboard-like kernel function to detect corner points on main diagonal of SSM.

Novelty-based Segmentation Idea (Foote): Use checkerboard-like kernel function to detect corner points on main diagonal of SSM.

Novelty-based Segmentation Idea (Foote): Use checkerboard-like kernel function to detect corner points on main diagonal of SSM.

Novelty-based Segmentation Idea (Foote): Use checkerboard-like kernel function to detect corner points on main diagonal of SSM.

Novelty-based Segmentation Idea (Foote): Use checkerboard-like kernel function to detect corner points on main diagonal of SSM. Novelty function using

Novelty-based Segmentation Idea (Foote): Use checkerboard-like kernel function to detect corner points on main diagonal of SSM. Novelty function using Novelty function using

Novelty-based Segmentation Idea: Find instances where structural changes occur. Structure features Combine global and local aspects within a unifying framework

Novelty-based Segmentation Structure features Enhanced SSM

Novelty-based Segmentation Structure features Enhanced SSM Time-lag SSM

Novelty-based Segmentation Structure features Enhanced SSM Time-lag SSM Cyclic time-lag SSM

Novelty-based Segmentation Structure features Enhanced SSM Time-lag SSM Cyclic time-lag SSM Columns as features

Novelty-based Segmentation Example: Chopin Mazurka Op. 24, No. 1 SSM Time-lag SSM

Novelty-based Segmentation Example: Chopin Mazurka Op. 24, No. 1 SSM Time-lag SSM

Novelty-based Segmentation Example: Chopin Mazurka Op. 24, No. 1 SSM Time-lag SSM

Novelty-based Segmentation Example: Chopin Mazurka Op. 24, No. 1 SSM Time-lag SSM Structure-based novelty function

Conclusions Structure Analysis

Conclusions Score Audio MIDI Representations Structure Analysis

Conclusions Score Audio MIDI Representations Harmony Musical Aspects Timbre Structure Analysis Tempo

Conclusions Score Audio MIDI Representations Musical Aspects Structure Analysis Segmentation Principles Harmony Timbre Tempo Repetition Homogeneity Novelty

Conclusions Temporal and Hierarchical Context Score Audio MIDI Representations Musical Aspects Structure Analysis Segmentation Principles Harmony Timbre Tempo Repetition Homogeneity Novelty

Conclusions Combined Approaches Hierarchical Approaches Evaluation Explaining Structure MIREX SALAMI-Project Smith, Chew

Links SM Toolbox (MATLAB) http://www.audiolabs-erlangen.de/resources/mir/smtoolbox/ MSAF: Music Structure Analysis Framework (Python) https://github.com/urinieto/msaf SALAMI Annotation Data http://ddmal.music.mcgill.ca/research/salami/annotations LibROSA (Python) https://librosa.github.io/librosa/ Evaluation: mir_eval (Python) https://craffel.github.io/mir_eval/ Deep Learning: Boundary Detection Jan Schlüter (PhD thesis)