Audio Structure Analysis

Tutorial T3: A Basic Introduction to Audio-Related Music Information Retrieval. Audio Structure Analysis. Meinard Müller, Christof Weiß. International Audio Laboratories Erlangen. meinard.mueller@audiolabs-erlangen.de, christof.weiss@audiolabs-erlangen.de

Book: Meinard Müller, Fundamentals of Music Processing: Audio, Analysis, Algorithms, Applications. Springer, 2015. 483 p., 249 illus., hardcover. ISBN: 978-3-319-21944-8. Accompanying website: www.music-processing.de

Chapter 4: Music Structure Analysis. Sections: 4.1 General Principles; 4.2 Self-Similarity Matrices; 4.3 Audio Thumbnailing; 4.4 Novelty-Based Segmentation; 4.5 Evaluation; 4.6 Further Notes.

In Chapter 4, we address a central and well-researched area within MIR known as music structure analysis. Given a music recording, the objective is to identify important structural elements and to temporally segment the recording according to these elements. Within this scenario, we discuss fundamental segmentation principles based on repetition, homogeneity, and novelty; these principles also apply to other types of multimedia beyond music. As an important technical tool, we study in detail the concept of self-similarity matrices and discuss their structural properties. Finally, we briefly touch on the topic of evaluation, introducing the notions of precision, recall, and F-measure.
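The evaluation notions named above can be illustrated for boundary detection. The following is a minimal sketch, not the book's procedure: the function name `boundary_prf`, the 0.5-second tolerance, the greedy matching, and the boundary times are all assumptions made for illustration. Established toolkits such as mir_eval use a more careful matching.

```python
def boundary_prf(reference, estimated, tolerance=0.5):
    """Precision, recall, and F-measure for boundary times (in seconds).

    An estimated boundary counts as a hit if it lies within `tolerance`
    seconds of a reference boundary that has not been matched yet.
    Matching is greedy, which is a simplification.
    """
    unmatched = sorted(reference)
    hits = 0
    for est in sorted(estimated):
        # Closest reference boundary that is still unmatched (None if none left).
        best = min(unmatched, key=lambda ref: abs(ref - est), default=None)
        if best is not None and abs(best - est) <= tolerance:
            unmatched.remove(best)
            hits += 1
    precision = hits / len(estimated) if estimated else 0.0
    recall = hits / len(reference) if reference else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Hypothetical annotated and detected boundaries, in seconds.
reference = [10.0, 20.0, 30.0, 40.0]
estimated = [10.2, 19.0, 30.4, 35.0, 50.0]
precision, recall, f_measure = boundary_prf(reference, estimated)
```

Precision penalizes spurious boundaries, recall penalizes missed ones, and the F-measure is their harmonic mean.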

Music Structure Analysis Example: Zager & Evans, "In The Year 2525". Segments: I V1 V2 V3 V4 V5 V6 V7 B V8 O. [Figure: time axis in seconds.]

Music Structure Analysis Example: Brahms, Hungarian Dance No. 5 (Ormandy). Segments: A1 A2 B1 B2 C A3 B3 B4. [Figure: time axis in seconds.]

Music Structure Analysis Example: Folk Song Field Recording (Nederlandse Liederenbank). [Figure: time axis in seconds.]

Music Structure Analysis Example: Weber, Song (No. 4) from Der Freischütz. Parts: Introduction, Stanzas, Dialogues. [Figure: two recordings, Kleiber (time axis 0 to 200 seconds) and Ackermann (time axis 20 to 120 seconds).]

Music Structure Analysis. General goal: Divide an audio recording into temporal segments corresponding to musical parts and group these segments into musically meaningful categories. Examples: stanzas of a folk song; intro, verse, chorus, bridge, and outro sections of a pop song; exposition, development, recapitulation, and coda of a sonata; musical form ABACADA of a rondo.

Music Structure Analysis. Challenge: There are many different principles for creating relationships that form the basis for the musical structure. Homogeneity: consistency in tempo, instrumentation, key, ... Novelty: sudden changes, surprising elements, ... Repetition: repeating themes, motives, rhythmic patterns, ...

Overview: Introduction; Feature Representations; Self-Similarity Matrices; Novelty-Based Segmentation.

Thanks: Clausen, Ewert, Kurth, Grohganz, Dannenberg, Goto, Grosche, Jiang, Paulus, Klapuri, Peeters, Kaiser, Serra, Gómez, Smith, Fujinaga, Wiering, Wand, Sunkel, Jansen.

Feature Representation. General goal: Convert an audio recording into a mid-level representation that captures certain musical properties while suppressing other properties. Typical musical properties: timbre / instrumentation; tempo / rhythm; pitch / harmony.

Feature Representation Example: Brahms, Hungarian Dance No. 5 (Ormandy). Feature extraction: chroma (harmony). Highlighted chroma bands: G, B♭, D (G minor) and G, B, D (G major). Key regions along the time axis: G minor, G major, G minor. Segments: A1 A2 B1 B2 C A3 B3 B4. [Figure: time axis in seconds.]
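How a chroma feature captures harmony can be sketched with NumPy alone. The example below is an illustrative assumption rather than the tutorial's implementation: it pools the power spectrum of a single frame into 12 pitch classes, and the function name `chroma_from_spectrum`, the frequency range, and the synthetic G minor triad are all made up for the demonstration. Libraries such as LibROSA provide full chroma extractors.

```python
import numpy as np

PITCH_CLASSES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def chroma_from_spectrum(magnitude, sample_rate, n_fft, f_min=55.0, f_max=2000.0):
    """Pool an FFT magnitude spectrum into 12 pitch classes (C=0, ..., B=11)."""
    freqs = np.arange(len(magnitude)) * sample_rate / n_fft
    valid = (freqs >= f_min) & (freqs <= f_max)
    # Nearest MIDI pitch for each bin: 69 corresponds to A4 = 440 Hz.
    midi = np.round(69 + 12 * np.log2(freqs[valid] / 440.0)).astype(int)
    chroma = np.zeros(12)
    # Accumulate spectral power per pitch class (octaves are folded together).
    np.add.at(chroma, midi % 12, magnitude[valid] ** 2)
    norm = np.linalg.norm(chroma)
    return chroma / norm if norm > 0 else chroma


# Synthetic G minor triad (G3, B-flat 3, D4) as one analysis frame.
sample_rate, n_fft = 8000, 8192
t = np.arange(n_fft) / sample_rate
signal = sum(np.sin(2 * np.pi * f * t) for f in (196.00, 233.08, 293.66))
spectrum = np.abs(np.fft.rfft(signal * np.hanning(n_fft)))
chroma = chroma_from_spectrum(spectrum, sample_rate, n_fft)
strongest = sorted(PITCH_CLASSES[i] for i in np.argsort(chroma)[-3:])
```

In this toy frame the three strongest chroma bands are G, A# (B♭), and D, which mirrors the bands highlighted for the G minor parts of the Brahms example.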

Self-Similarity Matrix (SSM). General idea: Compare each element of the feature sequence with every other element of the feature sequence using a suitable similarity measure. The result is a square self-similarity matrix.
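As a minimal sketch of this idea: for a feature sequence x_1, ..., x_N, the matrix entry S(n, m) holds the similarity of x_n and x_m. The code below assumes cosine similarity as the measure, which is one common choice but is not stated on the slide. The function name and the toy sequence with form A B A are illustrative.

```python
import numpy as np


def self_similarity_matrix(features):
    """Cosine-similarity SSM for features of shape (dims, frames)."""
    norms = np.linalg.norm(features, axis=0, keepdims=True)
    unit = features / np.maximum(norms, 1e-12)  # guard against all-zero frames
    return unit.T @ unit                        # S[n, m] = <x_n, x_m>


# Toy feature sequence with musical form A B A (two frames per part).
a = np.array([1.0, 0.0, 0.0])
b = np.array([0.0, 1.0, 0.0])
X = np.stack([a, a, b, b, a, a], axis=1)
S = self_similarity_matrix(X)
```

With normalized features the matrix is symmetric, its main diagonal is 1, and the repeated A part shows up as high similarity away from the diagonal, for example at S[0, 4].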

Self-Similarity Matrix (SSM) Example: Brahms, Hungarian Dance No. 5 (Ormandy). [Figure: SSM of the recording, with annotations marking the G major part and repetitions that are played faster or slower.]

Self-Similarity Matrix (SSM) Example: Brahms, Hungarian Dance No. 5 (Ormandy). Idealized SSM. Blocks: homogeneity. Paths: repetition. Corners: novelty.

SSM Enhancement. Challenge: presence of musical variations; fragmented paths and gaps; paths of poor quality; regions of constant (low) cost; curved paths. Idea: enhancement of path structure.

SSM Enhancement: Path Enhancement. Steps: diagonal smoothing; multiple filtering; thresholding (relative); scaling and penalty. [Figures: SSM after each step, time (samples) on both axes.]
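Two of these steps, diagonal smoothing and relative thresholding, can be sketched as follows. This is a simplified illustration under stated assumptions: the function names, the forward averaging window, and the quantile-based cutoff are choices made here, and the multiple-filtering and scaling/penalty steps are omitted.

```python
import numpy as np


def smooth_diagonal(S, length):
    """Average S over `length` frames along the main-diagonal direction."""
    n = S.shape[0]
    padded = np.zeros((n + length - 1, n + length - 1))
    padded[:n, :n] = S  # zero padding beyond the matrix border
    out = np.zeros((n, n))
    for k in range(length):
        out += padded[k:k + n, k:k + n]  # adds S[i + k, j + k] to out[i, j]
    return out / length


def threshold_relative(S, keep_fraction):
    """Keep roughly the largest `keep_fraction` of entries; zero out the rest."""
    cutoff = np.quantile(S, 1.0 - keep_fraction)
    return np.where(S >= cutoff, S, 0.0)


# Toy SSM: a clean diagonal path plus one isolated spurious entry.
S = np.eye(6)
S[0, 3] = 1.0
E = smooth_diagonal(S, length=3)
T = threshold_relative(E, keep_fraction=0.1)
```

Smoothing along the diagonal keeps entries that belong to a path and attenuates isolated high values, so the subsequent threshold can suppress the spurious entry while retaining the path.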

SSM Enhancement: Further Processing. Steps: path extraction; pairwise relations; grouping (transitivity). [Figures: extracted paths and the derived relations, numbered 1 to 7, over time (samples).]
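The grouping step can be sketched as a transitive closure over pairwise relations: if segment a repeats b and b repeats c, all three end up in one group. The union-find approach, the function name, and the relation list below are assumptions for illustration; the segment indices 0 to 7 stand for the Brahms labels A1 A2 B1 B2 C A3 B3 B4.

```python
def group_by_transitivity(num_segments, pairwise_relations):
    """Merge segments connected by pairwise relations into groups."""
    parent = list(range(num_segments))

    def find(i):
        # Follow parent links to the group representative (with path halving).
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for a, b in pairwise_relations:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = {}
    for i in range(num_segments):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Hypothetical relations found via path extraction:
# A1~A2, A2~A3, B1~B2, B2~B3, B3~B4 (C has no repetition).
relations = [(0, 1), (1, 5), (2, 3), (3, 6), (6, 7)]
groups = group_by_transitivity(8, relations)
```

Note that A1 and A3 are never related directly in this list; they land in the same group only through A2, which is the point of the transitivity step.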

SSM Enhancement Example: Zager & Evans, "In The Year 2525". Segments: I V1 V2 V3 V4 V5 V6 V7 B V8 O. Problem: missing relations because of transposed sections. Idea: cyclic shift of one of the chroma sequences (one semitone up, two semitones up, ...). Idea: overlay and maximize, which yields a transposition-invariant SSM.
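The cyclic-shift and overlay-and-maximize idea can be sketched like this, assuming cosine similarity and 12-dimensional chroma vectors. The function names and the one-hot toy sequence are illustrative assumptions, not the tutorial's code.

```python
import numpy as np


def cosine_ssm(X, Y):
    """Cosine similarity between all columns of X and all columns of Y."""
    Xn = X / np.maximum(np.linalg.norm(X, axis=0, keepdims=True), 1e-12)
    Yn = Y / np.maximum(np.linalg.norm(Y, axis=0, keepdims=True), 1e-12)
    return Xn.T @ Yn


def transposition_invariant_ssm(chroma):
    """Maximize over the 12 cyclic chroma shifts; also return the best shift."""
    candidates = np.stack([
        cosine_ssm(chroma, np.roll(chroma, shift, axis=0))  # shift semitones up
        for shift in range(12)
    ])
    return candidates.max(axis=0), candidates.argmax(axis=0)


# Toy sequence: two frames on pitch class C, then the same material
# transposed one semitone up (pitch class C#).
chroma = np.zeros((12, 4))
chroma[0, :2] = 1.0
chroma[1, 2:] = 1.0
S_plain = cosine_ssm(chroma, chroma)
S_ti, best_shift = transposition_invariant_ssm(chroma)
```

In the plain SSM the transposed frames do not match the originals, while the maximized matrix relates them and the shift index records which transposition produced the match.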

Novelty-Based Segmentation. General goals: find instances where musical changes occur; find transitions between subsequent musical parts. Idea (Foote): use a checkerboard-like kernel function to detect corner points on the main diagonal of the SSM. [Figures: kernel moved along the main diagonal; two panels labelled "Novelty function using", with the kernel illustrations missing.]
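A minimal sketch of the checkerboard-kernel idea follows. It uses a plain box kernel and zero padding, both of which are simplifying assumptions made here (a Gaussian taper on the kernel is a common refinement), and the function names and block-structured toy SSM are illustrative.

```python
import numpy as np


def checkerboard_kernel(half_size):
    """Kernel with +1 in the two diagonal quadrants and -1 in the other two."""
    axis = np.arange(-half_size, half_size + 1)
    sign = np.sign(axis)          # -1 before the center, 0 at it, +1 after
    return np.outer(sign, sign)


def novelty_function(S, half_size):
    """Correlate the kernel with S along the main diagonal."""
    n = S.shape[0]
    kernel = checkerboard_kernel(half_size)
    padded = np.pad(S, half_size, mode="constant")  # zero padding at the borders
    width = 2 * half_size + 1
    novelty = np.zeros(n)
    for i in range(n):
        window = padded[i:i + width, i:i + width]   # window centered at (i, i)
        novelty[i] = np.sum(window * kernel)
    return novelty


# Toy SSM with two homogeneous segments of 10 frames each.
labels = np.array([0] * 10 + [1] * 10)
S = (labels[:, None] == labels[None, :]).astype(float)
novelty = novelty_function(S, half_size=4)
boundary_frame = int(np.argmax(novelty))
```

The novelty value is zero inside a homogeneous block and peaks where the kernel's center sits on the corner between two blocks. The zero padding produces smaller spurious values near the first and last frames, and the kernel size controls whether local or global changes are emphasized.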

Conclusions. Structure analysis depends on: representations (score, audio, MIDI); musical aspects (harmony, timbre, tempo); segmentation principles (repetition, homogeneity, novelty); temporal and hierarchical context.

Links
SM Toolbox (MATLAB): http://www.audiolabs-erlangen.de/resources/mir/smtoolbox/
MSAF: Music Structure Analysis Framework (Python): https://github.com/urinieto/msaf
SALAMI Annotation Data: http://ddmal.music.mcgill.ca/research/salami/annotations
LibROSA (Python): https://librosa.github.io/librosa/
Evaluation: mir_eval (Python): https://craffel.github.io/mir_eval/
Deep Learning: Boundary Detection, Jan Schlüter (PhD thesis)