Audio Structure Analysis

Similar documents
Audio Structure Analysis

Music Structure Analysis

Audio Structure Analysis

CS 591 S1 Computational Audio

Music Structure Analysis

Music Structure Analysis

Music Synchronization. Music Synchronization. Music Data. Music Data. General Goals. Music Information Retrieval (MIR)

Music Information Retrieval (MIR)

Tempo and Beat Analysis

Music Information Retrieval (MIR)

Methods for the automatic structural analysis of music. Jordan B. L. Smith CIRMMT Workshop on Structural Analysis of Music 26 March 2010

Informed Feature Representations for Music and Motion

AUTOMATED METHODS FOR ANALYZING MUSIC RECORDINGS IN SONATA FORM

Chord Recognition. Aspects of Music. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Music Processing.

Tempo and Beat Tracking

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Music Processing Audio Retrieval Meinard Müller

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

Grouping Recorded Music by Structural Similarity Juan Pablo Bello New York University ISMIR 09, Kobe October 2009 marl music and audio research lab

The song remains the same: identifying versions of the same piece using tonal descriptors

Music Representations

A MID-LEVEL REPRESENTATION FOR CAPTURING DOMINANT TEMPO AND PULSE INFORMATION IN MUSIC RECORDINGS

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

AUTOMATIC MAPPING OF SCANNED SHEET MUSIC TO AUDIO RECORDINGS

AUDIO MATCHING VIA CHROMA-BASED STATISTICAL FEATURES

Shades of Music. Projektarbeit

JOINT STRUCTURE ANALYSIS WITH APPLICATIONS TO MUSIC ANNOTATION AND SYNCHRONIZATION

SHEET MUSIC-AUDIO IDENTIFICATION

MUSI-6201 Computational Music Analysis

New Developments in Music Information Retrieval

Meinard Müller. Beethoven, Bach, und Billionen Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

TOWARDS AUTOMATED EXTRACTION OF TEMPO PARAMETERS FROM EXPRESSIVE MUSIC RECORDINGS

ONE main goal of content-based music analysis and retrieval

GROUPING RECORDED MUSIC BY STRUCTURAL SIMILARITY

Music Segmentation Using Markov Chain Methods

Automatic Extraction of Popular Music Ringtones Based on Music Structure Analysis

Music Similarity and Cover Song Identification: The Case of Jazz

Towards Automated Processing of Folk Song Recordings

Music Processing Introduction Meinard Müller

MODELS of music begin with a representation of the

Chord Classification of an Audio Signal using Artificial Neural Network

Music Information Retrieval

USING MUSICAL STRUCTURE TO ENHANCE AUTOMATIC CHORD TRANSCRIPTION

Automatic Piano Music Transcription

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

The Effect of DJs Social Network on Music Popularity

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Music Alignment and Applications. Introduction

MUSIC is a ubiquitous and vital part of the lives of billions

A repetition-based framework for lyric alignment in popular songs

Analysing Musical Pieces Using harmony-analyser.org Tools

Subjective Similarity of Music: Data Collection for Individuality Analysis

Citation for published version (APA): Jensen, K. K. (2005). A Causal Rhythm Grouping. Lecture Notes in Computer Science, 3310,

CHAPTER 6. Music Retrieval by Melody Style

CS229 Project Report Polyphonic Piano Transcription

LEVELS IN NATIONAL CURRICULUM MUSIC

LEVELS IN NATIONAL CURRICULUM MUSIC

Popular Song Summarization Using Chorus Section Detection from Audio Signal

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

Fisk Street Primary School Curriculum. The Arts. Music

Curriculum Standard One: The student will listen to and analyze music critically, using the vocabulary and language of music.

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

ILLINOIS LICENSURE TESTING SYSTEM

EDEXCEL GCSE REVISION PACK INTRODUCTION

Lecture 12: Alignment and Matching

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Automatic Music Genre Classification

The Intervalgram: An Audio Feature for Large-scale Melody Recognition

AUTOMATIC ACCOMPANIMENT OF VOCAL MELODIES IN THE CONTEXT OF POPULAR MUSIC

Music theory revision (Grade 5 and 6). Topics: forms, melody, rhythm, metre. Glossary given to the students (All Units).

Deep learning for music data processing

A Study of Synchronization of Audio Data with Symbolic Data. Music254 Project Report Spring 2007 SongHui Chon

Aspects of Music. Chord Recognition. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Piece of music. Rhythm.

La Salle University MUS 150 Art of Listening Final Exam Name

Further Topics in MIR

PIANO SAFARI FOR THE OLDER STUDENT REPERTOIRE & TECHNIQUE BOOK 1

7. Components to Establish Time

Discovering Musical Structure in Audio Recordings

Toward Automatic Music Audio Summary Generation from Signal Analysis

GRADIENT-BASED MUSICAL FEATURE EXTRACTION BASED ON SCALE-INVARIANT FEATURE TRANSFORM

Effects of acoustic degradations on cover song recognition

A SCORE-INFORMED PIANO TUTORING SYSTEM WITH MISTAKE DETECTION AND SCORE SIMPLIFICATION

Towards Supervised Music Structure Annotation: A Case-based Fusion Approach.

ILLINOIS LICENSURE TESTING SYSTEM

Data-Driven Solo Voice Enhancement for Jazz Music Retrieval

Audio Content-Based Music Retrieval

Music Database Retrieval Based on Spectral Similarity

SINGING EXPRESSION TRANSFER FROM ONE VOICE TO ANOTHER FOR A GIVEN SONG. Sangeon Yong, Juhan Nam

Pitch Perception. Roger Shepard

FINE ARTS Institutional (ILO), Program (PLO), and Course (SLO) Alignment

Wipe Scene Change Detection in Video Sequences

Curriculum Overview Music Year 9

Computational Modelling of Harmony

School of Professional Studies

Query By Humming: Finding Songs in a Polyphonic Database

MHSIB.5 Composing and arranging music within specified guidelines a. Creates music incorporating expressive elements.

EIGENVECTOR-BASED RELATIONAL MOTIF DISCOVERY

Semantic Segmentation and Summarization of Music

Transcription:

Advanced Course Computer Science Music Processing Summer Term 2009 Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Music Structure Analysis Music segmentation pitch content (e.g., melody, harmony) music texture (e.g., timbre, instrumentation, sound) rhythm Detection of repeating sections, phrases, motives song structure (e.g., intro, versus, chorus) musical form (e.g., sonata, symphony, concerto) Detection of other hidden relationships 2 Given: CD recording Goal: Automatic extraction of the repetitive structure (or of the musical form) Example: Brahms Hungarian Dance No. 5 (Ormandy) Dannenberg/Hu (ISMIR 2002) Peeters/Burthe/Rodet (ISMIR 2002) Cooper/Foote (ISMIR 2002) Goto (ICASSP 2003) Chai/Vercoe (ACM Multimedia 2003) Lu/Wang/Zhang (ACM Multimedia 2004) Bartsch/Wakefield (IEEE Trans. Multimedia 2005) Goto (IEEE Trans. Audio 2006) Müller/Kurth (EURASIP 2007) Rhodes/Casey (ISMIR 2007) Peeters (ISMIR 2007) 3 4 Audio features Cost measure and cost matrix self-similarity matrix Path extraction (pairwise similarity of segments) Global structure (clustering, grouping) Audio = 12-dimensional normalized chroma vector Local cost measure cost matrix quadratic self-similarity matrix 5 6

7 8 9 10 11 12

13 14 Similarity cluster Challenge: Presence of musical variations Fragmented paths and gaps Paths of poor quality Regions of constant (low) cost Curved paths Idea: Enhancement of path structure 15 16 Shostakovich Waltz 2, Jazz Suite No. 2 (Chailly) Idea: Usage of contextual information (Foote 1999) Comparison of entire sequences length of sequences enhanced cost matrix smoothing effect 17 18

(Shostakovich) (Shostakovich) Cost matrix Enhanced cost matrix 19 20 (Brahms) (Brahms) Cost matrix Enhanced cost matrix Problem: Relative tempo differences are smoothed out 21 22 Idea: Smoothing along various directions and minimizing over all directions th direction of smoothing enhanced cost matrix w.r.t. Usage of eight slope values tempo changes of -30 to +40 percent 23 24

Cost matrix Cost matrix with Filtering along main diagonal 25 26 Cost matrix with Filtering along 8 different directions and minimizing Start with initial point Extend path in greedy fashion Remove path neighborhood 27 28 Cost matrix Enhanced cost matrix 29 30

Enhanced cost matrix Thresholded 31 32 Thresholded, upper left 33 Path removal 34 Path removal 35 Path removal 36

Extracted paths 37 Extracted paths after postprocessing 38 How can one derive the global structure from pairwise relations? 39 40 Taks: Computation of similarity clusters Path relations Problem: Missing and inconsistent path relations Strategy: Approximate transitive hull 41 42

Path relations Path relations 43 44 Path relations Path relations 45 46 Path relations Example: Zager & Evans In The Year 2525 Final result Ground truth 47 48

Goto (ICASSP 2003) Cyclically shift chroma vectors in one sequence Compare shifted sequence with original sequence Perform for each of the twelve shifts a separate structure analysis Combine the results Goto (ICASSP 2003) Cyclically shift chroma vectors in one sequence Compare shifted sequence with original sequence Perform for each of the twelve shifts a separate structure analysis Combine the results Müller/Clausen (ISMIR 2007) Integrate all cyclic information in one transposition-invariant self-similarity matrix Perform one joint structure analysis 49 50 Example: Zager & Evans In The Year 2525 Example: Zager & Evans In The Year 2525 Original: 51 Original: Shifted: 52 53 54

55 56 Minimize over all twelve matrices 57 Thresholded self-similarity matrix 58 Path extraction 59 Path extraction Computation of similarity clusters 60

Stabilizing effect Stabilizing effect (thresholded) 61 (thresholded) 62 Stabilizing effect Transposition-invariant matrix Minimizing shift index Transposition-invariant self-similarity matrix (thresholded) 63 64 Transposition-invariant matrix Minimizing shift index Transposition-invariant matrix Minimizing shift index = 0 65 66

Transposition-invariant matrix Minimizing shift index = 1 Transposition-invariant matrix Minimizing shift index = 2 67 68 Example: Beethoven Tempest Serra/Gomez (ICASSP 2008): Used for Cover Song ID Discrete structure suitable for indexing? 69 70 Example: Beethoven Tempest Conclusions: Challenge: Musical variations Timbre, dynamics, tempo Musical key cyclic chroma shifts Major/minor Differences at note level / improvisations Transposition-invariant self-similarity matrix 71 72

Conclusions: Strategy: Matrix enhancement Conclusions: Challenge: Hierarchical structure of music Filtering techniques / contextual information Cooper/Foote (ISMIR 2002) Müller/Kurth (ICASSP 2006) Transposition-invariant similarity matrices Goto (ICASSP 2003) Müller/Clausen (ISMIR 2007) Higher-order similarity matrices Peeters (ISMIR 2007) Rhodes/Casey (ISMIR 2007) 73 74 System: SmartMusicKiosk (Goto) System: SyncPlayer/AudioStructure 75 76