Music Synchronization. Music Synchronization. Music Data. Music Data. General Goals. Music Information Retrieval (MIR)

Similar documents
Music Information Retrieval (MIR)

Music Information Retrieval (MIR)

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Beethoven, Bach, and Billions of Bytes

Music Information Retrieval

Tempo and Beat Analysis

Audio Structure Analysis

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Tempo and Beat Tracking

Music Representations

Audio Structure Analysis

CS 591 S1 Computational Audio

Beethoven, Bach und Billionen Bytes

Meinard Müller. Beethoven, Bach, und Billionen Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Music Processing Audio Retrieval Meinard Müller

SHEET MUSIC-AUDIO IDENTIFICATION

Music Processing Introduction Meinard Müller

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

New Developments in Music Information Retrieval

Chord Recognition. Aspects of Music. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Music Processing.

Informed Feature Representations for Music and Motion

Further Topics in MIR

Music Representations

Refinement Strategies for Music Synchronization

A Multimodal Way of Experiencing and Exploring Music

A MID-LEVEL REPRESENTATION FOR CAPTURING DOMINANT TEMPO AND PULSE INFORMATION IN MUSIC RECORDINGS

TOWARDS AUTOMATED EXTRACTION OF TEMPO PARAMETERS FROM EXPRESSIVE MUSIC RECORDINGS

AUTOMATIC MAPPING OF SCANNED SHEET MUSIC TO AUDIO RECORDINGS

Music Similarity and Cover Song Identification: The Case of Jazz

Robert Alexandru Dobre, Cristian Negrescu

MUSI-6201 Computational Music Analysis

A SCORE-INFORMED PIANO TUTORING SYSTEM WITH MISTAKE DETECTION AND SCORE SIMPLIFICATION

Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification

Lecture 9 Source Separation

MUSIC is a ubiquitous and vital part of the lives of billions

ONE main goal of content-based music analysis and retrieval

Score-Informed Source Separation for Musical Audio Recordings: An Overview

Automatic music transcription

MAKE YOUR OWN ACCOMPANIMENT: ADAPTING FULL-MIX RECORDINGS TO MATCH SOLO-ONLY USER RECORDINGS

LEARNING AUDIO SHEET MUSIC CORRESPONDENCES. Matthias Dorfer Department of Computational Perception

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

JOINT STRUCTURE ANALYSIS WITH APPLICATIONS TO MUSIC ANNOTATION AND SYNCHRONIZATION

Music Structure Analysis

Audio Content-Based Music Retrieval

Searching for Similar Phrases in Music Audio

gresearch Focus Cognitive Sciences

Towards Automated Processing of Folk Song Recordings

Introductions to Music Information Retrieval

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

MAKE YOUR OWN ACCOMPANIMENT: ADAPTING FULL-MIX RECORDINGS TO MATCH SOLO-ONLY USER RECORDINGS

AUDIO MATCHING VIA CHROMA-BASED STATISTICAL FEATURES

AUTOREGRESSIVE MFCC MODELS FOR GENRE CLASSIFICATION IMPROVED BY HARMONIC-PERCUSSION SEPARATION

Video-based Vibrato Detection and Analysis for Polyphonic String Music

FREISCHÜTZ DIGITAL: A CASE STUDY FOR REFERENCE-BASED AUDIO SEGMENTATION OF OPERAS

A Study of Synchronization of Audio Data with Symbolic Data. Music254 Project Report Spring 2007 SongHui Chon

NI GSM/EDGE Toolkit Specifications

Polyphonic Audio Matching for Score Following and Intelligent Audio Editors

Data Driven Music Understanding

Soundprism: An Online System for Score-Informed Source Separation of Music Audio Zhiyao Duan, Student Member, IEEE, and Bryan Pardo, Member, IEEE

AUTOMASHUPPER: AN AUTOMATIC MULTI-SONG MASHUP SYSTEM

Effects of acoustic degradations on cover song recognition

Methods for the automatic structural analysis of music. Jordan B. L. Smith CIRMMT Workshop on Structural Analysis of Music 26 March 2010

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

Automatic Extraction of Popular Music Ringtones Based on Music Structure Analysis

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

Query By Humming: Finding Songs in a Polyphonic Database

ANALYZING MEASURE ANNOTATIONS FOR WESTERN CLASSICAL MUSIC RECORDINGS

Case Study Beatles Songs What can be Learned from Unreliable Music Alignments?

MATCH: A MUSIC ALIGNMENT TOOL CHEST

SCORE-INFORMED VOICE SEPARATION FOR PIANO RECORDINGS

SINGING PITCH EXTRACTION BY VOICE VIBRATO/TREMOLO ESTIMATION AND INSTRUMENT PARTIAL DELETION

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Music Structure Analysis

MUSICAL INSTRUMENT IDENTIFICATION BASED ON HARMONIC TEMPORAL TIMBRE FEATURES

TOWARDS AN EFFICIENT ALGORITHM FOR AUTOMATIC SCORE-TO-AUDIO SYNCHRONIZATION

FULL-AUTOMATIC DJ MIXING SYSTEM WITH OPTIMAL TEMPO ADJUSTMENT BASED ON MEASUREMENT FUNCTION OF USER DISCOMFORT

Automatic Piano Music Transcription

OBSERVED DIFFERENCES IN RHYTHM BETWEEN PERFORMANCES OF CLASSICAL AND JAZZ VIOLIN STUDENTS

Music Radar: A Web-based Query by Humming System

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Statistical Modeling and Retrieval of Polyphonic Music

Automatic Music Clustering using Audio Attributes

Refined Spectral Template Models for Score Following

Audio Structure Analysis

Music Segmentation Using Markov Chain Methods

SCORE-INFORMED IDENTIFICATION OF MISSING AND EXTRA NOTES IN PIANO RECORDINGS

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Music Alignment and Applications. Introduction

Improving Polyphonic and Poly-Instrumental Music to Score Alignment

SINGING EXPRESSION TRANSFER FROM ONE VOICE TO ANOTHER FOR A GIVEN SONG. Sangeon Yong, Juhan Nam

The Effect of DJs Social Network on Music Popularity

AN ADAPTIVE KARAOKE SYSTEM THAT PLAYS ACCOMPANIMENT PARTS OF MUSIC AUDIO SIGNALS SYNCHRONOUSLY WITH USERS SINGING VOICES

The H.26L Video Coding Project

HUMMING METHOD FOR CONTENT-BASED MUSIC INFORMATION RETRIEVAL

Supervised Musical Source Separation from Mono and Stereo Mixtures based on Sinusoidal Modeling

Lecture 12: Alignment and Matching

Topic 10. Multi-pitch Analysis

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

Transcription:

Advanced Course Computer Science Music Processing Summer Term 2010 Music ata Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Music Synchronization Music ata Various interpretations Beethoven s Fifth Bernstein Karajan Scherbakov (piano) (piano) General Goals Automated organization of complex and inhomogeneous music collections Generation of annotations and cross-links Tools and methods for multimodal search, navigation and interaction Music Information Retrieval (MIR) Music Synchronization Schematic view of various synchronization tasks Music Synchronization ( Alignment) Turetsky/Ellis (ISMIR 2003) Soulez/Rodet/Schwarz (ISMIR 2003) Arifi/Clausen/Kurth/Müller (ISMIR 2003) Hu/annenberg/Tzanetakis (WASPAA 2003) Müller/Kurth/Röder (ISMIR 2004) Raphael (ISMIR 2004) ixon/widmer (ISMIR 2005) Müller/Mattes/Kurth (ISMIR 2006) annenberg /Raphael (Special Issue ACM 2006) Kurth/Müller/Fremerey/Chang/Clausen (ISMIR 2007) Fujihara/Goto (ICASSP 2008) Wang/Iskandar/New/Shenoy (IEEE-TASLP 2008) Ewert/Müller/Grosche (ICASSP 2009)

Music Synchronization: - Music Synchronization: - Beethoven s Fifth Given: Two different audio recordings of the same underlying piece of music. Karajan Goal: Find for each position in one audio recording the musically corresponding position in the other audio recording. Scherbakov Music Synchronization: - Beethoven s Fifth Music Synchronization: - Bach Toccata Karajan Koopman Scherbakov Ruebsam Synchronization: Karajan Scherbakov Music Synchronization: - Bach Toccata Koopman Music Synchronization: - Transformation of audio recordings into sequences of feature vectors Ruebsam Fix cost measure Compute on the feature space cost matrix Synchronization: Koopman Ruebsam Compute cost-minimizing warping path from

Chroma Features Chroma Features Example: Bach Toccata Example: Bach Toccata Koopman Ruebsam Koopman Ruebsam Feature resolution: 10 Hz Feature resolution: 1 Hz Music Synchronization: - Music Synchronization: - Koopman Ruebsam = 12-dimensional normalized chroma vectors Local cost measure cost matrix Music Synchronization: - Cost-minimizing warping path Cost-Minimizing Warping Path Computation via dynamic programming ynamic Time Warping (TW) Memory requirements and running time: O(NM) Problem: Infeasible for large N and M Example: Feature resolution 10 Hz, pieces 15 min N, M ~ 10,000 N M ~ 100,000,000

Strategy: Global Constraints Strategy: Global Constraints Sakoe-Chiba band Itakura parallelogram Sakoe-Chiba band Itakura parallelogram Problem: Optimal warping path not in constraint region Compute optimal warping path on coarse level Project on fine level Specify constraint region Compute constrained optimal warping path

Suitable features? Resolution 4 Hz Resolution 2 Hz Resolution 1 Hz Suitable resolution levels? Size of constraint regions? Good trade-off between efficiency and robustness? Improve robustness by enhancing cost matrix Resolution 4 Hz Resolution 2 Hz Resolution 1 Hz Resolution 4 Hz Resolution 2 Hz Resolution 1 Hz Original Problem: Cost matrix may degenerate useless warping path Enhanced Improve robustness by enhancing cost matrix Resolution 4 Hz Resolution 2 Hz Resolution 1 Hz Chroma features at three levels: 0.33 Hz / 1 Hz / 10 Hz Enhanced Original

Chroma features at three levels: 0.33 Hz / 1 Hz / 10 Hz Music Synchronization: - Conclusions Chroma features Number of matrix entries needed for TW and MsTW: suited for harmony-based music Relatively coarse but good global alignments Multiscale approach: simple, robust, fast Music Synchronization: - System: Match (ixon) Applications Efficient music browsing Blending from one interpretation to another one Mixing and morphing different interpretations Tempo studies System: SyncPlayer/Switcher Music Synchronization: - Time

Music Synchronization: - Music Synchronization: - = meta data Automated annotation recording = reference (score) Tempo information recording Sonification of annotations Performance Analysis: Tempo Curves Performance Analysis: Tempo Curves Schumann: Träumerei What can be done if no reference is available? Musical tempo (B BPM) BPM) Musical tempo (B Musical time (measures) Musical time (measures) Music Synchronization: - Music Synchronization: Scan- Applications Automated audio annotation Accurate audio access after -based retrieval Automated tracking of note parameters during audio playback Performance Analysis

Music Synchronization: Scan- Music Synchronization: Scan- Scanned Sheet Music Scanned Sheet Music Symbolic Note Events OMR Correspondence Correspondence Recording Recording Music Synchronization: Scan- Music Synchronization: Scan- Scanned Sheet Music Symbolic Note Events Scanned Sheet Music High Qualtity OMR Correspondence Correspondence Recording OMR Symbolic Note Events irty but hidden High Qualtity Recording Application: Score Viewer Music Synchronization: Lyrics- ifficult task! [ECL 08, ICMI 08]

Music Synchronization: Lyrics- System: SyncPlayer/LyricsSeeker Lyrics- Lyrics- + - Normalized chroma features robust to changes in instrumentation and dynamics robust synchronization of reasonable overall quality rawback: low temporal alignment accuracy Idea: Integration of note onset information Normalized chroma features robust to changes in instrumentation and dynamics robust synchronization of reasonable overall quality rawback: low temporal alignment accuracy Idea: Integration of note onset information Example: - synchronization Chroma-Chroma: Chroma-Chroma + onset information: Example: C C C C Example: C C C C C C C C Cost-minimizing warping path

Example: C C C C Musically correct warping path Example: C C C C Musically correct warping path C C Cost-minimizing warping path C C Cost-minimizing warping path Problem: note onsets are not captured in feature representation Example: Beethoven s Fifth Example: Beethoven s Fifth Chroma representations Problem: note onsets are not captured in feature representation Example: Beethoven s Fifth Example: Beethoven s Fifth Cost matrix Cost matrix Warping path of poor local quality

General goal: etection of onsets of musical notes Magnitude spectrogram X Typical signal properties at note onset positions: increase in energy change of pitch change of spectral content high frequency content Frequency Idea: locate note onset candidates by measuring changes in spectral content Time Compressed spectrogram Y Spectral difference 3. ifferentiation Frequency Y = log( 1+ C X ) Frequency human sensation energy increase to be captured Time enhances low intensity values high frequency content reduces influence of amplitude modulation Time only positive values considered Spectral difference Frequency 3. ifferentiation 4. Accumulation Novelty Curve 3. ifferentiation 4. Accumulation Novelty Curve t

3. ifferentiation 4. Accumulation 5. Normalization 3. ifferentiation 4. Accumulation 5. Normalization Novelty Curve Substraction of local average Normalized novelty curve Normalized novelty curve 3. ifferentiation 4. Accumulation 5. Normalization 6. Peak picking Impulses 3. ifferentiation 4. Accumulation 5. Normalization 6. Peak picking ecaying impulses 3. ifferentiation 4. Accumulation 5. Normalization 6. Peak picking 7. ecay Filter Cost matrix based on impulses Cost matrix

Cost matrix based on decaying impulses Cost matrix based on decaying impulses Cost matrix Cost matrix Warping path based on onset information Ideas: Build up cost matrix with corridors of low cost ecaying strategy enforce corridor structure Each corridor corresponds to -audio pair of note onset candidates Warping path tends to run through corridors of low cost Impulses zoom ecaying impulses zoom note onset positions are likely to be aligned Cost matrix for decaying impulses Cost matrix for decaying impulses Corridor of low cost

Combination of two different types of cost matrices: Chroma cost matrix Onset cost matrix Cost matrix obtained from chroma features controls the global course of warping path robust synchronization Addition Cost matrix obtained from onset information controls the local course of warping path accurate alignment Conclusions: Music Synchronization Chroma cost matrix Onset cost matrix Various requirements Efficiency Addition Robustness Accuracy Variablity of music Conclusions: Music Synchronization Conclusions: Music Synchronization Combination of various strategies Feature level Local cost measure level Global alignment level Evidence pooling using competing strategies Offline vs. Online Online version: ixon/widmer (ISMIR 2005) Hidden Markov Models: Raphael (ISMIR 2004) Score-following Automatic accompaniment

Conclusions: Music Synchronization Presence of variations Instrumentation Musical structure Polyphony Musical key