Music Information Retrieval (MIR)

Similar documents
Music Information Retrieval (MIR)

Beethoven, Bach, and Billions of Bytes

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Beethoven, Bach und Billionen Bytes

Music Information Retrieval

Meinard Müller. Beethoven, Bach, und Billionen Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Music Processing Introduction Meinard Müller

Music Synchronization. Music Synchronization. Music Data. Music Data. General Goals. Music Information Retrieval (MIR)

Tempo and Beat Tracking

Tempo and Beat Analysis

Music Representations

Informed Feature Representations for Music and Motion

Further Topics in MIR

Audio Structure Analysis

Music Representations

Music Processing Audio Retrieval Meinard Müller

Introductions to Music Information Retrieval

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

A MID-LEVEL REPRESENTATION FOR CAPTURING DOMINANT TEMPO AND PULSE INFORMATION IN MUSIC RECORDINGS

Music Structure Analysis

AUTOMATIC MAPPING OF SCANNED SHEET MUSIC TO AUDIO RECORDINGS

TOWARDS AUTOMATED EXTRACTION OF TEMPO PARAMETERS FROM EXPRESSIVE MUSIC RECORDINGS

SHEET MUSIC-AUDIO IDENTIFICATION

Chord Recognition. Aspects of Music. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Music Processing.

A Multimodal Way of Experiencing and Exploring Music

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

Audio Structure Analysis

CS 591 S1 Computational Audio

MUSIC is a ubiquitous and vital part of the lives of billions

New Developments in Music Information Retrieval

Music Information Retrieval

Audio Structure Analysis

AUDIO MATCHING VIA CHROMA-BASED STATISTICAL FEATURES

Outline. Why do we classify? Audio Classification

Automatic music transcription

CSC475 Music Information Retrieval

Music Similarity and Cover Song Identification: The Case of Jazz

Music Radar: A Web-based Query by Humming System

Music Structure Analysis

Music Structure Analysis

LEARNING AUDIO SHEET MUSIC CORRESPONDENCES. Matthias Dorfer Department of Computational Perception

CHAPTER 6. Music Retrieval by Melody Style

2 2. Melody description The MPEG-7 standard distinguishes three types of attributes related to melody: the fundamental frequency LLD associated to a t

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models

Semi-automated extraction of expressive performance information from acoustic recordings of piano music. Andrew Earis

Timing In Expressive Performance

ANALYZING MEASURE ANNOTATIONS FOR WESTERN CLASSICAL MUSIC RECORDINGS

Video-based Vibrato Detection and Analysis for Polyphonic String Music

JOINT STRUCTURE ANALYSIS WITH APPLICATIONS TO MUSIC ANNOTATION AND SYNCHRONIZATION

jsymbolic 2: New Developments and Research Opportunities

The Effect of DJs Social Network on Music Popularity

Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines

MUSI-6201 Computational Music Analysis

Rhythm related MIR tasks

Data Driven Music Understanding

CTP431- Music and Audio Computing Music Information Retrieval. Graduate School of Culture Technology KAIST Juhan Nam

The song remains the same: identifying versions of the same piece using tonal descriptors

Audio Content-Based Music Retrieval

Musical Examination to Bridge Audio Data and Sheet Music

ONE main goal of content-based music analysis and retrieval

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Hidden Markov Model based dance recognition

PULSE-DEPENDENT ANALYSES OF PERCUSSIVE MUSIC

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Aspects of Music. Chord Recognition. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Piece of music. Rhythm.

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

Music Information Retrieval

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

Automatic Labelling of tabla signals

STOCHASTIC MODELING OF A MUSICAL PERFORMANCE WITH EXPRESSIVE REPRESENTATIONS FROM THE MUSICAL SCORE

SCORE-INFORMED VOICE SEPARATION FOR PIANO RECORDINGS

DISCOVERING MORPHOLOGICAL SIMILARITY IN TRADITIONAL FORMS OF MUSIC. Andre Holzapfel

FREISCHÜTZ DIGITAL: A CASE STUDY FOR REFERENCE-BASED AUDIO SEGMENTATION OF OPERAS

Music Alignment and Applications. Introduction

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Automatic Piano Music Transcription

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

IMPROVING RHYTHMIC SIMILARITY COMPUTATION BY BEAT HISTOGRAM TRANSFORMATIONS

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

MUSICAL INSTRUMENT IDENTIFICATION BASED ON HARMONIC TEMPORAL TIMBRE FEATURES

Voice & Music Pattern Extraction: A Review

HUMAN PERCEPTION AND COMPUTER EXTRACTION OF MUSICAL BEAT STRENGTH

Music Information Retrieval. Juan Pablo Bello MPATE-GE 2623 Music Information Retrieval New York University

Towards Automated Processing of Folk Song Recordings

THE importance of music content analysis for musical

Representing, comparing and evaluating of music files

WHO IS WHO IN THE END? RECOGNIZING PIANISTS BY THEIR FINAL RITARDANDI

A Study of Synchronization of Audio Data with Symbolic Data. Music254 Project Report Spring 2007 SongHui Chon

MATCHING MUSICAL THEMES BASED ON NOISY OCR AND OMR INPUT. Stefan Balke, Sanu Pulimootil Achankunju, Meinard Müller

Analyzer Documentation

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

gresearch Focus Cognitive Sciences

Computational Modelling of Harmony

Music Information Retrieval. Juan P Bello

Music Mood. Sheng Xu, Albert Peyton, Ryan Bhular

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam

AUTOMATED METHODS FOR ANALYZING MUSIC RECORDINGS IN SONATA FORM

Transcription:

Ringvorlesung Perspektiven der Informatik Wintersemester 2011/2012 Meinard Müller Universität des Saarlandes und MPI Informatik meinard@mpi-inf.mpg.de Priv.-Doz. Dr. Meinard Müller 2007 Habilitation, Bonn 2007 MPI Informatik, Saarland Cluster of Excellence Automatisierte Musikverarbeitung 5 PhD Students Music Data Music Information Retrieval (MIR) Detection of semantic relations, e.g., harmonic, rhythmic, or motivic similarity Extraction of musical entities such as note events, instrumentation, or musical form Tools and methods for multimodal search, navigation, and interaction Piano Roll Representation Piano Roll Representation Player Piano (1900)

Piano Roll Representation (MIDI) J.S. Bach, C-Major Fuge (Well Tempered Piano, BWV 846) Piano Roll Representation (MIDI) Query: Goal: Find all occurrences of the query Time Pitch Piano Roll Representation (MIDI) Query: Audio Data Beethoven s Fifth Goal: Find all occurrences of the query Matches: Various interpretations Bernstein Karajan Scherbakov (piano) MIDI (piano) Memory Requirements Memory Requirements 1 Bit = 1: on 0: off 1 Byte = 8 Bits 1 Kilobyte (KB) = 1 Thousand Bytes 1 Megabyte (MB) = 1 Million Bytes 1 Gigabyte (GB) = 1 Billion Bytes 1 Terabyte (TB) = 1000 Billion Bytes 12.000 MIDI files < 350 MB One audio CD 650 MB Two audio CDs > 1 Billion Bytes 1000 audio CDs Billions of Bytes

Music Synchronization: Audio-Audio Beethoven s Fifth Music Synchronization: Audio-Audio Beethoven s Fifth Karajan Karajan Scherbakov Scherbakov Synchronization: Karajan Scherbakov Music Synchronization: Audio-Audio Feature extraction: chroma features Music Synchronization: Audio-Audio Cost matrix Karajan Scherbakov 18 1 0.9 16 0.8 Karaja an 14 12 10 8 6 4 2 0.7 0.6 0.5 0.4 0.3 0.2 0.1 2 4 6 8 10 12 14 16 18 20 0 Scherbakov Music Synchronization: Audio-Audio Application: Interpretation Switcher Cost-minimizing warping path 1 18 16 0.9 0.8 Karaja an 14 12 10 8 0.7 0.6 0.5 0.4 6 4 2 0.3 0.2 0.1 2 4 6 8 10 12 14 16 18 20 Scherbakov 0

Music Synchronization: MIDI-Audio Music Synchronization: MIDI-Audio MIDI = meta data Automated annotation Audio recording Sonification of annotations Music Synchronization: Scan-Audio Music Synchronization: Scan-Audio Audio Audio Scan Scan Music Synchronization: Scan-Audio Music Synchronization: Scan-Audio Scanned Sheet Music Scanned Sheet Music Symbolic Note Events OMR Correspondence Correspondence Audio Recording Audio Recording

Music Synchronization: Scan-Audio Music Synchronization: Scan-Audio Symbolic Note Events Scanned Sheet Music Symbolic Note Events Scanned Sheet Music High Qualtity OMR Correspondence Dirty but hidden OMR High Qualtity Correspondence Audio Recording Audio Recording Application: Score Viewer Audio Structure Analysis Given: CD recording Goal: Automatic extraction of the repetitive structure (or of the musical form) Example: Brahms Hungarian Dance No. 5 (Ormandy) 50 100 150 200 Similarity structure 50 100 150 200 50 100 Similarity structure 150 200

) Similarity structure ) Similarity structure ) Similarity structure ) Similarity structure ) ) Path relations Path relations 1 1 2 2 3 3 4 4 5 5 6 6 7 7 Grouping / Transitivity

) ) Path relations Path relations 1 1 2 2 3 3 4 4 5 5 6 6 7 Grouping / Transitivity 7 Grouping / Transitivity Coarse Level Fine Level Coarse Level Fine Level What do different versions have in common? What are the characteristics of a specific version? What do different versions have in common? What are the characteristics of a specific version? What makes up a piece of music? What makes music come alive? Coarse Level Fine Level Coarse Level Fine Level What do different versions have in common? What are the characteristics of a specific version? What do different versions have in common? What are the characteristics of a specific version? What makes up a piece of music? What makes music come alive? What makes up a piece of music? What makes music come alive? Identify despite of differences Identify the differences Identify despite of differences Identify the differences Example tasks: Audio Matching Cover Song Identification Example tasks: Tempo Estimation Performance Analysis

Performance Analysis Performance Analysis: Tempo Curves Schumann: Träumerei 1. Capture nuances regarding tempo, dynamics, articulation, timbre, 2. Discover commonalities between different performances and derive general performance rules 3. Characterize the style of a specific musician (``Horowitz Factor ) Performance: Performance Analysis: Tempo Curves Schumann: Träumerei Score (reference): Performance Analysis: Tempo Curves Schumann: Träumerei Score (reference): Performance: Strategy: Compute score-audio synchronization and derive tempo curve Performance: Performance Analysis: Tempo Curves Schumann: Träumerei Score (reference): Performance Analysis: Tempo Curves Schumann: Träumerei Score (reference): Tempo Curve: Tempo Curves: Musical tempo (BPM) Musical tempo (BPM) Musical time (measures) Musical time (measures)

Performance Analysis: Tempo Curves Schumann: Träumerei Score (reference): Performance Analysis Schumann: Träumerei What can be done if no reference is available? Tempo Curves: Tempo Curves: Musical tempo (BPM) Musical tempo (BPM) Musical time (measures) Musical time (measures) Relative Absolute Relative Absolute Given: Several versions Given: One version Given: Several versions Given: One version Comparison of extracted parameters Direct interpretation of extracted parameters Relative Absolute Relative Absolute Given: Several versions Given: One version Given: Several versions Given: One version Comparison of extracted parameters Direct interpretation of extracted parameters Comparison of extracted parameters Direct interpretation of extracted parameters Extraction errors have often no consequence on final result Extraction errors immediately become evident Extraction errors have often no consequence on final result Extraction errors immediately become evident Example tasks: Music Synchronization Genre Classification Example tasks: Music Transcription Tempo Estimation

Tempo Estimation Tempo Estimation Measure Tactus (beat) Tempo Estimation Tatum (temporal atom) Example: Chopin Mazurka Op. 68-3 Pulse level: Quarter note Tempo:??? Tempo Estimation Example: Chopin Mazurka Op. 68-3 Pulse level: Tempo: Quarter note 50-200 BPM Which temporal level? Local tempo deviations Tempo curve Tempo (BPM) 200 50 Sparse information (e.g., only note onsets available) Vague information (e.g., extracted note onsets corrupt) Time (beats)

Local Energy Curve: Local Energy Curve: Note Onset Positions Energy Energy Spectrogram Steps: Compressed Spectrogram Steps: 1. Spectrogram 1. Spectrogram 2. Log Compression Frequency (Hz z) Frequency (Hz z) Difference Spectrogram Steps: Steps: 1. Spectrogram 2. Log Compression 3. Differentiation 1. Spectrogram 2. Log Compression 3. Differentiation z) Frequency (Hz Novelty Curve 4. Accumulation

Steps: Steps: 1. Spectrogram 2. Log Compression 3. Differentiation 4. Accumulation 1. Spectrogram 2. Log Compression 3. Differentiation 4. Accumulation 5. Normalization Novelty Curve Local Average Novelty Curve Tempo o (BPM) Inte ensity Tempo o (BPM) Inte ensity Tempo o (BPM) Inte ensity Tempo o (BPM) Inte ensity

Motivic Similarity Novelty Curve Predominant Local Pulse (PLP) Motivic Similarity Motivic Similarity Beethoven s Fifth (1st Mov.) Beethoven s Fifth (1st Mov.) Beethoven s Fifth (3rd Mov.) Motivic Similarity Multimodal Computing and Interaction Sheet Music (Image) MIDI CD / MP3 (Audio) Beethoven s Fifth (1st Mov.) Music Beethoven s Fifth (3rd Mov.) Beethoven s Appassionata

Multimodal Computing and Interaction Sheet Music (Image) MIDI CD / MP3 (Audio) MusicXML (Text) Music Singing / Voice (Audio) Music Literature (Text) Music Film (Video) Dance / Motion (Mocap)