Meinard Müller. Beethoven, Bach, und Billionen Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Similar documents
Beethoven, Bach und Billionen Bytes

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Beethoven, Bach, and Billions of Bytes

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Music Information Retrieval

Music Information Retrieval (MIR)

Music Processing Introduction Meinard Müller

Music Information Retrieval (MIR)

Informed Feature Representations for Music and Motion

Tempo and Beat Tracking

Tempo and Beat Analysis

Further Topics in MIR

Music Structure Analysis

Music Representations

Music Synchronization. Music Synchronization. Music Data. Music Data. General Goals. Music Information Retrieval (MIR)

Music Structure Analysis

Music Representations

Audio Structure Analysis

Music Processing Audio Retrieval Meinard Müller

A MID-LEVEL REPRESENTATION FOR CAPTURING DOMINANT TEMPO AND PULSE INFORMATION IN MUSIC RECORDINGS

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

A Multimodal Way of Experiencing and Exploring Music

AUTOMATIC MAPPING OF SCANNED SHEET MUSIC TO AUDIO RECORDINGS

Introductions to Music Information Retrieval

SHEET MUSIC-AUDIO IDENTIFICATION

Audio Structure Analysis

TOWARDS AUTOMATED EXTRACTION OF TEMPO PARAMETERS FROM EXPRESSIVE MUSIC RECORDINGS

AUTOMATED METHODS FOR ANALYZING MUSIC RECORDINGS IN SONATA FORM

Automatic music transcription

ANALYZING MEASURE ANNOTATIONS FOR WESTERN CLASSICAL MUSIC RECORDINGS

Music Structure Analysis

MUSIC is a ubiquitous and vital part of the lives of billions

RETRIEVING AUDIO RECORDINGS USING MUSICAL THEMES

AUDIO MATCHING VIA CHROMA-BASED STATISTICAL FEATURES

SCORE-INFORMED VOICE SEPARATION FOR PIANO RECORDINGS

Semi-automated extraction of expressive performance information from acoustic recordings of piano music. Andrew Earis

Audio Structure Analysis

FREISCHÜTZ DIGITAL: A CASE STUDY FOR REFERENCE-BASED AUDIO SEGMENTATION OF OPERAS

Musical Examination to Bridge Audio Data and Sheet Music

MATCHING MUSICAL THEMES BASED ON NOISY OCR AND OMR INPUT. Stefan Balke, Sanu Pulimootil Achankunju, Meinard Müller

New Developments in Music Information Retrieval

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

Robert Alexandru Dobre, Cristian Negrescu

CSC475 Music Information Retrieval

Music Similarity and Cover Song Identification: The Case of Jazz

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Chord Recognition. Aspects of Music. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Music Processing.

A prototype system for rule-based expressive modifications of audio recordings

NOTE-LEVEL MUSIC TRANSCRIPTION BY MAXIMUM LIKELIHOOD SAMPLING

THE importance of music content analysis for musical

The song remains the same: identifying versions of the same piece using tonal descriptors

Automatic Classification of Instrumental Music & Human Voice Using Formant Analysis

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

Music Radar: A Web-based Query by Humming System

Aspects of Music. Chord Recognition. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Piece of music. Rhythm.

Effects of acoustic degradations on cover song recognition

Tool-based Identification of Melodic Patterns in MusicXML Documents

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Towards Automated Processing of Folk Song Recordings

Grouping Recorded Music by Structural Similarity Juan Pablo Bello New York University ISMIR 09, Kobe October 2009 marl music and audio research lab

gresearch Focus Cognitive Sciences

2 2. Melody description The MPEG-7 standard distinguishes three types of attributes related to melody: the fundamental frequency LLD associated to a t

Outline. Why do we classify? Audio Classification

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Digital audio and computer music. COS 116, Spring 2012 Guest lecture: Rebecca Fiebrink

Topic 10. Multi-pitch Analysis

BRAIN BEATS: TEMPO EXTRACTION FROM EEG DATA

Representing, comparing and evaluating of music files

The Effect of DJs Social Network on Music Popularity

A REAL-TIME SIGNAL PROCESSING FRAMEWORK OF MUSICAL EXPRESSIVE FEATURE EXTRACTION USING MATLAB

ONE main goal of content-based music analysis and retrieval

UNIVERSITY OF DUBLIN TRINITY COLLEGE

Subjective Similarity of Music: Data Collection for Individuality Analysis

Data Driven Music Understanding

SINGING EXPRESSION TRANSFER FROM ONE VOICE TO ANOTHER FOR A GIVEN SONG. Sangeon Yong, Juhan Nam

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

Author Index. Absolu, Brandt 165. Montecchio, Nicola 187 Mukherjee, Bhaswati 285 Müllensiefen, Daniel 365. Bay, Mert 93

MUSI-6201 Computational Music Analysis

SEMANTIC AUDIO. 20 HJuly COCHAIRS Karlheinz Brandenburg and Mark Sandler. PAPERS COCHAIRS Anssi Klapuri and Christian Dittmar

Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines

Data-Driven Solo Voice Enhancement for Jazz Music Retrieval

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

Chord Classification of an Audio Signal using Artificial Neural Network

A Study of Synchronization of Audio Data with Symbolic Data. Music254 Project Report Spring 2007 SongHui Chon

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

Music Information Retrieval

Music Information Retrieval. Juan P Bello

CS 591 S1 Computational Audio

CSC475 Music Information Retrieval

ON FINDING MELODIC LINES IN AUDIO RECORDINGS. Matija Marolt

However, in studies of expressive timing, the aim is to investigate production rather than perception of timing, that is, independently of the listene

SCORE-INFORMED IDENTIFICATION OF MISSING AND EXTRA NOTES IN PIANO RECORDINGS

A TIMBRE-BASED APPROACH TO ESTIMATE KEY VELOCITY FROM POLYPHONIC PIANO RECORDINGS

Example 1 (W.A. Mozart, Piano Trio, K. 542/iii, mm ):

IMPROVING RHYTHMIC SIMILARITY COMPUTATION BY BEAT HISTOGRAM TRANSFORMATIONS

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

MAKE YOUR OWN ACCOMPANIMENT: ADAPTING FULL-MIX RECORDINGS TO MATCH SOLO-ONLY USER RECORDINGS

A CHROMA-BASED SALIENCE FUNCTION FOR MELODY AND BASS LINE ESTIMATION FROM MUSIC AUDIO SIGNALS

Transcription:

Beethoven, Bach, und Billionen Bytes Musik trifft Informatik Meinard Müller Meinard Müller 2007 Habilitation, Bonn 2007 MPI Informatik, Saarbrücken Senior Researcher Music Processing & Motion Processing Informatik-Kolloquium Universität Bamberg 23. Oktober 2014 2012 W3-Professur, Labs Erlangen Semantic Processing Fraunhofer IIS & Multimedia Realtime Systems Technical Faculty Electrical Engineering Dept. Uni / FAU Labs-IIS 30..40 Staff Labs-FAU 6 Professors Fraunhofer IIS & Multimedia Realtime Systems Technical Faculty Electrical Engineering Dept. Uni / FAU Labs-IIS 30..40 Staff Labs-FAU 6 Professors

Coding 3D Psychoacoustics Music Processing Music Representations Research Goals Sheet Music (Image) CD / MP3 () MusicXML (Text) Music Information Retrieval (MIR) ISMIR Analysis of music signals (harmonic, melodic, rhythmic, motivic aspects) Dance / Motion (Mocap) Music MIDI Design of musically relevant audio features Tools for multimodal search and interaction Singing / Voice () Music Film (Video) Music Literature (Text) Piano Roll Representation Player Piano (1900)

Piano Roll Representation (MIDI) J.S. Bach, C-Major Fuge (Well Tempered Piano, BWV 6) Piano Roll Representation (MIDI) Query: Goal: Find all occurrences of the query Time Pitch Piano Roll Representation (MIDI) Data Query: Goal: Find all occurrences of the query Matches: Various interpretations Beethoven s Fifth Bernstein Karajan Scherbakov (piano) MIDI (piano) Data (Memory Requirements) Music Synchronization: - 1 Bit = 1: on, 0: off 1 Byte = 8 Bits 1 Kilobyte (KB) = 1 Thousand Bytes 1 Megabyte (MB) = 1 Million Bytes 1 Gigabyte (GB) = 1 Billion Bytes 1 Terabyte (TB) = 1000 Billion Bytes Beethoven s Fifth Two audio CDs > 1 Billion Bytes 1000 audio CDs Billions of Bytes 12.000 MIDI files < 350 MB

Music Synchronization: - Music Synchronization: - Beethoven s Fifth Beethoven s Fifth Orchester (Karajan) Orchester (Karajan) Piano (Scherbakov) Piano (Scherbakov) Application: Interpretation Switcher Music Synchronization: Image- Image Music Synchronization: Image- How to make the data comparable? Image Image

How to make the data comparable? Image Processing: Optical Music Recognition How to make the data comparable? Image Processing: Optical Music Recognition Image Image Processing: Fourier Analyse How to make the data comparable? Application: Score Viewer Image Processing: Optical Music Recognition Image Processing: Fourier Analyse General goal: Convert an audio recording into a mid-level representation that captures certain musical properties while supressing other properties. Timbre / Instrumentation Tempo / Rhythm Pitch / Harmony General goal: Convert an audio recording into a mid-level representation that captures certain musical properties while supressing other properties. Timbre / Instrumentation Tempo / Rhythm Pitch / Harmony

Chromatic scale Chromatic scale Spectrogram Waveform Amplitude Chromatic scale Chromatic scale Spectrogram Spectrogram : 4186 Hz : 2093 Hz : 1046 Hz : 523 Hz : 261 Hz : 131 Hz Chromatic scale Chromatic scale Log-frequency spectrogram Log-frequency spectrogram : 4186 Hz : 2093 Hz : 1046 Hz : 523 Hz : 261 Hz : 131 Hz Pitch (MIDI note number)

Chromatic scale Chromatic scale Log-frequency spectrogram Log-frequency spectrogram Pitch (MIDI note number) Pitch (MIDI note number) Chroma C Chroma C # Chromatic scale Why is Music Processing Challenging? Chopin, Mazurka Op. 63 No. 3 Chroma representation Chroma Why is Music Processing Challenging? Why is Music Processing Challenging? Chopin, Mazurka Op. 63 No. 3 Chopin, Mazurka Op. 63 No. 3 Waveform Waveform / Spectrogram Amplitude

Why is Music Processing Challenging? Why is Music Processing Challenging? Chopin, Mazurka Op. 63 No. 3 Chopin, Mazurka Op. 63 No. 3 Waveform / Spectrogram Performance Tempo Dynamics Note deviations Sustain pedal Waveform / Spectrogram Performance Tempo Dynamics Note deviations Sustain pedal Polyphony Main Melody Additional melody line Accompaniment Source Separation Decomposition of audio stream into different sound sources Central task in digital signal processing Cocktail party effect Sources are often assumed to be statistically independent This is often not the case in music Strategy: Exploit additional information (e.g. musical score) to support the seperation process

Goal: Approximate spectrogram using a parametric model exploiting availablity of score information Estimate Parameters Render Original audio recording Original audio recording Model initialized with MIDI information Original audio recording Temporal synchronization Tuning estimation Original audio recording Activity parameters updated Original audio recording Partial energy distribution Resonance body properties

Experimental results for separating left and right hands for piano recordings: Original audio recording Model after three iterations Note: Each note specified by the score parameterizes a portion of the spectrogram Composer Piece Database Results L R Eq Org Bach BWV 875, Prelude SMD Chopin Op. 28, No. 15 SMD Chopin Op. 64, No. 1 European Archive editing Basic task: Tapping the foot when listening to music 10 10 1200 1200 800 800 400 400 6 7 8 9 6 7 8 9 Frequency (Hertz) 580 523 500 0 0.5 1 Frequency (Hertz) 580 554 500 0 0.5 1 Basic task: Tapping the foot when listening to music Basic task: Tapping the foot when listening to music Queen Another One Bites The Dust Queen Another One Bites The Dust

Happy Birthday to you Happy Birthday to you Pulse level: Measure Pulse level: Tactus (beat) Happy Birthday to you Chopin Mazurka Op. 68-3 Pulse level: Tatum (temporal atom) Pulse level: Quarter note Tempo:??? Chopin Mazurka Op. 68-3 Pulse level: Quarter note Tempo: 50-200 BPM Tempo curve Which temporal level? Local tempo deviations Sparse information (e.g., only note onsets available) Tempo (BPM) 200 50 Vague information (e.g., extracted note onsets corrupt) Time (beats)

Spectrogram Steps: Compressed Spectrogram Steps: 1. Spectrogram 1. Spectrogram 2. Log Compression Difference Spectrogram Steps: Steps: 1. Spectrogram 2. Log Compression 3. Differentiation Novelty Curve 1. Spectrogram 2. Log Compression 3. Differentiation 4. Accumulation Steps: Steps: 1. Spectrogram 2. Log Compression 3. Differentiation 4. Accumulation 1. Spectrogram 2. Log Compression 3. Differentiation 4. Accumulation 5. Normalization Novelty Curve Local Average Novelty Curve

Tempo (BPM) Intensity Tempo (BPM) Intensity Tempo (BPM) Intensity Tempo (BPM) Intensity Tempo (BPM) Intensity Novelty Curve Predominant Local Pulse (PLP)

Information Retrieval Query Database Information Retrieval Hit Musiksynchronisation -ID Version-ID Category-ID Bernstein (12) Beethoven, Symphony No. 5 Beethoven, Symphony No. 5: Bernstein (12) Karajan (1982) Gould (1992) Beethoven, Symphony No. 9 Beethoven, Symphony No. 3 Haydn Symphony No. 94 -ID Version-ID Category-ID Information Retrieval Motivic Similarity Musiksynchronisation -ID Version-ID Category-ID Similarity? Relevance? User! Beethoven s Fifth (1st Mov.) Motivic Similarity Motivic Similarity Beethoven s Fifth (1st Mov.) Beethoven s Fifth (3rd Mov.) Beethoven s Fifth (1st Mov.) Beethoven s Fifth (3rd Mov.) Beethoven s Appassionata

Motivic Similarity Motivic Similarity B A C H Music Processing Computer Sciene Information Retrieval Pattern Matching Multimedia User Interfaces Humanities Music Analysis Performance Analysis Music Education EEI Signal Processing Processing Computational Acoustics Sensors Thanks Michael Clausen (Bonn University) David Damm (Bonn University) Jonathan Driedger (Universität Erlangen-Nürnberg) Sebastian Ewert (Bonn University) Christian Fremerey (Bonn University) Peter Grosche (Saarland University) Nanzhu Jiang (Universität Erlangen-Nürnberg) Verena Konz (Saarland University) Frank Kurth (Fraunhofer-FKIE, Wachtberg ) Thomas Prätzlich (Universität Erlangen-Nürnberg) Verena Thomas (Bonn University) Projekte & Kooperationen Selected Publications (Music Processing) DFG-Projekt: METRUM Mehrschichtige Analyse und Strukturierung von Musiksignalen Kooperation: Michael Clausen Laufzeit: 2011-2015 BMBF-Projekt: Freischütz Digital Freischütz Digital Paradigmatische Umsetzung eines genuin digitalen Editionskonzepts Kooperation: Joachim Veit, Thomas Betzwieser, Gerd Szwillus Laufzeit: 2012-2015 DFG-Projekt: SIAMUS Notentext-Informierte parametrisierung von Musiksignalen Laufzeit: 2014-2017 Projekt Musikwissenschaften Computergestützte Analyse harmonischer Strukturen Kooperation: Rainer Kleinertz Laufzeit: 2015-2018 M. Müller, N. Jiang, P. Grosche (2013): A robust fitness measure for capturing repetitions in music recordings with applications to audio thumbnailing. IEEE Trans. on, Speech & Language Processing, Vol. 21, No. 3, pp. 531 543. M. Müller, P.W. Ellis, A. Klapuri, G. Richard (2011): Signal Processing for Music Analysis. IEEE Journal of Selected Topics in Signal Processing, Vol. 5, No. 6, pp. 8-1110. P. Grosche and M. Müller (2011): Extracting Predominant Local Pulse Information from Music Recordings. IEEE Trans. on, Speech & Language Processing, Vol. 19, No. 6, pp. 1688-1701. M. Müller and S. Ewert (2010): Towards Timbre-Invariant Features for Harmony-Based Music. IEEE Trans. on, Speech & Language Processing, Vol. 18, No. 3, pp. 649-662. F. Kurth, M. Müller (2008): Efficient Index-Based Matching. IEEE Trans., Speech & Language Processing, Vol. 16, No. 2, 382-395. M. Müller (2007): Information Retrieval for Music and Motion. Monograph, Springer, 318 pages