Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Similar documents
Beethoven, Bach, and Billions of Bytes

Music Information Retrieval (MIR)

Meinard Müller. Beethoven, Bach, und Billionen Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Beethoven, Bach und Billionen Bytes

Music Information Retrieval (MIR)

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Music Processing Introduction Meinard Müller

Music Information Retrieval

Informed Feature Representations for Music and Motion

Tempo and Beat Tracking

Music Synchronization. Music Synchronization. Music Data. Music Data. General Goals. Music Information Retrieval (MIR)

Tempo and Beat Analysis

Music Structure Analysis

Music Structure Analysis

Music Representations

Audio Structure Analysis

Music Representations

Further Topics in MIR

Music Processing Audio Retrieval Meinard Müller

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

AUTOMATIC MAPPING OF SCANNED SHEET MUSIC TO AUDIO RECORDINGS

A Multimodal Way of Experiencing and Exploring Music

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

TOWARDS AUTOMATED EXTRACTION OF TEMPO PARAMETERS FROM EXPRESSIVE MUSIC RECORDINGS

Audio Structure Analysis

A MID-LEVEL REPRESENTATION FOR CAPTURING DOMINANT TEMPO AND PULSE INFORMATION IN MUSIC RECORDINGS

SHEET MUSIC-AUDIO IDENTIFICATION

Introductions to Music Information Retrieval

AUTOMATED METHODS FOR ANALYZING MUSIC RECORDINGS IN SONATA FORM

Music Structure Analysis

New Developments in Music Information Retrieval

Audio Structure Analysis

AUDIO MATCHING VIA CHROMA-BASED STATISTICAL FEATURES

CS 591 S1 Computational Audio

ANALYZING MEASURE ANNOTATIONS FOR WESTERN CLASSICAL MUSIC RECORDINGS

MUSIC is a ubiquitous and vital part of the lives of billions

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

Music Similarity and Cover Song Identification: The Case of Jazz

Automatic music transcription

The Effect of DJs Social Network on Music Popularity

Data Driven Music Understanding

Aspects of Music. Chord Recognition. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Piece of music. Rhythm.

Outline. Why do we classify? Audio Classification

CSC475 Music Information Retrieval

RETRIEVING AUDIO RECORDINGS USING MUSICAL THEMES

Data-Driven Solo Voice Enhancement for Jazz Music Retrieval

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

FREISCHÜTZ DIGITAL: A CASE STUDY FOR REFERENCE-BASED AUDIO SEGMENTATION OF OPERAS

MATCHING MUSICAL THEMES BASED ON NOISY OCR AND OMR INPUT. Stefan Balke, Sanu Pulimootil Achankunju, Meinard Müller

MUSI-6201 Computational Music Analysis

NOTE-LEVEL MUSIC TRANSCRIPTION BY MAXIMUM LIKELIHOOD SAMPLING

Video-based Vibrato Detection and Analysis for Polyphonic String Music

Music Information Retrieval. Juan Pablo Bello MPATE-GE 2623 Music Information Retrieval New York University

Robert Alexandru Dobre, Cristian Negrescu

Chord Classification of an Audio Signal using Artificial Neural Network

MAKE YOUR OWN ACCOMPANIMENT: ADAPTING FULL-MIX RECORDINGS TO MATCH SOLO-ONLY USER RECORDINGS

A TIMBRE-BASED APPROACH TO ESTIMATE KEY VELOCITY FROM POLYPHONIC PIANO RECORDINGS

Grouping Recorded Music by Structural Similarity Juan Pablo Bello New York University ISMIR 09, Kobe October 2009 marl music and audio research lab

Music Information Retrieval

Towards Automated Processing of Folk Song Recordings

Automatic Classification of Instrumental Music & Human Voice Using Formant Analysis

Musical Examination to Bridge Audio Data and Sheet Music

gresearch Focus Cognitive Sciences

THE importance of music content analysis for musical

SINGING EXPRESSION TRANSFER FROM ONE VOICE TO ANOTHER FOR A GIVEN SONG. Sangeon Yong, Juhan Nam

Chord Recognition. Aspects of Music. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Music Processing.

SCORE-INFORMED IDENTIFICATION OF MISSING AND EXTRA NOTES IN PIANO RECORDINGS

Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines

Effects of acoustic degradations on cover song recognition

MAKE YOUR OWN ACCOMPANIMENT: ADAPTING FULL-MIX RECORDINGS TO MATCH SOLO-ONLY USER RECORDINGS

Music Radar: A Web-based Query by Humming System

Topic 10. Multi-pitch Analysis

2 2. Melody description The MPEG-7 standard distinguishes three types of attributes related to melody: the fundamental frequency LLD associated to a t

Retrieval of textual song lyrics from sung inputs

The song remains the same: identifying versions of the same piece using tonal descriptors

CHAPTER 6. Music Retrieval by Melody Style

A Study of Synchronization of Audio Data with Symbolic Data. Music254 Project Report Spring 2007 SongHui Chon

BRAIN BEATS: TEMPO EXTRACTION FROM EEG DATA

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Lecture 12: Alignment and Matching

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Music Database Retrieval Based on Spectral Similarity

The Intervalgram: An Audio Feature for Large-scale Melody Recognition

Semi-automated extraction of expressive performance information from acoustic recordings of piano music. Andrew Earis

ONE main goal of content-based music analysis and retrieval

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

IMPROVING RHYTHMIC SIMILARITY COMPUTATION BY BEAT HISTOGRAM TRANSFORMATIONS

Transcription of the Singing Melody in Polyphonic Music

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

Lecture 9 Source Separation

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam

Audio Content-Based Music Retrieval

Tools for music information retrieval and playing.

A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models

Statistical Modeling and Retrieval of Polyphonic Music

Audio Cover Song Identification using Convolutional Neural Network

Music Information Retrieval Using Audio Input

MUSICAL INSTRUMENT IDENTIFICATION BASED ON HARMONIC TEMPORAL TIMBRE FEATURES

DECODING TEMPO AND TIMING VARIATIONS IN MUSIC RECORDINGS FROM BEAT ANNOTATIONS

Transcription:

Music Representations Lecture Music Processing Sheet Music (Image) CD / MP3 (Audio) MusicXML (Text) Beethoven, Bach, and Billions of Bytes New Alliances between Music and Computer Science Dance / Motion (Mocap) Music MIDI Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Singing / Voice (Audio) Music Film (Video) Music Literature (Text) Research Goals Roll Representation Music Information Retrieval (MIR) ISMIR Analysis of music signals (harmonic, melodic, rhythmic, motivic aspects) Design of musically relevant audio features Tools for multimodal search and interaction Player (1900) Roll Representation (MIDI) J.S. Bach, C-Major Fuge (Well Tempered, BWV 846) Time Pitch

Roll Representation (MIDI) Query: Goal: Find all occurrences of the query Roll Representation (MIDI) Query: Goal: Find all occurrences of the query Matches: Audio Data Memory Requirements Various interpretations Beethoven s Fifth Bernstein Karajan Scherbakov (piano) 1 Bit = 1: on 0: off 1 Byte = 8 Bits 1 Kilobyte (KB) = 1 Thousand Bytes 1 Megabyte (MB) = 1 Million Bytes 1 Gigabyte (GB) = 1 Billion Bytes 1 Terabyte (TB) = 1000 Billion Bytes MIDI (piano) Memory Requirements Why is Music Processing Challenging? 12.000 MIDI files < 350 MB Example: Chopin, Mazurka Op. 63 No. 3 One audio CD 650 MB Two audio CDs > 1 Billion Bytes 1000 audio CDs Billions of Bytes

Why is Music Processing Challenging? Why is Music Processing Challenging? Example: Chopin, Mazurka Op. 63 No. 3 Example: Chopin, Mazurka Op. 63 No. 3 Waveform Waveform / Spectrogram Amplitude Frequency (Hz) Why is Music Processing Challenging? Why is Music Processing Challenging? Example: Chopin, Mazurka Op. 63 No. 3 Example: Chopin, Mazurka Op. 63 No. 3 Waveform / Spectrogram Performance Tempo Dynamics Note deviations Sustain pedal Waveform / Spectrogram Performance Tempo Dynamics Note deviations Sustain pedal Polyphony Main Melody Additional melody line Accompaniment

Application: Interpretation Switcher Two main steps: 1.) Audio features Robust but discriminative Chroma features Robust to variations in instrumentation, timbre, dynamics Correlate to harmonic progression 2.) Alignment procedure Deals with local and global tempo variations Needs to be efficient

Music Synchronization: Image-Audio Image Audio Music Synchronization: Image-Audio Music Synchronization: Image-Audio Convert into common mid-level feature representation Audio Image

Music Synchronization: Image-Audio Convert into common mid-level feature representation Music Synchronization: Image-Audio Convert into common mid-level feature representation Digital signal processing Digital signal processing Optical music recognition Audio chroma representation Audio chroma representation Image chroma representation Application: Score Viewer Audio Structure Analysis Given: CD recording Goal: Automatic extraction of the repetitive structure (or of the musical form) Example: Brahms Hungarian Dance No. 5 (Ormandy) 50 100 150 200 Basic Procedure Basic Procedure Self-similarity matrix Similarity structure Self-similarity matrix Similarity structure 50 100 150 200 50 100 150 200

Basic Procedure Basic Procedure Self-similarity matrix Similarity structure Self-similarity matrix Similarity structure 50 100 150 200 50 100 150 200 Basic Procedure Basic Procedure Self-similarity matrix Similarity structure Self-similarity matrix Similarity structure 50 100 150 200 50 100 150 200 Music Processing Music Processing Coarse Level Fine Level Coarse Level Fine Level What do different versions have in common? What are the characteristics of a specific version? What do different versions have in common? What are the characteristics of a specific version? What makes up a piece of music? What makes music come alive?

Music Processing Music Processing Coarse Level Fine Level Coarse Level Fine Level What do different versions have in common? What are the characteristics of a specific version? What do different versions have in common? What are the characteristics of a specific version? What makes up a piece of music? What makes music come alive? What makes up a piece of music? What makes music come alive? Identify despite of differences Identify the differences Identify despite of differences Identify the differences Example tasks: Audio Matching Cover Song Identification Example tasks: Performance Analysis Performance Analysis Performance Analysis: Tempo Curves Schumann: Träumerei 1. Capture nuances regarding tempo, dynamics, articulation, timbre, 2. Discover commonalities between different performances and derive general performance rules 3. Characterize the style of a specific musician (``Horowitz Factor ) Performance: Performance Analysis: Tempo Curves Schumann: Träumerei Score (reference): Performance Analysis: Tempo Curves Schumann: Träumerei Score (reference): Performance: Strategy: Compute score-audio synchronization and derive tempo curve Performance:

Performance Analysis: Tempo Curves Schumann: Träumerei Score (reference): Performance Analysis: Tempo Curves Schumann: Träumerei Score (reference): Tempo Curve: Tempo Curves: Musical tempo (BPM) Musical tempo (BPM) Musical time (measures) Musical time (measures) Performance Analysis: Tempo Curves Schumann: Träumerei Score (reference): Performance Analysis Schumann: Träumerei What can be done if no reference is available? Tempo Curves: Tempo Curves: Musical tempo (BPM) Musical tempo (BPM) Musical time (measures) Musical time (measures) Music Processing Music Processing Relative Absolute Relative Absolute Given: Several versions Given: One version Given: Several versions Given: One version Comparison of extracted parameters Direct interpretation of extracted parameters

Music Processing Music Processing Relative Absolute Relative Absolute Given: Several versions Given: One version Given: Several versions Given: One version Comparison of extracted parameters Direct interpretation of extracted parameters Comparison of extracted parameters Direct interpretation of extracted parameters Extraction errors have often no consequence on final result Extraction errors immediately become evident Extraction errors have often no consequence on final result Extraction errors immediately become evident Example tasks: Music Synchronization Genre Classification Example tasks: Music Transcription Measure Tactus (beat) Tatum (temporal atom) and Beat Tracking Example: Chopin Mazurka Op. 68-3 Pulse level: Quarter note Tempo:???

and Beat Tracking Example: Chopin Mazurka Op. 68-3 Pulse level: Tempo: Quarter note 50-200 BPM Which temporal level? Local tempo deviations Tempo curve Tempo (BPM) 200 50 Sparse information (e.g., only note onsets available) Vague information (e.g., extracted note onsets corrupt) Time (beats) Local Energy Curve: Local Energy Curve: Note Onset Positions Energy Energy Spectrogram Steps: Compressed Spectrogram Steps: 1. Spectrogram 1. Spectrogram 2. Log Compression Frequency (Hz) Frequency (Hz)

Difference Spectrogram Steps: Steps: Frequency (Hz) 1. Spectrogram 2. Log Compression 3. Differentiation Novelty Curve 1. Spectrogram 2. Log Compression 3. Differentiation 4. Accumulation Steps: Steps: 1. Spectrogram 2. Log Compression 3. Differentiation 4. Accumulation 1. Spectrogram 2. Log Compression 3. Differentiation 4. Accumulation 5. Normalization Novelty Curve Local Average Novelty Curve Tempo (BPM) Intensity Tempo (BPM) Intensity

Tempo (BPM) Intensity Tempo (BPM) Intensity Borodin String Quartet No. 2 Novelty Curve Tempo (BPM) Predominant Local Pulse (PLP) Borodin String Quartet No. 2 Motivic Similarity Tempo (BPM)

Motivic Similarity Motivic Similarity (1st Mov.) (1st Mov.) (3rd Mov.) Motivic Similarity Motivic Similarity B A C H (1st Mov.) (3rd Mov.) Beethoven s Appassionata Thanks Michael Clausen (Bonn University) David Damm (Bonn University) Jonathan Driedger (University of Erlangen-Nürnberg) Sebastian Ewert (Bonn University) Christian Fremerey (Bonn University) Peter Grosche (Saarland University) Nanzhu Jiang (University of Erlangen-Nürnberg) Verena Konz (Saarland University) Frank Kurth (Fraunhofer-FKIE, Wachtberg) Thomas Prätzlich (University of Erlangen-Nürnberg) Verena Thomas (Bonn University) Selected Publications (Music Processing) M. Müller, P.W. Ellis, A. Klapuri, G. Richard (2011): Signal Processing for Music Analysis. IEEE Journal of Selected Topics in Signal Processing, Vol. 5, No. 6, pp. 1088-1110. P. Grosche and M. Müller (2011): Extracting Predominant Local Pulse Information from Music Recordings. IEEE Trans. on Audio, Speech & Language Processing, Vol. 19, No. 6, pp. 1688-1701. M. Müller, M. Clausen, V. Konz, S. Ewert, C. Fremerey (2010): A Multimodal Way of Experiencing and Exploring Music. Interdisciplinary Science Reviews (ISR), Vol. 35, No. 2. M. Müller and S. Ewert (2010): Towards Timbre-Invariant Audio Features for Harmony-Based Music. IEEE Trans. on Audio, Speech & Language Processing, Vol. 18, No. 3, pp. 649-662. F. Kurth, M. Müller (2008): Efficient Index-Based Audio Matching. IEEE Trans. Audio, Speech & Language Processing, Vol. 16, No. 2, 382-395. M. Müller (2007): Information Retrieval for Music and Motion. Monograph, Springer, 318 pages