Further Topics in MIR

Tutorial: Automatisierte Methoden der Musikverarbeitung (Automated Methods of Music Processing)
47. Jahrestagung der Gesellschaft für Informatik (47th Annual Conference of the German Informatics Society)
Further Topics in MIR
Meinard Müller, Christof Weiss, Stefan Balke
International Audio Laboratories Erlangen
{meinard.mueller, christof.weiss, stefan.balke}@audiolabs-erlangen.de

Why is Music Processing Challenging?
Example: Chopin, Mazurka Op. 63 No. 3
[Figure: waveform (amplitude over time in seconds) and spectrogram (frequency in Hz over time in seconds)]
Performance: tempo, dynamics, note deviations, sustain pedal
Polyphony: main melody, additional melody line, accompaniment

Source Separation
Decomposition of an audio stream into different sound sources
Central task in digital signal processing
Cocktail party effect
Several input signals
Sources are assumed to be statistically independent

Source Separation (Music)
Main melody, accompaniment, drum track
Instrumental voices
Individual note events
Only mono or stereo
Sources are often highly dependent

Harmonic-Percussive Decomposition
The mixture is decomposed into three components:
Harmonic component: clearly harmonic sounds of singing voice and accompaniment
Residual component: noise-like sounds, vibrato/glissando sounds
Percussive component: drum hits, fricatives & plosives in singing voice
Literature: [Driedger/Müller/Disch, ISMIR 2014]
Demo: https://www.audiolabs-erlangen.de/resources/2014-ismir-exthpsep/
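The slide does not spell out how the three components are obtained. The following is a minimal sketch of one common approach, median filtering of a magnitude spectrogram with a separation factor, and it is an assumption that this matches the cited method in detail. The function names, the filter lengths, and the parameter `beta` are illustrative choices, not taken from the slides.

```python
import numpy as np

def median_filter_1d(Y, length, axis):
    """Median-filter a 2D array along one axis (odd length, edge padding)."""
    half = length // 2
    pad = [(0, 0), (0, 0)]
    pad[axis] = (half, half)
    Yp = np.pad(Y, pad, mode="edge")
    n = Y.shape[axis]
    # Stack shifted copies of the padded array and take the median across them.
    shifted = [np.take(Yp, np.arange(k, k + n), axis=axis) for k in range(length)]
    return np.median(np.stack(shifted, axis=0), axis=0)

def hpr_masks(Y, len_time=5, len_freq=5, beta=2.0, eps=1e-12):
    """Binary masks (harmonic, residual, percussive) for a magnitude
    spectrogram Y with shape (frequency bins, time frames).

    Assumed convention: beta >= 1 is a separation factor; larger values
    push more energy into the residual component.
    """
    Y_h = median_filter_1d(Y, len_time, axis=1)  # smooth along time: keeps horizontal lines
    Y_p = median_filter_1d(Y, len_freq, axis=0)  # smooth along frequency: keeps vertical lines
    M_h = Y_h / (Y_p + eps) > beta    # clearly harmonic bins
    M_p = Y_p / (Y_h + eps) >= beta   # clearly percussive bins
    M_r = ~(M_h | M_p)                # everything else goes to the residual
    return M_h, M_r, M_p
```

Multiplying a complex STFT element-wise by each mask and inverting the result would give the three component signals; with `beta = 1` the residual is almost empty and the sketch reduces to a plain harmonic-percussive split.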

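Singing Voice Extraction
Goal: split the original recording into singing voice and accompaniment.
[Figure: processing pipeline over time. The original recording is decomposed by HPR into harmonic, percussive, and residual components; stages labeled MR, TR, and SL, supported by an F0 annotation, then split the components further.]
Harmonic component: harmonic portion of the singing voice, harmonic portion of the accompaniment
Percussive component: fricatives of the singing voice, instrument onsets of the accompaniment
Residual component: vibrato & formants of the singing voice, diffuse instrument sounds of the accompaniment
The singing-voice portions are added up to the estimated singing voice, the accompaniment portions to the estimated accompaniment.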

Score-Informed Source Separation
Exploit the musical score to support the separation process
[Figure: three piano-roll panels, pitch over time]

Parametric Model Approach
Rebuild spectrogram information: estimate parameters, then render
[Figure: two spectrograms, frequency (Hz) over time (seconds)]

NMF (Nonnegative Matrix Factorization)
Magnitude spectrogram (N × M, ≥ 0) ≈ Templates (N × K, ≥ 0) · Activations (K × M, ≥ 0)
Templates: pitch + timbre ("How does it sound?")
Activations: onset time + duration ("When does it sound?")
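As a rough illustration of how templates and activations can be learned, here is a minimal sketch using the standard multiplicative update rules for the Euclidean distance. The names `V`, `W`, `H` and the assignment of `N`, `M`, `K` to matrix dimensions are assumptions made for this example; the slides do not state which cost function or update rule is used.

```python
import numpy as np

def nmf(V, K, n_iter=200, eps=1e-9, seed=0):
    """Factorize a nonnegative matrix V (N x M) as W (N x K) times H (K x M).

    W holds the templates (how it sounds), H the activations (when it sounds).
    Random initialization; multiplicative updates keep all entries nonnegative.
    """
    rng = np.random.default_rng(seed)
    N, M = V.shape
    W = rng.random((N, K)) + eps
    H = rng.random((K, M)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)  # update activations
        W *= (V @ H.T) / (W @ H @ H.T + eps)  # update templates
    return W, H
```

With random initialization the learned columns of `W` need not correspond to individual notes, which is the motivation for constraining the initialization.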

NMF-Decomposition: random initialization
Initialized templates and activations are random
Learnt templates and activations have no semantic meaning
[Figure: templates (over note number) and activations (note number over time), before and after learning]

NMF-Decomposition: constrained initialization
Template constraint and activation constraints shown for p=55
Learnt templates and activations stay within the constraints: NMF as refinement
[Figure: initialized and learnt templates and activations (note number over time); original and model spectrogram]
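Constrained initialization works with multiplicative updates because an entry that starts at zero stays at zero. The sketch below illustrates this under stated assumptions: the helper names, the harmonic template shape, and the tolerance values are hypothetical and only meant to show the principle for a note such as p=55.

```python
import numpy as np

def pitch_to_hz(p):
    """Center frequency of MIDI pitch p (A4 = 69 = 440 Hz)."""
    return 440.0 * 2.0 ** ((p - 69) / 12.0)

def template_constraint(p, freqs, n_harmonics=8, tol_semitones=0.5):
    """Binary template: 1 near the first harmonics of pitch p, 0 elsewhere."""
    w = np.zeros(len(freqs))
    for h in range(1, n_harmonics + 1):
        lo = h * pitch_to_hz(p - tol_semitones)
        hi = h * pitch_to_hz(p + tol_semitones)
        w[(freqs >= lo) & (freqs <= hi)] = 1.0
    return w

def activation_constraint(onset, offset, times, tol=0.1):
    """Binary activation: 1 inside the note's time span (plus tolerance)."""
    return ((times >= onset - tol) & (times <= offset + tol)).astype(float)

def refine(V, W0, H0, n_iter=50, eps=1e-9):
    """Multiplicative NMF updates starting from constrained W0 and H0.

    Zero entries are multiplied by a finite factor, so they remain zero.
    """
    W, H = W0.copy(), H0.copy()
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H
```

A possible usage: build one template column and one activation row per note in the score, stack them into `W0` and `H0`, and call `refine` on the magnitude spectrogram. The score then fixes where energy may appear, and NMF only adjusts how much.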

Score-Informed Audio Decomposition
Application: Audio editing
[Figure: spectrogram excerpts, frequency (Hertz) over time (seconds); two detail panels with marked frequencies of 523 Hz and 554 Hz, respectively]
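To edit a single note, the decomposition has to be turned back into one spectrogram per component. A common way to do this is soft masking, sketched below. Whether the system shown on the slide uses exactly this step is an assumption, and the function name is hypothetical.

```python
import numpy as np

def component_spectrograms(V, W, H, eps=1e-12):
    """Split spectrogram V into one part per NMF component via soft masks.

    Each mask is the share of component k in the model W @ H, so the parts
    add up to (approximately) V.
    """
    model = W @ H + eps
    return [V * np.outer(W[:, k], H[k, :]) / model for k in range(W.shape[1])]
```

Each returned part can then be attenuated, removed, or shifted before the parts are summed again and converted back to audio.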

Informed Drum-Sound Decomposition
Remix: [audio example]
Literature: [Dittmar/Müller, IEEE/ACM-TASLP 2016]
Demo: https://www.audiolabs-erlangen.de/resources/mir/2016-ieee-taslp-drumseparation

Loop Decomposition of EDM (electronic dance music)
Decomposition into patterns and activations
Literature: [López-Serrano/Dittmar/Müller, ISMIR 2016]
Demo: https://www.audiolabs-erlangen.de/resources/mir/2016-ismir-emloop

Audio Mosaicing
Target signal: Beatles, "Let It Be"
Source signal: Bees
Mosaic signal: "Let It Bee"
Literature: [Driedger/Müller, ISMIR 2015]
Demo: https://www.audiolabs-erlangen.de/resources/mir/2015-ismir-letitbee

NMF-Inspired Audio Mosaicing
Non-negative matrix factorization (NMF): non-negative matrix (fixed) ≈ components (learned) · activations (learned)
Proposed audio mosaicing approach: target's spectrogram (fixed) ≈ source's spectrogram (fixed) · activations (learned) = mosaic's spectrogram
Axes: the source spectrogram runs over source time, the target and mosaic spectrograms run over target time, and the activation matrix has source time on one axis and target time on the other.
Iterative updates: preserve temporal context
Core idea: support the development of sparse diagonal activation structures
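The following is a heavily simplified sketch of the idea: the source spectrogram is used as a fixed template matrix, only the activations are updated, and diagonal runs in the activation matrix are reinforced in every iteration. It is an assumption-laden illustration, not the published procedure; the function names, the kernel half-length `c`, and the Euclidean update rule are choices made for this example, and further steps that encourage sparsity are omitted.

```python
import numpy as np

def diagonal_enhance(H, c=2):
    """Sum activations along diagonals of half-length c (zero padding).

    out[k, m] = sum over i in [-c, c] of H[k + i, m + i], which favors
    runs where consecutive source frames follow each other in the target.
    """
    K, M = H.shape
    Hp = np.pad(H, c)
    out = np.zeros_like(H)
    for i in range(-c, c + 1):
        out += Hp[c + i : c + i + K, c + i : c + i + M]
    return out

def mosaic_activations(V, W, n_iter=30, c=2, eps=1e-9, seed=0):
    """Learn activations H so that W @ H approximates the target V.

    V: target magnitude spectrogram (frequency x target time), fixed.
    W: source magnitude spectrogram (frequency x source time), fixed.
    H: activations (source time x target time), learned.
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        H = diagonal_enhance(H, c)            # preserve temporal context
        H *= (W.T @ V) / (W.T @ W @ H + eps)  # only H is updated, W stays fixed
    return H
```

The mosaic spectrogram is then `W @ H`. Because every column of the mosaic is a nonnegative combination of source frames, the result keeps the timbre of the source while following the target.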

Audio Mosaicing
Target signal: Chic, "Good Times"
Source signal: Whales
Mosaic signal: [audio example]

Audio Mosaicing
Target signal: Adele, "Rolling in the Deep"
Source signal: Race car
Mosaic signal: [audio example]

Motivic Similarity
Example: the B-A-C-H motif

Summary
Music information retrieval
Audio decomposition techniques
Machine learning
Teaching
Academic training of students
Fundamental research
Music applications & musicology
Multimedia scenarios
Web-based interfaces

Book: Fundamentals of Music Processing
Meinard Müller
Fundamentals of Music Processing: Audio, Analysis, Algorithms, Applications
483 p., 249 illus., hardcover
ISBN: 978-3-319-21944-8
Springer, 2015
Accompanying website: www.music-processing.de
