Introductions to Music Information Retrieval

Similar documents
Video-based Vibrato Detection and Analysis for Polyphonic String Music

Music Representations

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Music Information Retrieval (MIR)

CSC475 Music Information Retrieval

Music Information Retrieval

Beethoven, Bach, and Billions of Bytes

Music Processing Introduction Meinard Müller

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

Copyright 2009 Pearson Education, Inc. or its affiliate(s). All rights reserved. NES, the NES logo, Pearson, the Pearson logo, and National

Music Information Retrieval (MIR)

Music Representations

A Study of Synchronization of Audio Data with Symbolic Data. Music254 Project Report Spring 2007 SongHui Chon

Further Topics in MIR

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Music Information Retrieval

Topic 10. Multi-pitch Analysis

Computational Modelling of Harmony

Music Radar: A Web-based Query by Humming System

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Music Understanding and the Future of Music

CTP431- Music and Audio Computing Music Information Retrieval. Graduate School of Culture Technology KAIST Juhan Nam

Beethoven, Bach und Billionen Bytes

Outline. Why do we classify? Audio Classification

Music Similarity and Cover Song Identification: The Case of Jazz

Grade 4 Music Curriculum Maps

Semi-automated extraction of expressive performance information from acoustic recordings of piano music. Andrew Earis

Tempo and Beat Analysis

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

City, University of London Institutional Repository

Lecture 9 Source Separation

Music Information Retrieval

Automatic music transcription

A SCORE-INFORMED PIANO TUTORING SYSTEM WITH MISTAKE DETECTION AND SCORE SIMPLIFICATION

Automatic characterization of ornamentation from bassoon recordings for expressive synthesis

The MAMI Query-By-Voice Experiment Collecting and annotating vocal queries for music information retrieval

MUSIC CURRICULM MAP: KEY STAGE THREE:

Curriculum Standard One: The student will listen to and analyze music critically, using vocabulary and language of music.

K-12 Performing Arts - Music Standards Lincoln Community School Sources: ArtsEdge - National Standards for Arts Education

Curriculum Standard One: The student will listen to and analyze music critically, using the vocabulary and language of music.

Robert Alexandru Dobre, Cristian Negrescu

Music Information Retrieval

ESTIMATING THE ERROR DISTRIBUTION OF A TAP SEQUENCE WITHOUT GROUND TRUTH 1

2 2. Melody description The MPEG-7 standard distinguishes three types of attributes related to melody: the fundamental frequency LLD associated to a t

Music Information Retrieval. Juan P Bello

TExES Music EC 12 (177) Test at a Glance

A STATISTICAL VIEW ON THE EXPRESSIVE TIMING OF PIANO ROLLED CHORDS

Music Information Retrieval with Temporal Features and Timbre

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Course Overview. Assessments What are the essential elements and. aptitude and aural acuity? meaning and expression in music?

6.UAP Project. FunPlayer: A Real-Time Speed-Adjusting Music Accompaniment System. Daryl Neubieser. May 12, 2016

TEST SUMMARY AND FRAMEWORK TEST SUMMARY

Curriculum Standard One: The student will listen to and analyze music critically, using the vocabulary and language of music.

Soundprism: An Online System for Score-Informed Source Separation of Music Audio Zhiyao Duan, Student Member, IEEE, and Bryan Pardo, Member, IEEE

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

FINE ARTS Institutional (ILO), Program (PLO), and Course (SLO) Alignment

Curriculum Standard One: The student will listen to and analyze music critically, using the vocabulary and language of music.

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS

Music. Music Instrumental. Program Description. Fine & Applied Arts/Behavioral Sciences Division

Lecture 10 Harmonic/Percussive Separation

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

Music Standards for Band. Proficient Apply instrumental technique (e.g., fingerings, bowings, stickings, playing position, tone quality, articulation)

Music Information Retrieval Using Audio Input

TEST SUMMARY AND FRAMEWORK TEST SUMMARY

Greeley-Evans School District 6 High School Vocal Music Curriculum Guide Unit: Men s and Women s Choir Year 1 Enduring Concept: Expression of Music

Automatic Rhythmic Notation from Single Voice Audio Sources

Assessment Schedule 2013 Making Music: Integrate aural skills into written representation (91420)

Frankenstein: a Framework for musical improvisation. Davide Morelli

Toward a Computationally-Enhanced Acoustic Grand Piano

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Automatic music transcription

Singer Traits Identification using Deep Neural Network

Automatic Piano Music Transcription

NOTE-LEVEL MUSIC TRANSCRIPTION BY MAXIMUM LIKELIHOOD SAMPLING

Singing Pitch Extraction and Singing Voice Separation

The Keyboard. An Introduction to. 1 j9soundadvice 2013 KS3 Keyboard. Relevant KS3 Level descriptors; The Tasks. Level 4

A DISCRETE FILTER BANK APPROACH TO AUDIO TO SCORE MATCHING FOR POLYPHONIC MUSIC

MUSIC THEORY CURRICULUM STANDARDS GRADES Students will sing, alone and with others, a varied repertoire of music.

ILLINOIS LICENSURE TESTING SYSTEM

Level 1 Music, Demonstrate knowledge of conventions used in music scores a.m. Wednesday 11 November 2015 Credits: Four

CS 591 S1 Computational Audio

Transcription An Historical Overview

JGuido Library: Real-Time Score Notation from Raw MIDI Inputs

A CHROMA-BASED SALIENCE FUNCTION FOR MELODY AND BASS LINE ESTIMATION FROM MUSIC AUDIO SIGNALS

Curriculum Framework for Performing Arts

Advanced Placement Music Theory

Grade HS Band (1) Basic

Proc. of NCC 2010, Chennai, India A Melody Detection User Interface for Polyphonic Music

However, in studies of expressive timing, the aim is to investigate production rather than perception of timing, that is, independently of the listene

Texas State Solo & Ensemble Contest. May 25 & May 27, Theory Test Cover Sheet

Assessment Schedule 2016 Music: Demonstrate knowledge of conventions in a range of music scores (91276)

jsymbolic 2: New Developments and Research Opportunities

Characteristics of Polyphonic Music Style and Markov Model of Pitch-Class Intervals

MUSI-6201 Computational Music Analysis

Music Alignment and Applications. Introduction

EVALUATING AUTOMATIC POLYPHONIC MUSIC TRANSCRIPTION

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Course Outcome Summary

Transcription:

Introductions to Music Information Retrieval ECE 272/472 Audio Signal Processing Bochen Li University of Rochester

Wish List For music learners/performers While I play the piano, turn the page for me Tell me if I play wrong notes While I sing a song, automatic play accompaniment for me

Wish List For concert audiences Tell me what is the instrument being played from orchestra Tell me what is the pitch/chord/key/tempo of the music being played Display the lyrics for a choir performance Just record/play the solo part, and mute the others

Wish List For musicologists Transcribe an improvised piano jazz performance into sheet music Numerically compare different artists performances expressiveness and personal styles Scan a sheet music into computer readable format, e.g., MIDI, XML, Lilypond, etc. Generate music following a composer style, e.g., bring Chopin back to life

Wish List For music listeners Always play my favorite song in radio stream Sing a song fragment and find out the name Automatically play the song for my mood

Introduction What is Music Information Retrieval (MIR)? Audio Signal Processing Machine Learning Musicology Psychoacoustics Computer Vision

Automatic Music Transcription Music Synchronization Source Separation Performance Expressiveness Analysis

Automatic Music Transcription the process of converting an acoustic musical signal into some form of music notation (e.g. staff notation, MIDI file, piano-roll,...)

Automatic Music Transcription the process of converting an acoustic musical signal into some form of music notation (e.g. staff notation, MIDI file, piano-roll,...)

Automatic Music Transcription Subtasks: Pitch detection Onset/offset detection Instrument identification Rhythm parsing Identification of dynamics/expression

Automatic Music Transcription State-of-the-art Outline: 1. Multi-pitch Analysis Frame-level Note-level Stream-level 2. Towards a Complete Music Notation

Automatic Music Transcription State-of-the-art Outline: 1. Multi-pitch Analysis Frame-level (multi-pitch estimation) - Estimate pitches and polyphony in each frame Note-level - Estimate pitch, onset, offset of notes Stream-level - Stream pitches by sources

Automatic Music Transcription Frame-level (multi-pitch estimation) - Estimate pitches and polyphony in each frame Note-level - Estimate pitch, onset, offset of notes Stream-level - Stream pitches by sources

Automatic Music Transcription Frame-level (multi-pitch estimation) Categorization of methods: Domain of Operation: - Time - Frequency - Hybrid Core Algorithms: - Signal processing approaches - Maximum likelihood estimation - Bayesian - Spectrogram decomposition - Sparse coding

Automatic Music Transcription Note-level (Note Tracking) Onset Detection - Can be sensitive to onset detection accuracy Post-processing of frame-level results - Form notes independently by connecting nearby pitches - Consider interactions between simultaneous pitches

Automatic Music Transcription Stream-level (Timbre-tracking) Vocal Flute Clarinet Bassoon

Automatic Music Transcription Stream-level (Timbre-tracking) Supervised - Train timbre models of sound sources Unsupervised - Cluster pitch estimates according to timbre

Automatic Music Transcription State-of-the-art Results Frame-level (Multi-pitch Estimation)

Automatic Music Transcription State-of-the-art Results Note-level (Note Tracking)

Automatic Music Transcription State-of-the-art Results Original Audio Transcription results (played as MIDI synthesis) Bach's Minuet in G Chopin's Etude Op. 10 No. 1

Automatic Music Transcription State-of-the-art Outline: 1. Multi-pitch Analysis Frame-level Note-level Stream-level 2. Towards a Complete Music Notation

Automatic Music Transcription State-of-the-art Outline: 2. Towards a Complete Music Notation Current AMT systems can: Detect (multiple) pitches, onsets, offsets Identify instruments in polyphonic music Assign detected notes to a specific instrument Also, some systems are able to: Detect & integrate rhythmic information Detect tuning (per piece/note) Extract velocity per detected note Transcribe fingering (for specific instruments) Quantise pitches over time/beats Significant work needs to be done in order to extract a complete score

Automatic Music Transcription

Automatic Music Transcription Factors: Notes: Spelling, Staff assignment, Group into chords Rests: Duration, Staff assignment Binary matching: Barlines, Clefs, Key signatures, Time signatures

Automatic Music Transcription Music Synchronization Source Separation Performance Expressiveness Analysis

Music Synchronization Concept Align different versions/modalities of music performance Different renderings of audio performances Music score (Sheet music) Video frames

Music Synchronization Categories Offline music alignment - Have the full sequence of both signals - Method: Dynamic Time Warping Realtime music alignment (score following) - Have the full sequence of one signal (music score) - The other signal comes as live streams - The system should find the alignment in real-time

Music Synchronization Applications of offline music alignment Query by Humming System Construct multi-modal music digital library https://jdasam.github.io/performscore/ Music tutoring and grading

Music Synchronization Applications of real-time music alignment Automatic Accompaniment System - the computer follows the musician s speed

Music Synchronization Applications of real-time music alignment Automatic Accompaniment System - the computer follows the musician s speed https://www.youtube.com/watch?v=dnyjkwlzxpm

Music Synchronization Applications of real-time music alignment Automatic Lyrics Display - the computer follows the performance the display the encoded lyrics at the correct timing

Music Synchronization Applications of real-time music alignment Automatic Lyrics Display Project Lyrics - the computer follows the performance the display the encoded lyrics at the correct timing Algorithm Running

Automatic Music Transcription Music Synchronization Source Separation Performance Expressiveness Analysis

Source Separation Concept Separate the sound mixture into individual sources

Source Separation Time Domain Mixture It is not easy!

Source Separation Frequency Domain Spectrogram of sound mixture Mask Spectrogram of separated source The trick is to find the right mask

Source Separation Harmonic mask: Given the pitch track, know where to expect harmonics Spectrogram of sound mixture Mask Spectrogram of separated source

Source Separation Results: https://www.youtube.com/watch?v=b07t-y1jncs https://www.youtube.com/watch?v=c8azyvafcje http://paris.cs.illinois.edu/demos/ai/user-guide.mp4

Automatic Music Transcription Music Synchronization Source Separation Performance Expressiveness Analysis

Performance Expressiveness Analysis What is expressiveness? Volume Tempo Legato/Staccato/Vibrato Up-bow/Down-bow (String instrumentalists) Body Movements Some are visual aspect of music performance

Performance Expressiveness Analysis Vibrato Analysis Important artistic effect Pitch modulation of a note in a periodic fashion Characterized by Rate & Extent Spectrogram Audio Non-vibrato Vibrato

Performance Expressiveness Analysis Vibrato Analysis Vibrato Detection Note-level vibrato/non-vibrato classification Vibrato Characterization Pitch Vibrato rate: speed of pitch variation (1/T Hz) Time Vibrato extent: amount of pitch variation (A cents) Pitch A T Time

Performance Expressiveness Analysis Vibrato Analysis from visual modality Audio-based, Polyphonic Spec Pitch Video-based Hand 0 0.2 0.4 0.6 0.8 1.0 1.2 sec Hand Displacement 0 0.2 0.4 0.6 0.8 1.0 1.2 sec

Performance Expressiveness Analysis Visually Onset Prediction Predict the onset from body motion Help a robot accompanying player better synchronize with human Prediction Bar

Performance Expressiveness Analysis Body Expressiveness Modeling and Generation Applications Visual expressions give immersive music enjoyment experiences Replicating musicians body motions for educational purposes Visual human-computer interactions in automatic accompaniment system

Performance Expressiveness Analysis System 1: MIDI to Pose Input a sheet music, and output a simulation of human s body motion of playing this piece

Performance Expressiveness Analysis System 1: MIDI to Pose Input a sheet music, and output a simulation of human s body motion of playing this piece

Performance Expressiveness Analysis System 2: Audio to Body Dynamics Input an audio performance, and output a simulation of human s body motion of playing this piece

MIR Community International Society for Music Information Retrieval (ISMIR) The International Conference on New Interfaces for Musical Expression (NIME) Sound and Music Computing (SMC) International Computer Music Conference (ICMC) MIR Evaluation Exchange (MIREX) http://www.music-ir.org/mirex/wiki/mirex_home MIR-related PhD thesis http://www.pampalk.at/mir-phds/