Labelling. Friday 18th May. Goldsmiths, University of London. Bayesian Model Selection for Harmonic. Labelling. Christophe Rhodes.

Similar documents
Bayesian Model Selection for Harmonic Labelling

A PROBABILISTIC TOPIC MODEL FOR UNSUPERVISED LEARNING OF MUSICAL KEY-PROFILES

Week 14 Music Understanding and Classification

Curriculum Standard One: The student will listen to and analyze music critically, using the vocabulary and language of music.

Leaving Certificate 2017: Music Marking Scheme Composing Higher level - Core. SECTION A MELODY COMPOSITION (40 marks) Q Descriptors Mark

FANTASTIC: A Feature Analysis Toolbox for corpus-based cognitive research on the perception of popular music

BayesianBand: Jam Session System based on Mutual Prediction by User and System

Curriculum Standard One: The student will listen to and analyze music critically, using vocabulary and language of music.

Take a Break, Bach! Let Machine Learning Harmonize That Chorale For You. Chris Lewis Stanford University

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

The purpose of this essay is to impart a basic vocabulary that you and your fellow

Computational Modelling of Harmony

Chorale Harmonisation in the Style of J.S. Bach A Machine Learning Approach. Alex Chilvers

MUSIC PERFORMANCE: GROUP

CPU Bach: An Automatic Chorale Harmonization System

Notes on David Temperley s What s Key for Key? The Krumhansl-Schmuckler Key-Finding Algorithm Reconsidered By Carley Tanoue

Harmonic Generation based on Harmonicity Weightings

Algorithmic Music Composition

Probabilist modeling of musical chord sequences for music analysis

Statistical Modeling and Retrieval of Polyphonic Music

Curriculum Standard One: The student will listen to and analyze music critically, using the vocabulary and language of music.

Curriculum Standard One: The student will listen to and analyze music critically, using the vocabulary and language of music.

Bach in a Box - Real-Time Harmony

A probabilistic framework for audio-based tonal key and chord recognition

TEST SUMMARY AND FRAMEWORK TEST SUMMARY

Course Overview. Assessments What are the essential elements and. aptitude and aural acuity? meaning and expression in music?

PRESCOTT UNIFIED SCHOOL DISTRICT District Instructional Guide January 2016

Copyright 2009 Pearson Education, Inc. or its affiliate(s). All rights reserved. NES, the NES logo, Pearson, the Pearson logo, and National

SAMPLE ASSESSMENT TASKS MUSIC CONTEMPORARY ATAR YEAR 12

Influence of timbre, presence/absence of tonal hierarchy and musical training on the perception of musical tension and relaxation schemas

2 The Tonal Properties of Pitch-Class Sets: Tonal Implication, Tonal Ambiguity, and Tonalness

Harmonising Chorales by Probabilistic Inference

The Human Features of Music.

A new tool for measuring musical sophistication: The Goldsmiths Musical Sophistication Index

Music Information Retrieval with Temporal Features and Timbre

Chords not required: Incorporating horizontal and vertical aspects independently in a computer improvisation algorithm

ANNOTATING MUSICAL SCORES IN ENP

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

SAMPLE ASSESSMENT TASKS MUSIC CONTEMPORARY ATAR YEAR 11

TEST SUMMARY AND FRAMEWORK TEST SUMMARY

SAMPLE ASSESSMENT OUTLINE MUSIC ALL CONTEXTS ATAR YEAR 11

MELODIC AND RHYTHMIC EMBELLISHMENT IN TWO VOICE COMPOSITION. Chapter 10

6.UAP Project. FunPlayer: A Real-Time Speed-Adjusting Music Accompaniment System. Daryl Neubieser. May 12, 2016

IMPROVING PREDICTIONS OF DERIVED VIEWPOINTS IN MULTIPLE VIEWPOINT SYSTEMS

IEEE Proof Web Version

SAMPLE ASSESSMENT TASKS MUSIC JAZZ ATAR YEAR 11

Fourier Integral Representations Basic Formulas and facts

DETECTION OF KEY CHANGE IN CLASSICAL PIANO MUSIC

Outline. Why do we classify? Audio Classification

Music Composition with RNN

Texas State Solo & Ensemble Contest. May 25 & May 27, Theory Test Cover Sheet

Pitfalls and Windfalls in Corpus Studies of Pop/Rock Music

Probabilistic and Logic-Based Modelling of Harmony

Topic 10. Multi-pitch Analysis

2011 Music Performance GA 3: Aural and written examination

Monday 23 May 2016 Morning

Theory of Music Grade 2

Music Understanding and the Future of Music

Extracting Information from Music Audio

Creating a Feature Vector to Identify Similarity between MIDI Files

2014 Music Performance GA 3: Aural and written examination

PLANE TESSELATION WITH MUSICAL-SCALE TILES AND BIDIMENSIONAL AUTOMATIC COMPOSITION

1 Overview. 1.1 Nominal Project Requirements

A Shift-Invariant Latent Variable Model for Automatic Music Transcription

Structured training for large-vocabulary chord recognition. Brian McFee* & Juan Pablo Bello

T Y H G E D I. Music Informatics. Alan Smaill. Jan 21st Alan Smaill Music Informatics Jan 21st /1

MorpheuS: constraining structure in automatic music generation

Characteristics of Polyphonic Music Style and Markov Model of Pitch-Class Intervals

jsymbolic 2: New Developments and Research Opportunities

Introductions to Music Information Retrieval

MUSIC PERFORMANCE: SOLO

Automatic Labelling of tabla signals

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function

Music Similarity and Cover Song Identification: The Case of Jazz

MUSIC GROUP PERFORMANCE

Composer Style Attribution

1/18/17. MUSIC 232: Fundamentals of Music Teaching GB: Loops Only Project

CTP431- Music and Audio Computing Music Information Retrieval. Graduate School of Culture Technology KAIST Juhan Nam

Total Section A (/45) Total Section B (/45)

/$ IEEE

DEPARTMENT/GRADE LEVEL: Band (7 th and 8 th Grade) COURSE/SUBJECT TITLE: Instrumental Music #0440 TIME FRAME (WEEKS): 36 weeks

BA(Hons) Creative Music Performance JTC GUITAR

REPORT ON THE NOVEMBER 2009 EXAMINATIONS

Event-based Multitrack Alignment using a Probabilistic Framework

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

SAMPLE ASSESSMENT TASKS MUSIC GENERAL YEAR 12

Audio Feature Extraction for Corpus Analysis

MUSIC CURRICULM MAP: KEY STAGE THREE:

Sample assessment task. Task details. Content description. Year level 10

Perceptual dimensions of short audio clips and corresponding timbre features

School of Church Music Southwestern Baptist Theological Seminary

TREE MODEL OF SYMBOLIC MUSIC FOR TONALITY GUESSING

CONTEMPORARY MUSIC. ATAR course examination Marking Key

AP MUSIC THEORY 2006 SCORING GUIDELINES. Question 7

Sequential Association Rules in Atonal Music

Chord Representations for Probabilistic Models

Music Information Retrieval Community

Soundprism: An Online System for Score-Informed Source Separation of Music Audio Zhiyao Duan, Student Member, IEEE, and Bryan Pardo, Member, IEEE

Music Information Retrieval

Transcription:

Selection Bayesian Goldsmiths, University of London Friday 18th May

Selection 1 Selection 2 3 4

Selection The task: identifying chords and assigning harmonic labels in popular music. currently to MIDI transcriptions of performances; could be applied to audio directly (given suitable processing). Applications: generating fake books, guitar chords feeding into models of music cognition, melodic memory

Selection The task: identifying chords and assigning harmonic labels in popular music. currently to MIDI transcriptions of performances; could be applied to audio directly (given suitable processing). Applications: generating fake books, guitar chords feeding into models of music cognition, melodic memory

Selection The task: identifying chords and assigning harmonic labels in popular music. currently to MIDI transcriptions of performances; could be applied to audio directly (given suitable processing). Applications: generating fake books, guitar chords feeding into models of music cognition, melodic memory

Selection The task: identifying chords and assigning harmonic labels in popular music. currently to MIDI transcriptions of performances; could be applied to audio directly (given suitable processing). Applications: generating fake books, guitar chords feeding into models of music cognition, melodic memory

Selection Previous work: preference rules / knowledge representation Building a better model implicit models of harmony and key (e.g. profiles) embedding in suitable space (spiral, torus) Common feature: label small section and then smooth. We attempt to build a model with some desireable attributes: credible, at least at the descriptive level; quantitative enough to be usable in further inference; able to be extended incorporate new information quantitatively.

Selection Previous work: preference rules / knowledge representation Building a better model implicit models of harmony and key (e.g. profiles) embedding in suitable space (spiral, torus) Common feature: label small section and then smooth. We attempt to build a model with some desireable attributes: credible, at least at the descriptive level; quantitative enough to be usable in further inference; able to be extended incorporate new information quantitatively.

Selection What is the next number in the series -1, 3, 7, 11,...? How many boxes in this scene? The problem

Selection The solution accept the simplest explanation that fits the data. Why? Build alternative classes of model which are capable of explaining the data, and compute and compare likelihoods of given data. -1, 3, 7, 11,...? f (n) = x 0 + kn: 15, 19 f (n) = x 0 + dn 2 + cn 3 : -19.9, 1043.8

Selection The chord model For pitch-class vector x, we express probability density given chord c as where p(x c;ω) = p D (t t c;ω)p D (rmd c;ω) p D (x α c ) = 1 B(α) i x α i 1 i ( i x i = 1) Then by Bayes theorem, for a given pitch-class vector x p(c xω) = p(x cω)p(cω) c p(x cω)p(cω)

Selection The chord model For pitch-class vector x, we express probability density given chord c as where p(x c;ω) = p D (t t c;ω)p D (rmd c;ω) p D (x α c ) = 1 B(α) i x α i 1 i ( i x i = 1) Then by Bayes theorem, for a given pitch-class vector x p(c xω) = p(x cω)p(cω) c p(x cω)p(cω)

Selection The chord model For pitch-class vector x, we express probability density given chord c as where p(x c;ω) = p D (t t c;ω)p D (rmd c;ω) p D (x α c ) = 1 B(α) i x α i 1 i ( i x i = 1) Then by Bayes theorem, for a given pitch-class vector x p(c xω) = p(x cω)p(cω) c p(x cω)p(cω)

Selection p D (x α c ) = 1 B(α) i x α i 1 i Dirichlet distributions ( i x i = 1) For two variables, choose one x (and the other is 1 x) pd(x {4, 3}) pd(x {4, 0.5}) pd(x {0.5, 0.5}) 0 x 1 0 x 1 0 x 1

Selection Parameter estimation A Dirichlet distribution over k variables has k parameters. Our model has (in principle) two Dirichlet distributions per distinct chord. Based on initial inspection of our corpus, we tie parameters such that there are only three distinct cases (instead of the 4 12 6 that there are in principle for our chord repertoire): major or minor chord over a whole bar; major or minor chord over a sub-bar window; anything else (aug, dim, sus4, sus9). Estimate parameters for these distributions Maximize likelihood of training set; Maximize posterior of training set given a suitable prior; Tune to maximize performance of labelling task on training set.

Selection Parameter estimation A Dirichlet distribution over k variables has k parameters. Our model has (in principle) two Dirichlet distributions per distinct chord. Based on initial inspection of our corpus, we tie parameters such that there are only three distinct cases (instead of the 4 12 6 that there are in principle for our chord repertoire): major or minor chord over a whole bar; major or minor chord over a sub-bar window; anything else (aug, dim, sus4, sus9). Estimate parameters for these distributions Maximize likelihood of training set; Maximize posterior of training set given a suitable prior; Tune to maximize performance of labelling task on training set.

Selection Parameter estimation A Dirichlet distribution over k variables has k parameters. Our model has (in principle) two Dirichlet distributions per distinct chord. Based on initial inspection of our corpus, we tie parameters such that there are only three distinct cases (instead of the 4 12 6 that there are in principle for our chord repertoire): major or minor chord over a whole bar; major or minor chord over a sub-bar window; anything else (aug, dim, sus4, sus9). Estimate parameters for these distributions Maximize likelihood of training set; Maximize posterior of training set given a suitable prior; Tune to maximize performance of labelling task on training set.

Limitations of this chord model Selection no special treatment of bass note; more generally, no handling of register of individual notes; no modelling of transitions between chords.

Choosing a region Selection When does one chord end and another begin? Assumptions: barline as fundamental division; new chords only on beats. The first assumption is probably reasonable for our task; the second leads to problems in strongly-syncopated passages.

Choosing a region Selection When does one chord end and another begin? Assumptions: barline as fundamental division; new chords only on beats. The first assumption is probably reasonable for our task; the second leads to problems in strongly-syncopated passages.

Selection Choosing a region s: all possible beatwise divisions of a bar. For example, for 4 4, {4} {3,1}, {1,3} {2,2} {2,1,1}, {1,2,1}, {1,1,2} {1,1,1,1} Choose between bar divisions ω using Bayesian model selection: p(ω xω ) c p(x cωω )p(cωω )

Selection Choosing a region s: all possible beatwise divisions of a bar. For example, for 4 4, {4} {3,1}, {1,3} {2,2} {2,1,1}, {1,2,1}, {1,1,2} {1,1,1,1} Choose between bar divisions ω using Bayesian model selection: p(ω xω ) c p(x cωω )p(cωω )

Example I Selection Saving all my love for you (Michael Masser) Bass note assignments and extensions are heuristically derived after the harmonic labelling; future work would incorporate those judgments into the framework.

Example I Selection Saving all my love for you (Michael Masser) Bass note assignments and extensions are heuristically derived after the harmonic labelling; future work would incorporate those judgments into the framework.

Evaluation: how good is our algorithm? Selection Maximum likelihood parameters estimated from training set (233 bars): 53% regions correctly bounded; 75% chords labelled correctly. Parameters tuned to training set: 75% regions correctly bounded; 76% chords labelled correctly.

Example II Selection Lady Madonna (Lennon/McCartney) Is there even a right answer?

Selection Evaluation investigation Current investigation: how much do experts opinions on this task differ? send machine-generated labels to acknowledged experts for and corrections; four experts, forty excerpts; for each expert, score and audio provided for thirty and audio only for ten; lead sheet format also include lead sheets from song books. Watch this space...

Selection Evaluation investigation Current investigation: how much do experts opinions on this task differ? send machine-generated labels to acknowledged experts for and corrections; four experts, forty excerpts; for each expert, score and audio provided for thirty and audio only for ten; lead sheet format also include lead sheets from song books. Watch this space...

Extensibility Selection If we have more domain knowledge, then we can use the same framework to incorporate that knowledge: bass note: p(c xbω ) p(x cbω )p(c bω ) genre: p(c xgω ) p(x cgω )p(c gω ) Bayesian inference doesn t give just one answer, but a probability distribution over labels and windows. We can quantify our uncertainty (e.g. distribution entropy).

Summary Selection Can segment bars and generate harmonic labels with reasonable accuracy. Actual accuracy figures are indicative only: ongoing investigation into performance of human experts. Framework is extensible: can incorporate specific information (e.g. knowledge of bass note, alphabet of chord labels for a known genre) in a principled way.

Acknowledgments Selection David Lewis, Daniel Müllensiefen Geerdes midimusic (http://www.midimusic.de/) EPSRC grants GR/S84750/01, EP/D038855/1