CS 591 S1 Computational Audio

Similar documents
Audio Structure Analysis

Audio Structure Analysis

Audio Structure Analysis

Music Structure Analysis

Music Synchronization. Music Synchronization. Music Data. Music Data. General Goals. Music Information Retrieval (MIR)

Tempo and Beat Analysis

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Music Information Retrieval (MIR)

CS229 Project Report Polyphonic Piano Transcription

MUSI-6201 Computational Music Analysis

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Music Alignment and Applications. Introduction

Methods for the automatic structural analysis of music. Jordan B. L. Smith CIRMMT Workshop on Structural Analysis of Music 26 March 2010

A repetition-based framework for lyric alignment in popular songs

Music Structure Analysis

Music Information Retrieval (MIR)

Tempo and Beat Tracking

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

Elements of Music. How can we tell music from other sounds?

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

AUTOMATIC ACCOMPANIMENT OF VOCAL MELODIES IN THE CONTEXT OF POPULAR MUSIC

Supervised Learning in Genre Classification

The song remains the same: identifying versions of the same piece using tonal descriptors

Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification

Meet the Piano Keyboard

Computational Modelling of Harmony

MODELING RHYTHM SIMILARITY FOR ELECTRONIC DANCE MUSIC

Topics in Computer Music Instrument Identification. Ioanna Karydi

Music Structure Analysis

GROUPING RECORDED MUSIC BY STRUCTURAL SIMILARITY

Citation for published version (APA): Jensen, K. K. (2005). A Causal Rhythm Grouping. Lecture Notes in Computer Science, 3310,

Musical Form. Module 2 of Music: Under the Hood. John Hooker Carnegie Mellon University. Osher Course September 2018

An Examination of Foote s Self-Similarity Method

Introductions to Music Information Retrieval

AUTOMATED METHODS FOR ANALYZING MUSIC RECORDINGS IN SONATA FORM

ALIGNING SEMI-IMPROVISED MUSIC AUDIO WITH ITS LEAD SHEET

Topic 10. Multi-pitch Analysis

Theory D-examination 1

SHEET MUSIC-AUDIO IDENTIFICATION

AP Music Theory

AUTOMASHUPPER: AN AUTOMATIC MULTI-SONG MASHUP SYSTEM

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function

Music Database Retrieval Based on Spectral Similarity

Simple time Has 2, 3 or 4 as number of beats in a bar (top number of time signature)

Discovering Musical Structure in Audio Recordings

Improving Polyphonic and Poly-Instrumental Music to Score Alignment

STRUCTURAL CHANGE ON MULTIPLE TIME SCALES AS A CORRELATE OF MUSICAL COMPLEXITY

Automatic Piano Music Transcription

Week 14 Music Understanding and Classification

Harmonic Visualizations of Tonal Music

2. AN INTROSPECTION OF THE MORPHING PROCESS

AP Music Theory Course Planner

MUSIC/AUDIO ANALYSIS IN PYTHON. Vivek Jayaram

Theme and Variations

Voice & Music Pattern Extraction: A Review

Effects of acoustic degradations on cover song recognition

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

CHAPTER 6. Music Retrieval by Melody Style

Popular Song Summarization Using Chorus Section Detection from Audio Signal

Grouping Recorded Music by Structural Similarity Juan Pablo Bello New York University ISMIR 09, Kobe October 2009 marl music and audio research lab

MUSIC is a ubiquitous and vital part of the lives of billions

AUDIO-BASED MUSIC STRUCTURE ANALYSIS

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

Scoregram: Displaying Gross Timbre Information from a Score

A CLASSIFICATION APPROACH TO MELODY TRANSCRIPTION

HUMMING METHOD FOR CONTENT-BASED MUSIC INFORMATION RETRIEVAL

Swept-tuned spectrum analyzer. Gianfranco Miele, Ph.D

Simple Harmonic Motion: What is a Sound Spectrum?

Temporal coordination in string quartet performance

Analyzer Documentation

Data Driven Music Understanding

Music Similarity and Cover Song Identification: The Case of Jazz

Music Curriculum Glossary

Automatic Extraction of Popular Music Ringtones Based on Music Structure Analysis

Years 7 and 8 standard elaborations Australian Curriculum: Music

First Steps. Music Scope & Sequence

MTP Music Year 3 The Environment (3 weeks) Humans (3 weeks)

Shades of Music. Projektarbeit

WHEN listening to music, people spontaneously tap their

Exemplar for Internal Achievement Standard. Music Level 2

MODELS of music begin with a representation of the

Evaluating Melodic Encodings for Use in Cover Song Identification

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models

TOWARDS AN EFFICIENT ALGORITHM FOR AUTOMATIC SCORE-TO-AUDIO SYNCHRONIZATION

hit), and assume that longer incidental sounds (forest noise, water, wind noise) resemble a Gaussian noise distribution.

A Survey of Audio-Based Music Classification and Annotation

Speech To Song Classification


BLUE VALLEY DISTRICT CURRICULUM & INSTRUCTION Music 9-12/Honors Music Theory

Lab P-6: Synthesis of Sinusoidal Signals A Music Illusion. A k cos.! k t C k / (1)

FINDING REPEATING PATTERNS IN ACOUSTIC MUSICAL SIGNALS : APPLICATIONS FOR AUDIO THUMBNAILING.

Automatic Music Genre Classification


Music structure information is

Analysis of local and global timing and pitch change in ordinary

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

Music Representations

On Human Capability and Acoustic Cues for Discriminating Singing and Speaking Voices

Transcription:

4/29/7 CS 59 S Computational Audio Wayne Snyder Computer Science Department Boston University Today: Comparing Musical Signals: Cross- and Autocorrelations of Spectral Data for Structure Analysis Segmentation Computer Science Identifying structure from audio Arthur G. Lintgen: able to identify unlabeled recorded orchestral works by observing the spacing and patterns of grooves in an LP Inspired J. Foote (ISMIR, 2) to develop a MIR system based on structural similarity

4/29/7 Musical Form Units can be assigned letters (A, B, C) or functional names (intro, verse, chorus, bridge, etc) Strophic: repeats the same section, e.g. AA Binary: alternates two sections, which are often repeated, e.g. ABAB or AABB Ternary: third section is often a variation of the first, e.g. AABA, AABA, AA BA Arch: symmetric, repetition of sections around a center, e.g. ABCBA Rondo: main theme is alternated with sub-themes, e.g. ABACADA.. Variations: theme plus variations, e.g. AA i A ii AA iii Sonata: complex developmental form including the exposition, development and recapitulation of a given theme(s). Repetition Musical form is often defined by the amount of repetition across sectional units. Repetition is central to music (in harmony, melody, rhythm, etc). Significant variations are often found between repeated parts.

4/29/7 Repetition The information necessary to characterize repetitions is encoded in the feature vectors (e.g., chroma, spectrum, etc.).6.4.2.2.4.6 (a) A A 2 B B2 A 3 T C C 2 A4 A 5 (b) G# G F#.8 F E.6 D# D.4 C# C B.2 A# A A A 2 B B2 A 3 C 4 A 5 C 2 Audio Structure Analysis Given: CD recording Goal: Automatic extraction of the repetitive structure (or of the musical form) Example: Brahms Hungarian Dance No. 5 (Ormandy) 5 5 2

4/29/7 Basic Procedure Extract audio feature vectors (e.g., chromagraph) Cost measure and cost matrix self-similarity matrix Path extraction (pairwise similarity of segments) Global structure (clustering, grouping) Self-similarity matrix For an N-long sequence of feature vectors Self-similarity matrix: N x N matrix of pairwise (dis)similarities between vectors

4/29/7 Self-similarity matrix Vertical and horizontal axes represent time symmetric similarity function = symmetric matrix Main diagonal: closer/ most similar values similar subsequences (repetitions) -> diagonal stripes in the plot Self-similarity matrix

4/29/7 Self-similarity matrix Self-similarity matrix

4/29/7 Let s look at a similarity matrix and hear the piece of music to see how it represents the structure... Basic Procedure Self-similarity matrix Similarity structure 5 5 2

4/29/7 Basic Procedure Self-similarity matrix Similarity structure 5 5 2 Basic Procedure Self-similarity matrix Similarity structure 5 5 2

4/29/7 Basic Procedure Self-similarity matrix Similarity structure 5 5 2 Basic Procedure Self-similarity matrix Similarity structure 5 5 2

4/29/7 Basic Procedure Self-similarity matrix Similarity structure 5 5 2 Higher level structure can be determined by first extracting paths in the matrix

4/29/7 Path Extraction Thresholded, upper left Path Extraction Path removal

4/29/7 Path Extraction Path removal Path Extraction Extracted paths

4/29/7 Path Extraction Extracted paths after postprocessing Global Structure can be derived from relating paths at a higher level...

4/29/7 Music Synchronization: Lyrics-Audio Difficult task! Music Synchronization: Lyrics-Audio Lyrics-Audio Lyrics-MIDI + MIDI-Audio

4/29/7 System: SyncPlayer/LyricsSeeker High-Resolution Music Synchronization Normalized chroma features robust to changes in instrumentation and dynamics robust synchronization of reasonable overall quality Drawback: low temporal alignment accuracy Idea: Integration of note onset information

4/29/7 High-Resolution Music Synchronization Example: Beethoven s Fifth Audio MIDI Cost matrix Audio Warping path of poor local quality MIDI High-Resolution Music Synchronization Cost matrix windows are based on based on onset intervals, not uniformly spaced! Audio MIDI Cost matrix Audio Warping path based on onset information MIDI

4/29/7 High-Resolution Music Synchronization Ideas: Build up cost matrix with corridors of low cost Warping path tends to run through corridors of low cost note onset positions are likely to be aligned High-Resolution Music Synchronization Impulses zoom Decaying impulses zoom

4/29/7 High-Resolution Music Synchronization Cost matrix for decaying impulses High-Resolution Music Synchronization Cost matrix for decaying impulses Corridor of low cost

4/29/7 Music Segmentation Analysis Music segmentation pitch content (e.g., melody, harmony) music texture (e.g., timbre, instruments) rhythm How to find the musical sections of the piece? Music Segmentation Analysis Basic idea (from image processing) uses a kernel or mask to modify data points according to their neighbors Each data point is replaced by the weighted sum of its neighbors * kernel values This is given the convoluted name convolution in the signal processing world, with an excessively complicated definition. It is really just the same as a weighted sum of neighboring values, as with smoothing..

4/29/7 Music Segmentation Analysis Music Segmentation Analysis A binary kernel finds boundaries along the axis where sections of music different from neighboring sections

4/29/7 Music Segmentation Analysis The kernel is slid along the axis of the similarity matrix, and the value of the convolution is recorded for each time (in the center of the kernel): Music Segmentation Analysis The kernel is slid along the axis of the similarity matrix, and the value of the convolution is recorded for each time (in the center of the kernel):

4/29/7 Music Segmentation Analysis The kernel is slid along the axis of the similarity matrix, and the value of the convolution is recorded for each time (in the center of the kernel): Music Segmentation Analysis As the kernel is slid along the axis, the values calculated give us a novelty score for how much the music is changing at that point Different kernel types and sizes give a different perspective on the scale of the changes, from individual notes to large sections. Peak picking gives us the times where there is a potential start of a new segment of music:

4/29/7 Music Segmentation Analysis A Gaussian Kernel emphasizes changes at the center, and deemphasizes the edges (c.f. Hann Windows)