DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Similar documents
DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

MUSI-6201 Computational Music Analysis

Outline. Why do we classify? Audio Classification

Music Information Retrieval

CTP431- Music and Audio Computing Music Information Retrieval. Graduate School of Culture Technology KAIST Juhan Nam

A System for Acoustic Chord Transcription and Key Extraction from Audio Using Hidden Markov models Trained on Synthesized Audio

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Music Information Retrieval

Data Driven Music Understanding

THE importance of music content analysis for musical

Automatic Commercial Monitoring for TV Broadcasting Using Audio Fingerprinting

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University


Beethoven, Bach, and Billions of Bytes

Music Information Retrieval. Juan Pablo Bello MPATE-GE 2623 Music Information Retrieval New York University

Music Information Retrieval (MIR)

PULSE-DEPENDENT ANALYSES OF PERCUSSIVE MUSIC

Music Information Retrieval. Juan P Bello

Singer Traits Identification using Deep Neural Network

Introductions to Music Information Retrieval

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

Music Complexity Descriptors. Matt Stabile June 6 th, 2008

Making Sense of Sound and Music

Automatic Construction of Synthetic Musical Instruments and Performers

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

A Survey of Audio-Based Music Classification and Annotation

Computational Modelling of Harmony

Music Radar: A Web-based Query by Humming System

A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models

The MPC X & MPC Live Bible 1

The MAMI Query-By-Voice Experiment Collecting and annotating vocal queries for music information retrieval

Music Information Retrieval

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Content-based Music Structure Analysis with Applications to Music Semantics Understanding

Tempo and Beat Analysis

Music Information Retrieval (MIR)

User-Specific Learning for Recognizing a Singer s Intended Pitch

IMPROVING RHYTHMIC SIMILARITY COMPUTATION BY BEAT HISTOGRAM TRANSFORMATIONS

MUSIC CONTENT ANALYSIS : KEY, CHORD AND RHYTHM TRACKING IN ACOUSTIC SIGNALS

Week 14 Music Understanding and Classification

Computer Coordination With Popular Music: A New Research Agenda 1

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Music Understanding and the Future of Music

Music Information Retrieval

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

Drum Stroke Computing: Multimodal Signal Processing for Drum Stroke Identification and Performance Metrics

MODELING RHYTHM SIMILARITY FOR ELECTRONIC DANCE MUSIC

Singer Recognition and Modeling Singer Error

Singing Voice Detection for Karaoke Application

Automatic Rhythmic Notation from Single Voice Audio Sources

An Examination of Foote s Self-Similarity Method

Scoregram: Displaying Gross Timbre Information from a Score

Music Information Retrieval for Jazz

Statistical Modeling and Retrieval of Polyphonic Music

Rhythm related MIR tasks

Content-based music retrieval

Classification of Timbre Similarity

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

The song remains the same: identifying versions of the same piece using tonal descriptors

Perceptual dimensions of short audio clips and corresponding timbre features

A Categorical Approach for Recognizing Emotional Effects of Music

Music structure information is

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Resources. Composition as a Vehicle for Learning Music

TOWARDS IMPROVING ONSET DETECTION ACCURACY IN NON- PERCUSSIVE SOUNDS USING MULTIMODAL FUSION

Music 209 Advanced Topics in Computer Music Lecture 4 Time Warping

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

A Bootstrap Method for Training an Accurate Audio Segmenter

Pattern Recognition in Music

Timing In Expressive Performance

Music Information Retrieval with Temporal Features and Timbre

A REAL-TIME SIGNAL PROCESSING FRAMEWORK OF MUSICAL EXPRESSIVE FEATURE EXTRACTION USING MATLAB

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

TechNote: MuraTool CA: 1 2/9/00. Figure 1: High contrast fringe ring mura on a microdisplay

CTP 431 Music and Audio Computing. Course Introduction. Graduate School of Culture Technology (GSCT) Juhan Nam

CSC475 Music Information Retrieval

Setting up your Roland V-Drums with Melodics.

Extracting Information from Music Audio

Breakscience. Technological and Musicological Research in Hardcore, Jungle, and Drum & Bass

Music Genre Classification and Variance Comparison on Number of Genres

PREDICTING THE PERCEIVED SPACIOUSNESS OF STEREOPHONIC MUSIC RECORDINGS

Lets go through the chart together step by step looking at each bit and understanding what the Chart is asking us to do.

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

EXPLORING THE USE OF ENF FOR MULTIMEDIA SYNCHRONIZATION

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

MUSICAL INSTRUMENT RECOGNITION USING BIOLOGICALLY INSPIRED FILTERING OF TEMPORAL DICTIONARY ATOMS

TIMECLOCK ENROLLMENT GUIDE (5000 SERIES)

Efficient Vocal Melody Extraction from Polyphonic Music Signals

Violin Timbre Space Features

... A Pseudo-Statistical Approach to Commercial Boundary Detection. Prasanna V Rangarajan Dept of Electrical Engineering Columbia University

Indexing local features. Wed March 30 Prof. Kristen Grauman UT-Austin

Transcription An Historical Overview

Music out of Digital Data

Contextual music information retrieval and recommendation: State of the art and challenges

Lecture 9 Source Separation

Beethoven, Bach und Billionen Bytes

Brain Computer Music Interfacing Demo

Phone-based Plosive Detection

Transcription:

DAY 1 Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval Jay LeBoeuf Imagine Research jay{at}imagine-research.com Kyogu Lee Gracenote Kglee{at}ccrma.stanford.edu June 2009

These lecture notes contain hyperlinks to the CCRMA Wiki. On these pages, you can find supplemental material for lectures - providing extra tutorials, support, references for further reading, or demonstration code snippets for those interested in a given topic. Click on the symbol on the lower-left corner of a slide to access additional resources. WIKI REFERENCES

Administration https://cm-wiki.stanford.edu/wiki/mir_workshop_2009 Daily schedule Introductions Our background A little about yourself List of your region of interest, and any specific items of interest that you d like to see covered.

Example Seed

Why MIR? Find specific item Find something vague Find something interesting or new

Commercial Applications Retrieval based on similarity (IR and creative applications) Live analysis of audio Music Discovery / Recommendation Query for music Assisted Music Transcription Audio fingerprint Creative applications

Queries Query by Humming Lots of academic work Query by audio ID Gracenote ID, Shazam, Audible Magic Noisy audio snippet Query by example Find more like this (where this has to be specified or inferred)

Current Hot research areas Analysis of commercial music tracks, such as: Genre ID (labels exist, but even humans disagree!) Artist classification Tricks: use voice only to improve accuracy to 70% (out of 100 artists) Artist similarity Really, what is the similarity? Augment recommenders with new data

Motivations / Demos Transcriptionist vs. Descriptionist approach Music Transcription (restoration) piano from MIDI

Motivations / Demos Transcriptionist vs. Descriptionist approach Music Transcription (restoration) piano from MIDI http://zenph.com/listen.html More info: http://www.pragprog.com/articles/a-pragmatic-project-livein-concert/the-methodology

Motivations / Demos Transcriptionist vs. Descriptionist approach Music Transcription (restoration) piano company from MIDI http://zenph.com/listen.html - More transcription (drum transcription demo)

BASIC SYSTEM OVERVIEW

Basic system overview Segmentation (Frames, Onsets, Beats, Bars, Chord Changes, etc)

Basic system overview Segmentation (Frames, Onsets, Beats, Bars, Chord Changes, etc) Feature Extraction (Time-based, spectral energy, MFCC, etc)

Basic system overview Segmentation (Frames, Onsets, Beats, Bars, Chord Changes, etc) Feature Extraction (Time-based, spectral energy, MFCC, etc) Analysis / Decision Making (Classification, Clustering, etc)

TIMING AND SEGMENTATION

Timing and Segmentation Slicing up by fixed time slices 1 second, 80 ms, 100 ms, 20-40ms, etc. Frames Different problems call for different frame lengths

Frames 1 second 1 second

Timing and Segmentation Slicing up by fixed time slices 1 second, 80 ms, 100 ms, 20-40ms, etc. Frames Different problems call for different frame lengths Onset detection Beat detection Beat Measure / Bar / Harmonic changes Segments Musically relevant boundaries Separate by some perceptual cue

Onset detection What is an Onset? How to detect? Envelope is not enough Need to examine frequency bands

FEATURE EXTRACTION

Frame 1

FRAME 1

ZERO CROSSING RATE FRAME 1 Zero crossing rate = 9

Frame 2 Zero crossing rate = 423

Features : SimpleLoop.wav Frame ZCR 1 9 2 423 3 22 4 28 5 390 Warning: example results only - not actual results from audio analysis

FEATURE EXTRACTION

FFT?

Spectral Features Spectral Centroid Spectral Bandwidth/Spread Spectral Skewness Spectral Kurtosis Spectral Tilt Spectral Roll-Off Spectral Flatness Measure Spectral Crest Factor Spectral moments

Frame 1 85% 15%

Skewness Kurtosis http://www.jyu.fi/hum/laitokset/musiikki/en/research/coe/materials/mirtoolbox/userguide1.1

Frame 2

ANALYSIS AND DECISION MAKING

Heuristic Analysis Example: Cowbell on just the snare drum of a drum loop. Simple instrument recognition! Use basic thresholds or simple decision tree to form rudimentary transcription of kicks and snares. Time for more sophistication!

> End of Lecture 1