Outline. Why do we classify? Audio Classification

Outline
- Introduction
- Music Information Retrieval
- Classification Process Steps
- Pitch Histograms
- Multiple Pitch Detection Algorithm
- Musical Genre Classification
- Implementation
- Future Work

Why do we classify?
- Increasing importance of digital music distribution
- Effectively navigating through large web-based music collections
- Structuring online music stores and radio stations
- Creating intelligent Internet music search engines and peer-to-peer systems
- Can be used in other types of analysis, such as similarity retrieval or summarization

Audio Classification
- Folk, Classical, Jazz, New Age, Country, Rock, World, Reggae, Electronica

Audio Classification (cont.)

Music Information Retrieval (MIR)
- The process of indexing and searching music collections.
- Symbolic MIR
  - Structured signals such as MIDI files are used.
  - Melodic information is typically utilized.
  - Two different approaches: query-by-melody (manual) and query-by-humming
- Audio MIR
  - Arbitrary unstructured audio signals are used.
  - Timbral and rhythmic (beat) information is utilized.

What is MIDI?
- Musical Instrument Digital Interface
- A music definition language
- A communication protocol
- Supports 128 different voices
- Includes 16 channels

Classification Process Steps
[Flow diagram. Elements shown:]
- Inputs: MIDI file, audio-from-MIDI file, arbitrary audio file
- Histogram Construction Algorithm
- Multiple Pitch Detection Algorithm
- Pitch Histogram
- 4D Feature Vector (Pitch Content Feature Set)
- Timbral & Rhythmic Features
- Labeled feature vectors used by statistical classifiers
- Genre classification result, obtained by comparing the feature vectors

Pitch Histograms
- Unfolded Histogram
  - an array of 128 integer values (bins) indexed by MIDI note numbers
  - shows the frequency of occurrence of each note in a musical piece
  - contains information regarding the pitch range of the music
- Folded Histogram
  - all notes are transposed into a single octave and mapped to a circle of fifths
  - an array of 12 integer values
  - contains information regarding the pitch content of the music

[Figures: folded pitch histogram index numbers; unfolded pitch histogram index numbers (0 to 127)]
Fig. 1 - Unfolded Pitch Histograms of 2 Jazz pieces (left) and 2 Irish songs (right).
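The two histogram types can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names are hypothetical, the input is assumed to be a plain list of MIDI note numbers already extracted from a file, and the circle-of-fifths mapping (7 x pitch class) mod 12 is one common way to realize the mapping the slide describes.

```python
def unfolded_histogram(note_numbers):
    """Count occurrences of each MIDI note number (0-127) in a piece."""
    hist = [0] * 128
    for n in note_numbers:
        if not 0 <= n <= 127:
            raise ValueError(f"not a MIDI note number: {n}")
        hist[n] += 1
    return hist


def folded_histogram(unfolded):
    """Fold 128 bins into 12 pitch-class bins ordered along the circle of fifths.

    Assumption: pitch class c = n mod 12 is placed in bin (7 * c) mod 12,
    so that pitch classes a fifth apart land in adjacent bins.
    """
    folded = [0] * 12
    for n, count in enumerate(unfolded):
        pitch_class = n % 12
        folded[(7 * pitch_class) % 12] += count
    return folded


def normalize(hist):
    """Scale the bins so they sum to 1 (all zeros if the histogram is empty)."""
    total = sum(hist)
    return [c / total for c in hist] if total else [0.0] * len(hist)


# Hypothetical input: C4, E4, G4, C5, C4, G4
notes = [60, 64, 67, 72, 60, 67]
unfolded = unfolded_histogram(notes)
folded = folded_histogram(unfolded)
```

With this mapping, C and G (a fifth apart) fall into neighbouring bins 0 and 1, which is the property that makes the folded histogram sensitive to tonal relationships rather than to raw semitone distance.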

Pitch Histogram features
- Four-dimensional feature vector: PITCH-Fold, AMPL-Fold, PITCH-Unfold, DIST-Fold

Pitch Histogram Calculation
- For MIDI files: the algorithm traverses all MIDI events in the file linearly and increments the frequency counter of the corresponding note.
- Normalization
- For arbitrary audio files: Multiple Pitch Detection Algorithm

Multiple Pitch Detection Algorithm
Fig. 2 Multiple Pitch Detection Flow Chart

Experiment Details
- Types of music content:
  - symbolic (refers to MIDI)
  - audio-from-MIDI (generated using a synthesizer playing a MIDI file)
  - audio (digital audio files such as MP3s found on the web)
- Five musical genres are used: Electronica, Classical, Jazz, Irish Folk and Rock
- Experiment set:
  - A set of 100 musical pieces in MIDI format for each genre
  - A set of 100 audio-from-MIDI pieces for each genre
  - A set of 100 general audio files
- KNN(3) classifier
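As a rough illustration of the KNN(3) step, the sketch below classifies a four-dimensional feature vector by majority vote among its three nearest labeled neighbours. It is an assumption-laden example rather than the experiment's actual code: the function name, the Euclidean distance, and every feature value in the training set are made up for demonstration, and no feature scaling is applied.

```python
import math
from collections import Counter


def knn_classify(query, training_set, k=3):
    """Return the majority label among the k training vectors closest to query.

    training_set is a list of (feature_vector, label) pairs. Distance is
    Euclidean (an assumption; the slides do not name the metric).
    """
    neighbours = sorted(training_set, key=lambda item: math.dist(query, item[0]))[:k]
    votes = Counter(label for _, label in neighbours)
    # On a tie, most_common keeps first-seen order, so the nearest neighbour's label wins.
    return votes.most_common(1)[0][0]


# Hypothetical labeled vectors in the order
# (PITCH-Fold, AMPL-Fold, PITCH-Unfold, DIST-Fold); the numbers are invented.
training_set = [
    ((2, 0.20, 55, 3), "Jazz"),
    ((3, 0.22, 57, 4), "Jazz"),
    ((2, 0.18, 53, 3), "Jazz"),
    ((0, 0.40, 67, 1), "Irish Folk"),
    ((1, 0.42, 69, 1), "Irish Folk"),
    ((0, 0.38, 65, 1), "Irish Folk"),
]

predicted = knn_classify((0, 0.41, 66, 1), training_set)
```

In practice the four features live on very different scales, so some normalization before computing distances would likely be needed; that step is omitted here to keep the sketch short.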

Classification Results in MIDI
Fig. 3 Classification accuracy comparison of random and MIDI
Fig. 4 Pair-wise evaluation in MIDI
Fig. 5 Average classification accuracy as a function of the length of input MIDI data

Classification Results in Audio-from-MIDI
Fig. 6 - Classification accuracy comparison of random and Audio-from-MIDI

Comparison of Classification Results
Fig. 7 Classification accuracy comparison

Implementation
- MARSYAS (MusicAl Research SYstem for Analysis and Synthesis): the software used for audio Pitch Histogram calculation and musical genre classification.
- Three distinct modes of visualization:
  - Standard Pitch Histogram plots
  - 3D pitch-time surfaces
  - Projection of the pitch-time surfaces onto a 2D bitmap

MARSYAS Visualization
Fig. 8 Examples of grayscale pitch-time surfaces. Jazz (top) and Irish Folk music (bottom)

Summary
- Symbolic representation is preferable for computing pitch information.
- This work can be viewed as an attempt to bridge the two distinct MIR approaches by using Pitch Histograms.
- Pitch Histograms do carry a certain amount of genre-identifying information.
- The Multiple Pitch Detection Algorithm is not perfect, but it works to a certain degree.

Future Work
- Real-time running version of Pitch Histogram.
- Alternative feature sets are needed for better classification performance.
- Query-based retrieval mechanism for audio music signals.
- More detailed harmonic analysis, such as figured bass extraction, tonality recognition, and chord detection.
- The features derived from Pitch Histograms might be applicable to the problem of content-based audio identification or audio fingerprinting.

Thanks
- Cosku Turhan for the artwork on my slides
- 4 Non Blondes for their song, What's Up :)