Music Information Retrieval Community

Similar documents

Production. Old School. New School. Personal Studio. Professional Studio

MUSI-6201 Computational Music Analysis

Music Information Retrieval

Automatic Rhythmic Notation from Single Voice Audio Sources

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

Towards Musical Query-by-Semantic-Description using the CAL500 Data Set

Subjective Similarity of Music: Data Collection for Individuality Analysis

CTP431- Music and Audio Computing Music Information Retrieval. Graduate School of Culture Technology KAIST Juhan Nam

Classification of Timbre Similarity

On Human Capability and Acoustic Cues for Discriminating Singing and Speaking Voices

A New Method for Calculating Music Similarity

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

MODELS of music begin with a representation of the

Music Genre Classification and Variance Comparison on Number of Genres

SIGNAL + CONTEXT = BETTER CLASSIFICATION

Predicting Time-Varying Musical Emotion Distributions from Multi-Track Audio

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

Singer Traits Identification using Deep Neural Network

Automatic Music Similarity Assessment and Recommendation. A Thesis. Submitted to the Faculty. Drexel University. Donald Shaul Williamson

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Time Series Models for Semantic Music Annotation Emanuele Coviello, Antoni B. Chan, and Gert Lanckriet

Supervised Learning in Genre Classification

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

AUTOREGRESSIVE MFCC MODELS FOR GENRE CLASSIFICATION IMPROVED BY HARMONIC-PERCUSSION SEPARATION

Content-based music retrieval

13 Matching questions

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

Topics in Computer Music Instrument Identification. Ioanna Karydi

Semi-supervised Musical Instrument Recognition

Singer Recognition and Modeling Singer Error

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Music Recommendation from Song Sets

Introductions to Music Information Retrieval

Music Mood Classification - an SVM based approach. Sebastian Napiorkowski

ISMIR 2008 Session 2a Music Recommendation and Organization

Music Information Retrieval

Tempo and Beat Analysis

Music Information Retrieval

Unifying Low-level and High-level Music. Similarity Measures

AudioRadar. A metaphorical visualization for the navigation of large music collections

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

A MUSIC CLASSIFICATION METHOD BASED ON TIMBRAL FEATURES

HIT SONG SCIENCE IS NOT YET A SCIENCE

Music Information Retrieval with Temporal Features and Timbre

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Beethoven, Bach, and Billions of Bytes

The song remains the same: identifying versions of the same piece using tonal descriptors

Automatic Music Clustering using Audio Attributes

GRADIENT-BASED MUSICAL FEATURE EXTRACTION BASED ON SCALE-INVARIANT FEATURE TRANSFORM

Toward Evaluation Techniques for Music Similarity

USING ARTIST SIMILARITY TO PROPAGATE SEMANTIC INFORMATION

Autotagger: A Model For Predicting Social Tags from Acoustic Features on Large Music Databases

Music Similarity and Cover Song Identification: The Case of Jazz

Learning to Tag from Open Vocabulary Labels

Music Information Retrieval. Juan P Bello

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

638 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 3, MARCH 2010

Detecting Musical Key with Supervised Learning

IEEE TRANSACTIONS ON MULTIMEDIA, VOL. X, NO. X, MONTH Unifying Low-level and High-level Music Similarity Measures

Types of music SPEAKING

Application Of Missing Feature Theory To The Recognition Of Musical Instruments In Polyphonic Audio

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC

TOWARDS IMPROVING ONSET DETECTION ACCURACY IN NON- PERCUSSIVE SOUNDS USING MULTIMODAL FUSION

Music Information Retrieval

Features for Audio and Music Classification

Contextual music information retrieval and recommendation: State of the art and challenges

Can Song Lyrics Predict Genre? Danny Diekroeger Stanford University

HUMAN PERCEPTION AND COMPUTER EXTRACTION OF MUSICAL BEAT STRENGTH

MUSICAL INSTRUMENT IDENTIFICATION BASED ON HARMONIC TEMPORAL TIMBRE FEATURES

THE importance of music content analysis for musical

Singing Pitch Extraction and Singing Voice Separation

Chord Classification of an Audio Signal using Artificial Neural Network

Effects of acoustic degradations on cover song recognition

The MAMI Query-By-Voice Experiment Collecting and annotating vocal queries for music information retrieval

Further Topics in MIR

A Survey of Audio-Based Music Classification and Annotation

Creating a Feature Vector to Identify Similarity between MIDI Files

Quality of Music Classification Systems: How to build the Reference?

A Categorical Approach for Recognizing Emotional Effects of Music

TRACKING THE ODD : METER INFERENCE IN A CULTURALLY DIVERSE MUSIC CORPUS

Assigning and Visualizing Music Genres by Web-based Co-Occurrence Analysis

Automatic Music Genre Classification

On human capability and acoustic cues for discriminating singing and speaking voices

Automatic Labelling of tabla signals

Outline. Why do we classify? Audio Classification

Analysis of Peer Reviews in Music Production

Enhancing Music Maps

A Study of Synchronization of Audio Data with Symbolic Data. Music254 Project Report Spring 2007 SongHui Chon

Music Mood Classication Using The Million Song Dataset

Musical Instrument Identification based on F0-dependent Multivariate Normal Distribution

Music Conversation Practice

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Curtis Blues: The Roots of Rock and Rap - Rhythm and Lyrics in Acoustic Delta Blues

ON FINDING MELODIC LINES IN AUDIO RECORDINGS. Matija Marolt

Research Article A Model-Based Approach to Constructing Music Similarity Functions

Analyzing the Relationship Among Audio Labels Using Hubert-Arabie adjusted Rand Index

The Million Song Dataset

/$ IEEE

Transcription:

Music Information Retrieval Community What: Developing systems that retrieve music When: Late 1990 s to Present Where: ISMIR - conference started in 2000 Why: lots of digital music, lots of music lovers, lots of powerful computers 1

How can we find find music? Query-by-Metadata - artist, song, album, year We must know what we want Query-by-(Humming, Tapping, Beatboxing) Requires talent Query-by-Song-Similarity We must possess acoustically similar songs Query-by-Semantic-Description Google seems to work pretty well for text Semantic Image Labeling is a hot topic in Computer Vision Can it work for music? 2

Semantic Music Annotation and Retrieval Our goal is build a system that can 1. Annotate a song with meaningful words 2. Retrieve songs given a text-based description Plan: Learn a probabilistic model that captures a relationship between the audio content of a song and words that describe the song. 3

Collecting Semantic Music Data CAL500 Data Set: We have collected 1700 annotations for 500 western popular songs by having 55 individuals listen to and evaluate music. Each song is annotated by at least 3 individuals. An annotation reflects the strength of association between a song and 173 words. Words relate to Instrumentation, Genre, Emotion, Vocals, Usages, Quality, Tempo, Collected using a standard survey 4

System Overview Data Features Modeling Evaluation Training Data Vocabulary Parametric Model: Set of GMMs T T Annotation Document Vectors (y) Audio-Feature Extraction (X) Parameter Estimation: EM Algorithm Novel Song Evaluation (annotation) Inference Music Review Text Query (retrieval) 5

Our Model Each song is represented by a time series X= {x 1,,x t } dynamic Mel-Frequency Cepstral Coefficients [McKinney03] For each word w, we learn a word-model p(x w) using songs that are associated with the word. p(x w) is modeled using a Gaussian Mixture Model (GMM) Annotation: Given a novel song, we pick words by comparing the likelihood of the audio features under each word-model. Retrieval: Given a text query, we pick songs that are likely under the word-models associated with the words in the query. 6

Annotation: Automatic Music Reviews Dr. Dre (feat. Snoop Dogg) - Nuthin' but a 'G' thang This is dance poppy, hip-hop song that is arousing and exciting. It features drum machine, backing vocals, male vocal, a nice acoustic guitar solo, and rapping, strong vocals. It is a song that is very danceable and with a heavy beat that you might like listen to while at a party. Frank Sinatra - Fly me to the moon This is a jazzy, singer / songwriter song that is calming and sad. It features acoustic guitar, piano, saxophone, a nice male vocal solo, and emotional, high-pitched vocals. It is a song with a light beat and a slow tempo that you might like listen to while hanging with friends. 7

Retrieval: Query-by-Semantic-Description Query Retrieved Songs Tender Female Vocals Tender AND Female Vocals Crosby, Stills and Nash - Guinnevere Jewel - Enter from the East Art Tatum - Willow Weep for Me John Lennon - Imagine Tom Waits - Time Alicia Keys - Fallin Shakira - The One Christina Aguilera - Genie in a Bottle Junior Murvin - Police and Thieves Britney Spears - I'm a Slave 4 U Jewel - Enter from the East Evanescence - My Immortal Cowboy Junkies - Postcard Blues Everly Brothers - Take a Message to Mary Sheryl Crow - I Shall Believe 8

Quantifying Annotation Our system annotates the Cal-500 songs with 10 words from our vocabulary of 173 words. Population Annotation Ground Truth Metric: Word Precision & Recall Consider word w, Precision = # songs correctly annotated with w # songs annotated with w Recall = # songs correctly annotated with w # songs that should have been annotated w Mean Word Recall and Word Precision are the averages over all words in our vocabulary. 9

Quantifying Annotation Our system annotates the Cal-500 songs with 10 words from our vocabulary of 173 words. Model Precision Recall Random 0.17 0.05 Upper Bound 1.00 0.30 Our System 0.31 0.14 Human 0.34 0.11 Key point: By pooling human annotations, our model can produce annotations that are as consistent as annotations produced by individuals, when compared against a population average. 10

What s next Going Bigger Larger Vocabulary More Annotation Novel Applications Modeling dependencies Words Correlations (e.g., Classic Rock & Electric Guitar ) Audio Features have temporal dependencies Modeling individuals rather than populations Comparing alternative models Modeling heterogeneous data is hot topic in Machine Learning Many models have been proposed. Applications: Image Labeling, Text-Document Classification Exploring Novel Query Paradigms - Query-by-semantic-example - Heterogeneous queries 11

To learn more The mathematics (parameter estimation, inference), a description of the Cal-500 data set, evaluation of annotation and retrieval performance, and much more is available at: cosmal.ucsd.edu/cal 12

Talking about music is like dancing about architecture - origins unknown 13

A biased view of Music Classification 2000-03: Music classification (by genre, emotion, instrumentation) becomes a popular MIR task Undergrad Thesis on Genre Classification with G. Tzanetakis 2003-04: MIR community starts to criticize music classification problems ill-posed problem due to subjectivity not an end in itself performance glass ceiling 2004-06: Focus turns to Music Similarity research Recommendation Playlist generation 2006-07: We view Music Annotation as a supervised multiclass labeling problem Like classification but with large, less-restrictive vocabulary 14