The Latin Music Database A Database for Automatic Music Genre Classification

Similar documents
THE LATIN MUSIC DATABASE

Kent Academic Repository

A FEATURE SELECTION APPROACH FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

ADDITIONAL EVIDENCE THAT COMMON LOW-LEVEL FEATURES OF INDIVIDUAL AUDIO FRAMES ARE NOT REPRESENTATIVE OF MUSIC GENRE

Capturing the Temporal Domain in Echonest Features for Improved Classification Effectiveness

ARTICLE IN PRESS. Signal Processing

Computational Rhythm Similarity Development and Verification Through Deep Networks and Musically Motivated Analysis

Outline. Why do we classify? Audio Classification

Set of texture descriptors for music genre classification

Singer Traits Identification using Deep Neural Network

LMS301: Reference Management Software (Mendeley)

ABSOLUTE OR RELATIVE? A NEW APPROACH TO BUILDING FEATURE VECTORS FOR EMOTION TRACKING IN MUSIC

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

MUSIC SHAPELETS FOR FAST COVER SONG RECOGNITION

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC

Can Song Lyrics Predict Genre? Danny Diekroeger Stanford University

Music Genre Classification and Variance Comparison on Number of Genres

Supervised Learning in Genre Classification

The MAMI Query-By-Voice Experiment Collecting and annotating vocal queries for music information retrieval

Topics in Computer Music Instrument Identification. Ioanna Karydi

EVALUATING THE GENRE CLASSIFICATION PERFORMANCE OF LYRICAL FEATURES RELATIVE TO AUDIO, SYMBOLIC AND CULTURAL FEATURES

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

MUSICAL STRUCTURAL ANALYSIS DATABASE BASED ON GTTM

A Basis for Characterizing Musical Genres

Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset

A FUNCTIONAL CLASSIFICATION OF ONE INSTRUMENT S TIMBRES

Hidden Markov Model based dance recognition

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

AutoDewey. Julianne Beall, Assistant Editor, DDC Caroline Saccucci, Head, Dewey Section Library of Congress

Enhancing Music Maps

Speech Recognition and Signal Processing for Broadcast News Transcription

MUSI-6201 Computational Music Analysis

EVALUATION OF FEATURE EXTRACTORS AND PSYCHO-ACOUSTIC TRANSFORMATIONS FOR MUSIC GENRE CLASSIFICATION

jsymbolic 2: New Developments and Research Opportunities

Lyric-Based Music Mood Recognition

Subjective Similarity of Music: Data Collection for Individuality Analysis

PLAYSOM AND POCKETSOMPLAYER, ALTERNATIVE INTERFACES TO LARGE MUSIC COLLECTIONS

Music and Text: Integrating Scholarly Literature into Music Data

Musical Hit Detection

Sound and music computing at the University of Porto and the m4m initiative

ISMIR 2008 Session 2a Music Recommendation and Organization

A personalized TV Guide System

Arts Application & Audition Guidelines

Mood Tracking of Radio Station Broadcasts

Automatic Extraction of Popular Music Ringtones Based on Music Structure Analysis

Cataloguing pop music recordings at the British Library. Ian Moore, Reference Specialist, Sound and Vision Reference Team, British Library

IMPROVING GENRE CLASSIFICATION BY COMBINATION OF AUDIO AND SYMBOLIC DESCRIPTORS USING A TRANSCRIPTION SYSTEM

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

A Survey of Audio-Based Music Classification and Annotation

MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES

HUMAN PERCEPTION AND COMPUTER EXTRACTION OF MUSICAL BEAT STRENGTH

Lyrics Classification using Naive Bayes

Perceptual Evaluation of Automatically Extracted Musical Motives

Research Article A Model-Based Approach to Constructing Music Similarity Functions

Using Genre Classification to Make Content-based Music Recommendations

Kandinsky Inspired. Latin Infused. Rhythm Sculptures

WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs

Base, Pulse, and Trace File Reference Guide

An Introduction to Deep Image Aesthetics

Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections

Music Recommendation from Song Sets

Analytic Comparison of Audio Feature Sets using Self-Organising Maps

A Pattern Recognition Approach for Melody Track Selection in MIDI Files

arxiv: v1 [cs.lg] 16 Dec 2017

An ecological approach to multimodal subjective music similarity perception

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function

Automatic Commercial Monitoring for TV Broadcasting Using Audio Fingerprinting

Youthful efforts paying off By Anthony Sciales

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

Creating a Feature Vector to Identify Similarity between MIDI Files

Algorithmic Music Composition

Music Information Retrieval

Research & Development. White Paper WHP 232. A Large Scale Experiment for Mood-based Classification of TV Programmes BRITISH BROADCASTING CORPORATION

Automated extraction of motivic patterns and application to the analysis of Debussy s Syrinx

METHOD TO DETECT GTTM LOCAL GROUPING BOUNDARIES BASED ON CLUSTERING AND STATISTICAL LEARNING

Supplementary Note. Supplementary Table 1. Coverage in patent families with a granted. all patent. Nature Biotechnology: doi: /nbt.

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

A Large Scale Experiment for Mood-Based Classification of TV Programmes

A Graph-Based Method for Playlist Generation

Davis Senior High School Symphonic Band Audition Information

The Million Song Dataset

CS229 Project Report Polyphonic Piano Transcription

DRAFT UC VENDOR/SHARED CATALOGING STANDARDS FOR AUDIO RECORDINGS JUNE 4, 2013 EDIT

Automatic Music Clustering using Audio Attributes

... A Pseudo-Statistical Approach to Commercial Boundary Detection. Prasanna V Rangarajan Dept of Electrical Engineering Columbia University

TOWARDS CHARACTERISATION OF MUSIC VIA RHYTHMIC PATTERNS

Shades of Music. Projektarbeit

6.UAP Project. FunPlayer: A Real-Time Speed-Adjusting Music Accompaniment System. Daryl Neubieser. May 12, 2016

Image Steganalysis: Challenges

CALCULATING SIMILARITY OF FOLK SONG VARIANTS WITH MELODY-BASED FEATURES

CHAPTER 6. Music Retrieval by Melody Style

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

jsymbolic and ELVIS Cory McKay Marianopolis College Montreal, Canada

Evaluating Melodic Encodings for Use in Cover Song Identification

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

AUTOMATIC MAPPING OF SCANNED SHEET MUSIC TO AUDIO RECORDINGS

ETVA Policy Manual. All-East/All-State Auditions. Table of Contents

Wipe Scene Change Detection in Video Sequences

Motion Video Compression

Transcription:

The Latin Music Database A Database for Automatic Music Genre Classification Carlos N. Silla Jr., Celso A. A. Kaestner, Alessandro L. Koerich 11 th Brazilian Symposium on Computer Music (SBCM2007) São Paulo Setembro/2007

Outline Introduction Other Databases Main Characteristics of the LM Database Experiments with the LM Database Limitations Concluding Remarks Future Work

Introduction Problem Statement It is very difficult to assign a genre to a music based only on the auditive human perception It implies in a previous knowledge about the genre Lack of ground-truth in the area Most of databases normally contains few musical recordings, sometimes only excerpts of the full music and also a small number of recordings per class

Introduction Motivation To build a database for research of machine learning algorithms To build a database where the music pieces are labeled by specialists Minimize the subjectiveness of assigning a genre to a music piece

Introduction Goal To build a clean database to carry out experiences on automatic music genre classification using machine learning algorithms. To build a database that allows reproducing experiments No duplicity of music pieces Make easy the input of new music pieces and genres

Other Databases CODAICH 20.894 music pieces in MP3 format from 1.941 artists. GTZAN Database 1.000 music pieces from 10 genres (100/genre) Homburg 1.886 music pieces from 9 genres

Main Characteristics of the LM Database How the genres were assigned to the music pieces? Human Inspection Based on the perspective of the human perception on how the music is danced. By professional dance teachers with over ten years of experience in teaching ballroom and Brazilian cultural dances.

Main Characteristics of the LM Database First Stage Dance teachers make a selection of the musical recordings that they judged representative of a specific genre, according to how that musical pieces are danced. Second Stage Each selected music piece was verified to avoid mistakes that were expected to happen due to the stress produced by manually listening and labeling each one of the pieces.

Main Characteristics of the LM Database About 300 musical recordings were classified by month, and the total duration of the development of the Latin Music Database took a year. The Latin Music Database: 3.160 music pieces in MP3 format 10 different musical genres 543 artists

Main Characteristics of the LM Database Musical Genres and the number of samples Tango (404) Salsa (303) Forró (315) Axé (304) Bachata (308) Bolero (302) Merengue (307) Gaúcha (306) Sertaneja (310) Pagode (301) At least 300 samples per genre

Main Characteristics of the LM Database The procedure to insert a music piece in the database: 1. Assign a genre (specialist) 2. Inspection and correction of the ID3 tag (artist and title). 3. Enrollment of the music piece in the database DIRECTORY_GENRE\ARTIST-TITLE-ALBUM- TRACK.MP3

Main Characteristics of the LM Database Other approaches CD Collections Artist Profile In the case of Latin Music, such approaches have some drawbacks.

Main Characteristics of the LM Database Example: In the 4 CD collection Los 100 Mayores Exitos De La Musica Salsa only half (50 out of 100) of the music pieces can be considered as Salsa! Only 400 out of more than 500 Carlos Gardel compositions are really Tango! While building the database, we have found that in average, one to three music pieces are in conflict with the artist profile

Experiments with the LM Database

Experiments with the LM Database C. N. Silla Jr., C. A. A. Kaestner & A. L. Koerich. Automatic Music Genre Classification Using Ensemble of Classifiers. IEEE International Conference on Systems, Man and Cybernetics (SMC2007), Montreal, Canada, to appear, October 2007.

Concluding Remarks A novel approach to assign genres to music pieces Labeled by specialists Based on how a music piece is danced. Extends the auditive human perception Maybe it is the first thematic database

Limitations The raw data (MP3 files) is not available. 30-dimensional feature vectors generated from the Marsyas framework 30 are publicly available at www.ppgia.pucpr.br/~alekoe

Future Work Make it available at the On-demand Metadata Extraction Network (OMEN project). A tool that overcomes the copyright limitation and make databases widely available to the MIR community Inclusion of new musical genres to the current database Introduce an hierarchy of genres. Example: forró will be the main genre of subgenres xote, xaxado and baião.

Questions?