Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections

Similar documents
Multi-modal Analysis of Music: A large-scale Evaluation

Multi-modal Analysis of Music: A large-scale Evaluation

Combination of Audio and Lyrics Features for Genre Classification in Digital Audio Collections

Analytic Comparison of Audio Feature Sets using Self-Organising Maps

Enhancing Music Maps

EVALUATION OF FEATURE EXTRACTORS AND PSYCHO-ACOUSTIC TRANSFORMATIONS FOR MUSIC GENRE CLASSIFICATION

arxiv: v1 [cs.ir] 16 Jan 2019

MUSI-6201 Computational Music Analysis

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

IMPROVING GENRE CLASSIFICATION BY COMBINATION OF AUDIO AND SYMBOLIC DESCRIPTORS USING A TRANSCRIPTION SYSTEM

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

Music Genre Classification and Variance Comparison on Number of Genres

DISCOURSE ANALYSIS OF LYRIC AND LYRIC-BASED CLASSIFICATION OF MUSIC

INDEX. classical works 60 sources without pagination 60 sources without date 60 quotation citations 60-61

Sarcasm Detection in Text: Design Document

Music Recommendation from Song Sets

Capturing the Temporal Domain in Echonest Features for Improved Classification Effectiveness

Lyric-Based Music Genre Classification. Junru Yang B.A.Honors in Management, Nanjing University of Posts and Telecommunications, 2014

Lyric-Based Music Mood Recognition

Outline. Why do we classify? Audio Classification

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

Automatic Music Genre Classification

ALF-200k: Towards Extensive Multimodal Analyses of Music Tracks and Playlists

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

Basic English. Robert Taggart

PLAYSOM AND POCKETSOMPLAYER, ALTERNATIVE INTERFACES TO LARGE MUSIC COLLECTIONS

Music Mood Classification - an SVM based approach. Sebastian Napiorkowski

EVALUATING THE GENRE CLASSIFICATION PERFORMANCE OF LYRICAL FEATURES RELATIVE TO AUDIO, SYMBOLIC AND CULTURAL FEATURES

Using Genre Classification to Make Content-based Music Recommendations

Can Song Lyrics Predict Genre? Danny Diekroeger Stanford University

Music Genre Classification

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

Lyrics Classification using Naive Bayes

Automatic Labelling of tabla signals

Content-based music retrieval

Week 14 Music Understanding and Classification

Mood Tracking of Radio Station Broadcasts

Music Mood. Sheng Xu, Albert Peyton, Ryan Bhular

Automatic Rhythmic Notation from Single Voice Audio Sources

Methods for the automatic structural analysis of music. Jordan B. L. Smith CIRMMT Workshop on Structural Analysis of Music 26 March 2010

A FEATURE SELECTION APPROACH FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

Write for College. Using. Introduction. Sequencing Assignments 2 Scope and Sequence 4 Yearlong Timetable 6

th International Conference on Information Visualisation

Music Similarity and Cover Song Identification: The Case of Jazz

Automatic Laughter Detection

MUSICAL MOODS: A MASS PARTICIPATION EXPERIMENT FOR AFFECTIVE CLASSIFICATION OF MUSIC

STYLISTIC ANALYSIS OF MAYA ANGELOU S EQUALITY

Improving Frame Based Automatic Laughter Detection

Lecture 15: Research at LabROSA

winter but it rained often during the summer

WHEN LYRICS OUTPERFORM AUDIO FOR MUSIC MOOD CLASSIFICATION: A FEATURE ANALYSIS

English Language Arts 600 Unit Lesson Title Lesson Objectives

Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset

K-12 ELA Vocabulary (revised June, 2012)

Automatic Music Clustering using Audio Attributes

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

Subjective Similarity of Music: Data Collection for Individuality Analysis

Useful Definitions. a e i o u. Vowels. Verbs (doing words) run jump


tech-up with Focused Poetry

Assigning and Visualizing Music Genres by Web-based Co-Occurrence Analysis

COMBINING FEATURES REDUCES HUBNESS IN AUDIO SIMILARITY

USING ARTIST SIMILARITY TO PROPAGATE SEMANTIC INFORMATION

The Million Song Dataset

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

ABSOLUTE OR RELATIVE? A NEW APPROACH TO BUILDING FEATURE VECTORS FOR EMOTION TRACKING IN MUSIC

Scope and Sequence for NorthStar Listening & Speaking Intermediate

Lauderdale County School District Pacing Guide Sixth Grade Language Arts / Reading First Nine Weeks

Semi-supervised Musical Instrument Recognition

Predicting Time-Varying Musical Emotion Distributions from Multi-Track Audio

Research & Development. White Paper WHP 228. Musical Moods: A Mass Participation Experiment for the Affective Classification of Music

Automatic Laughter Detection

What s New in the 17th Edition

EE373B Project Report Can we predict general public s response by studying published sales data? A Statistical and adaptive approach

Features for Audio and Music Classification

,, or. by way of a passing reference. The reader has to make a connection. Extended Metaphor a comparison between things that

General Educational Development (GED ) Objectives 8 10

Recommending Music for Language Learning: The Problem of Singing Voice Intelligibility

Cambridge Primary English as a Second Language Curriculum Framework mapping to English World

Exploring Melodic Features for the Classification and Retrieval of Traditional Music in the Context of Cultural Source

Toward Multi-Modal Music Emotion Classification

Context-based Music Similarity Estimation

Part IV: Personalization, Context-awareness, and Hybrid Methods

Automatic Musical Pattern Feature Extraction Using Convolutional Neural Network

Ameliorating Music Recommendation

A Categorical Approach for Recognizing Emotional Effects of Music

UWaterloo at SemEval-2017 Task 7: Locating the Pun Using Syntactic Characteristics and Corpus-based Metrics

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

Automatic Analysis of Musical Lyrics

LANGLEY SCHOOL. Your Little Literacy Book

World Journal of Engineering Research and Technology WJERT

LING/C SC 581: Advanced Computational Linguistics. Lecture Notes Feb 6th

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

ASSOCIATIONS BETWEEN MUSICOLOGY AND MUSIC INFORMATION RETRIEVAL

Multimodal Sentiment Analysis of Telugu Songs

AUTOREGRESSIVE MFCC MODELS FOR GENRE CLASSIFICATION IMPROVED BY HARMONIC-PERCUSSION SEPARATION

Using Generic Summarization to Improve Music Information Retrieval Tasks

A repetition-based framework for lyric alignment in popular songs

Singer Traits Identification using Deep Neural Network

Transcription:

1/23 Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections Rudolf Mayer, Andreas Rauber Vienna University of Technology {mayer,rauber}@ifs.tuwien.ac.at Robert Neumayer Norwegian University of Science and Technology neumayer@idi.ntnu.no ACM Multimedia 2008

2/23 Motivation Music Information Retrieval (MIR) Search & find music, organise music collections Music is inherently multi-modal Music: audio, symbolic, scores,... Text: Song lyrics, artist biographies, websites,... Community data: playlists,... Video, image: album covers, music videos

3/23 Motivation Musical genre classification: automatically assign genre labels to new music, usually based on Digital signal processing (zero crossings, MFCCs, Rhythm Patterns) Cultural data (artist biographies, album reviews) Social network info (playlists of users, e.g. last.fm) Our contribution: extend scope to lyrics New feature sets based on song lyrics Motivation: complementary characteristics improved results

4/23 Contributions Develop new feature sets based on song lyrics Rhymes, part-of-speech, text genre descriptions Compare to traditional bag-of-words Compare to audio features Rhythm Patterns (RP) Rhythm Histograms (RH) Statistical Spectrum Descriptors (SSD) Build various combinations of feature sets Evaluate genre classification performance

5/23 Outline 1. Introduction Motivation 2. Lyrics feature sets Feature representations for song lyrics 3. Experiments Test collections in Music IR and experimental setting 4. Conclusions and future work Things to do and see

6/23 Bag-of-words features Different genres different topics Covered by bag-of-words approach Index every word as feature, count frequencies Optional: remove stop words (manual list, frequency thresholding) Optional: apply stemming Apply tf idf weighting to vector values

7/23 Text genre features: statistics (1/2) Assumption that some genres use simpler or just fewer unique words than others Some genres might use more explicit language - different punctuations, usage of numbers, etc. Measures for text genre descriptions

8/23 Text genre features: statistics (2/2) Feature name ExclamationMark, colon, single- Quote, comma, questionmark, dot, hyphen, semicolon d0 - d9 WordsPerLine UniqueWordsPerLine UniqueWordsRatio CharsPerWord WordsPerMinute Description simple counts Counts of digits Words / #of lines Unique words / #of lines Unique words / words # of chars / # of words # of words / length

9/23 Text genre features: part-of-speech Assumption that categories of words used will differ across genres lexical categorisation or grammatical tagging nouns, verbs, pronouns, prepositions, adverbs, articles, modals, and adjectives We use simple counts, normalised by song length

10/23 Rhyme features (1/2) Assumption that different genres use different rhyme styles (and that they can be detected from lyrics text) e.g. Hip-Hop: sound with a dominant bass, lyrics make heavy use of rhymes Rhymes Linguistic style, based on consonance of similar sound of two or more syllables or whole words We consider only rhymes at ends of lines We perform a phoneme transcription (rather than using lexical word endings)

11/23 Rhyme features used (2/2) Feature name AA AABB ABAB ABBA RhymePercent UniqueRhymeWords Description Sequence of rhyming lines ( Couplet ) Two blocks of rhyming lines ( Clerihew) Alternating rhymes Nested rhyme sequence ( Enclosing rhyme ) Percentage of blocks that rhyme Fraction of unique terms used to build rhymes

12/23 Test collections in MIR Legal situation Music is a big business... Copyright restrictions apply Rather delicate to publish test corpora officially Well-known collections not suitable: No lyrics available/retrievable ISMIR/MIREX Genre and Rhythm collections No meta-data available to automatically fetch lyrics Collection used with MARSYAS

Compiling test collections Western popular music 10 genres Country, Folk, Grunge, Hip-Hop, Metal, Pop, Punk Rock, R&B, Reggae, Slow Rock Small Collection: 600 songs 159 artists Classes of equal size Lyrics manually cleansed! Large Collection: 3010 songs 188 artists 180-380 songs per class Lyrics automatically fetched, no manual cleansing 13/23

14/23 Text genre statistic feature analysis (a) question marks (b) words per minute

15/23 Part-of-speech feature analysis (c) articles (d) nouns

16/23 Rhyme feature analysis (e) unique rhyme words (f) rhymes AABB

17/23 Experimental setup 25 combinations of all feature sets (RP, RH, SSD, BOW, Rhyme, Part-of-Speech, Text genre statistic) Different classifiers: k-nn, Naïve Bayes, Decision Trees, Support Vector Machines Similar trends with all classifiers Assuming SSD as best audio-only classifier to be baseline Statistical significance tests against that baseline 10-fold cross-validation

18/23 Classification results (600 songs) Feature combination Dim SVM ssd (base classifier) 168 59.17 rh 60 35.37 rp 1440 48.37 textstatistic 23 29.83 pos 9 19.21 rhyme 6 14.46 textstatistic/pos 32 31.29 BOW/ssd 9434 53.46 BOW/ssd/textstatistic/pos/rhyme 9472 54.21 ssd/textstatistic 191 64.33 ssd/textstatistic/pos 200 64.50 ssd/textstatistic/rhyme 197 63.71

19/23 Classification results (3010 songs) Feature combination Dim SVM ssd (base classifier) 168 66.32 rh 60 35.01 rp 1440 55.37 textstatistic 23 28.72 pos 9 12.66 rhyme 6 15.83 textstatistic/pos 32 28.72 BOW/ssd 2140 66.44 BOW/ssd/textstatistic/pos/rhyme 2178 67.06 ssd/textstatistic 191 68.72 ssd/textstatistic/pos 200 68.72 ssd/textstatistic/rhyme 197 68.16

20/23 Experiment variations Analyse effect of stemming Stemming lead to slightly better results Analyse effect of manual cleansing of lyrics Cleansed lyrics yielded slightly better results

21/23 Recap Music is inherently multi-modal New feature sets for lyrics genre categorisation Classification results on combinations Clearly outperforms bag-of-words only approach Improves classification of audio-only features Automatically fetched lyrics still are significantly better New features strong where audio already strong...

Future work More sophisticated text and rhyme features for lyrics Ensemble learning Maybe one classifier per feature set? Integrate automated lyrics alignment / preprocessing Extend multi-modal classification to other modalities Album covers Music videos 22/23

23/23 Thomas Lidy and Andreas Rauber. Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. In Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR 05), pages 34 41, London, UK, September 11-15 2005. Rudolf Mayer, Robert Neumayer, and Andreas Rauber. Rhyme and style features for musical genre classification by song lyrics. In Proceedings of the 9th International Conference on Music Information Retrieval (ISMIR 08), Philadelphia, PA, USA, September 14-18 2008.