Music Information Retrieval

Similar documents
CTP431- Music and Audio Computing Music Information Retrieval. Graduate School of Culture Technology KAIST Juhan Nam

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval


Introductions to Music Information Retrieval

Content-based music retrieval

MUSI-6201 Computational Music Analysis

Music Genre Classification and Variance Comparison on Number of Genres

Outline. Why do we classify? Audio Classification

Music Information Retrieval Community

Music Similarity and Cover Song Identification: The Case of Jazz

Computational Modelling of Harmony

Data Driven Music Understanding

The Million Song Dataset

Music Information Retrieval

Analysing Musical Pieces Using harmony-analyser.org Tools

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

Music Information Retrieval (MIR)

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

Music Information Retrieval. Juan P Bello

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

Singer Traits Identification using Deep Neural Network

Contextual music information retrieval and recommendation: State of the art and challenges

A Survey of Audio-Based Music Classification and Annotation

Singer Recognition and Modeling Singer Error

KÜNSTLICHE INTELLIGENZ ALS PERSONALISIERTER KOMPONIST AUTOMATISCHE MUSIKERZEUGUNG ALS DAS ENDE DER TANTIEMEN?

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Subjective Similarity of Music: Data Collection for Individuality Analysis

Beethoven, Bach, and Billions of Bytes

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam

Music Information Retrieval (MIR)

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Music Information Retrieval. Juan Pablo Bello MPATE-GE 2623 Music Information Retrieval New York University

Music Mood Classification - an SVM based approach. Sebastian Napiorkowski

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

Classification of Timbre Similarity

The song remains the same: identifying versions of the same piece using tonal descriptors

Music Information Retrieval

Using Genre Classification to Make Content-based Music Recommendations

Shades of Music. Projektarbeit

Audio. Meinard Müller. Beethoven, Bach, and Billions of Bytes. International Audio Laboratories Erlangen. International Audio Laboratories Erlangen

Music Information Retrieval for Jazz

Music Understanding and the Future of Music

Automatic Music Genre Classification

MUSIC SHAPELETS FOR FAST COVER SONG RECOGNITION

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

Music Genre Classification

Extracting Information from Music Audio

Automatic Music Clustering using Audio Attributes

Singer Identification

Frankenstein: a Framework for musical improvisation. Davide Morelli

Deep learning for music data processing

A TEXT RETRIEVAL APPROACH TO CONTENT-BASED AUDIO RETRIEVAL

Music Processing Introduction Meinard Müller

Automatic Music Similarity Assessment and Recommendation. A Thesis. Submitted to the Faculty. Drexel University. Donald Shaul Williamson

Statistical Modeling and Retrieval of Polyphonic Music

A CLASSIFICATION APPROACH TO MELODY TRANSCRIPTION

Video-based Vibrato Detection and Analysis for Polyphonic String Music

CTP 431 Music and Audio Computing. Course Introduction. Graduate School of Culture Technology (GSCT) Juhan Nam

BEYOND radio. Amy Pearl Pospiech UX Design Project Spring 17

LEARNING AUDIO SHEET MUSIC CORRESPONDENCES. Matthias Dorfer Department of Computational Perception

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Music Alignment and Applications. Introduction

Effects of acoustic degradations on cover song recognition

Aalborg Universitet. Feature Extraction for Music Information Retrieval Jensen, Jesper Højvang. Publication date: 2009

A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models

Chord Recognition. Aspects of Music. Musical Chords. Harmony: The Basis of Music. Musical Chords. Musical Chords. Music Processing.

Greeley-Evans School District 6 High School Vocal Music Curriculum Guide Unit: Men s and Women s Choir Year 1 Enduring Concept: Expression of Music

Detecting Musical Key with Supervised Learning

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

Lecture 15: Research at LabROSA

Music Structure Analysis

Supervised Learning in Genre Classification

Musical Hit Detection

Music Mood. Sheng Xu, Albert Peyton, Ryan Bhular

CS 591 S1 Computational Audio

Tempo and Beat Analysis

Part IV: Personalization, Context-awareness, and Hybrid Methods

The following General Music performance objectives are integrated throughout the entire course: MUSIC SKILLS

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC

Chord Classification of an Audio Signal using Artificial Neural Network

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

Digital audio and computer music. COS 116, Spring 2012 Guest lecture: Rebecca Fiebrink

A CHROMA-BASED SALIENCE FUNCTION FOR MELODY AND BASS LINE ESTIMATION FROM MUSIC AUDIO SIGNALS

Week 14 Music Understanding and Classification

Lecture 9 Source Separation

Music Synchronization. Music Synchronization. Music Data. Music Data. General Goals. Music Information Retrieval (MIR)

Data-Driven Solo Voice Enhancement for Jazz Music Retrieval

Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines

Audio Structure Analysis

TEST SUMMARY AND FRAMEWORK TEST SUMMARY

Music Information Retrieval with Temporal Features and Timbre

Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification

Can Song Lyrics Predict Genre? Danny Diekroeger Stanford University

Data Driven Music Understanding

2 2. Melody description The MPEG-7 standard distinguishes three types of attributes related to melody: the fundamental frequency LLD associated to a t

MUSIC CONTENT ANALYSIS : KEY, CHORD AND RHYTHM TRACKING IN ACOUSTIC SIGNALS

Transcription:

CTP 431 Music and Audio Computing Music Information Retrieval Graduate School of Culture Technology (GSCT) Juhan Nam 1

Introduction ü Instrument: Piano ü Composer: Chopin ü Key: E-minor ü Melody - ELO ADer all - Radiohead Exit Music ü Transcrip7on Music nota7on ü Genre: Classical ü Mood: Melancholy, Sad, 2

Music Information Retrieval (MIR) Information in Music Factual: track, artist, years Acoustic: loudness, pitch, timbre Symbolic: Instrument, melody, rhythm, chords, structure Semantic: genre, mood, user preference Area of research that aims to infer various types of information from music data Make computer understand music as human does Provide intelligent solutions to enhance human musical activities 3

MIR Tasks Audio fingerprinting Cover song detection Music transcription: melody, notes, tempo, chords Segmentation, structure, alignment Similarity-based retrieval, playlists, recommendation Classification: genre, mood, tags, Query by humming Source separation: vocal removal Symbolic MIR: score retrieval or harmony analysis Optical Music Recognition (OMR) MIREX: http://www.music-ir.org/mirex/wiki/mirex_home 4

MIR Research Disciplines Digital Signal Processing Acoustics Music theory Machine Learning Natural language processing / Computer vision Psychology Human-Computer Interaction 5

Application: Music Search Query by music Search a single unique song identified by the query Audio fingerprint Applied to movies, TV and ads, too Query by humming Sing with humming and find closest matches Melody match 6

Application: Music Recommendation Personalized Radio Generate Playlist Based on user data, similarity and context itunes Radio Pandora 7

Application: Score Following Listen to performance and track the notes Example: JKU, Tonara 8

Application: Score Following The Piano Music Companion (2013) Along with song identification 9

Application: Automatic Accompaniment Score following + Interactive Performance Examples: IRCAM s Antefesco, Sonation s Cadenza 10

Application: Entertainment / Education Focus on performance evaluation Learning musical instrument Examples: Ovelin s Yousician, MakeMusic s Smartmusic, Ubisoft s RockSmith, RockProdigy 11

Application: Music Production Sound Sample search Imagine Research s MediaMind: search sound effect sample for media production (e.g. film, drama) Izotope s Breaktweaker: search similar timbre of drum sounds 12

Application: Music Composition Automatic Song writing Automatic arrangement Example: MSR s Songsmith 13

CASE STUDY: Music Recommendation 14

Backgrounds Music record market Offline à Online music services CD à MP3 à Streaming audio Scale and diversity of music contents Commercial music tracks Spotify: 30M+ songs (2015) Bugs music: 4.1M+ songs (2015) User contents YouTube: 300h+ video uploaded per min (2015) SoundCloud: 12h+ audio uploaded per minute (2014) TV, cables and online media Music program, concert, music videos, audition, 15

Backgrounds Connection with human data Number of users Spotify: +24M active users (as of Jan, 2014) YouTube: +1B unique users visit each month (as of Dec, 2014) Personal data Play history, rate, personal music library Profile: age, occupation, Social data The majority of online services can be logged in via SNS Friends, followers Daily posting, blog (reviews), comments 16

Challenges There are too many choices of music contents How can we find music more easily or in a human-friendly way? Searching music with various queries (e.g. text, humming, audio tracks) Recommendation based on user data (e.g. play history, rating, location) We need to extract semantic or musical information from audio tracks, and match them to the query or user data Music Genre, Mood, Instrument, Song characteris7cs Query word, Play history, Rate Profile, Loca7on Discovery/Familiarity Users 17

Current Approaches Manual Curation Human Expert Analysis Collaborative Filtering Content-based Analysis (by computers) 18

Manual Curation Playlist generation by music experts (or users) Traditional: AM/FM radio The majority of current music services are based on this approach Advantages Effective for usage-based music services (workout, study, driving or prenatal education) Good for music discovery Often with story-telling Limitations No personalization Not scalable [www.soribada.com] 19

Human Expert Analysis Pandora: music genome project (1999) Musicologists analyze a song for about 450 musical attributes in various categories Big success as a music service Advantages High-quality analysis Good for music discovery Limitations Expensive: take 20-30 minutes for a song to be analyzed Not scalable : only for commercial tracks? 20

Collaborative Filtering (CF) Basic idea Person A: I like songs A, B, C and D. Person B: I like songs A, B, C and E. Person A: Really? You should check out song D. Person B: Wow, you also should check out song E. Formation Matrix factorization (or matrix completion) problem Song Preference p us = x u T y s y s User Similarity q u1u2 = x T u1 x u2 Juhan Gangnam Style x u Gangnam Style s latent vector Juhan s latent vector Song Similarity r s1s2 = y T s1 y s2 21

Collaborative Filtering Advantages Capture semantics of music in the aspect of human Enable personalized recommendation (by nature) Limitations The cold start problem: what if a song was never played by anyone? Popularity bias: likely to recommend (already) well-known songs or songs from the same musician or album 22

Collaborative Filtering Bad examples Can you find songs similar to this musician? These songs are already what I know well! [Oord et. al, 2013] 23

Content-Based Analysis: Music Auto-tagging Google has music service as part of Google play Their main features Instant mix, which automatically generates a playlist based on user s music collections or play history They do CF but also make use of audio content. How? Fast Company, July, 2013 24

Content-Based Analysis: Music Auto-tagging An intelligent approach that makes computers listen to music and predict descriptive words (i.e. tags) from audio tracks Features: MFCC, Chroma, Algorithms: GMM, SVM, Neural Networks Tags: genre, mood, instrument, voice quality, usage Basic Framework Audio Files Audio Features Algorithms Classical Jazz Metal 25

Example of Auto-tagging This is a [ ] song that is [ ], [ ] and [ ]. It features [ ] and [ ] vocal. It is a song with [ ] and [ ] that you might like to listen to while [ ]. This is a [ very danceable ] song that is [ arousing/awakening ], [ exci5ng/ thrilling ] and [ happy ]. It features [ strong ] and [ fast tempo ] vocal. It is a song with [ high energy ] and [ high beat ] that you might like to listen to while [ at a party ]. James Brown Give it up or turn it a loose This is a [ pop ] song that is [ happy ], [ carefree/lighthearted ] and [ light/ playful ]. It features [ high-pitched ] vocal and [ altered with effects ] vocal. It is a song with [ posi5ve feeling ] that you might like to listen to while [ at a party ]. Cardigans - Lovefool 26

Text-based Music Retrieval by Auto-tagging Sort the probability of the query tag and choose top-n songs Like text-based Google search Query word: Female Lead Vocals Top 5 ranked songs Norah Jones Don t know why Dido Here with me Sheryl Crow I shall believe No doubt Simple kind of like Carpenters Rainy days and Mondays We also can compute similarity between songs using the estimated tag probabilities E.g. cosine distance between two tag probability vectors Applicable to query by audio 27

Content-based Music Recommendation Blending audio and user data Replace the text-based tags with the latent vector of a song user song Gangnam Style s latent vector Matrix factoriza7on from collabora7ve filtering [Oord et. al, 2013] Audio Track of Gangnam Style 28

Music Retrieval Results Collabora7ve Filtering only Collabora7ve Filtering + Audio Content [Oord et. al, 2013] 29

Content-Based Analysis: Music Auto-tagging Advantages Free of cold-start and popularity bias Highly scalable: using high-performance computing Works for music in other media or user content as well Can be combined with other approaches Limitations Some tags are unpredictable: indy, idol, Hard to measure music quality (or level of performance), especially for user contents 30

CASE STUDY: Score Following 31

Music Score Following Tracking played notes while listening to the music Temporally align different representations or renditions of music Audio to Audio, Audio to Score (or MIDI)

Music Score Following Extracting Chroma Features Capture harmonic (or tonal) characteristics of music MIDI Lisitsa CENS : Normalized Chroma Features (Muller, 2005) 33

Music Score Following Computing (Dis)similarity Matrix 34

Music Score Following Computing the Shortest Path using Dynamic Time Warping Local Similarity Accumulated Similarity 35

Score Following Demo