Complex audio feature extraction: Transcription

Size: px
Start display at page:

Download "Complex audio feature extraction: Transcription"

Transcription

1 Complex audio feature extraction: Transcription Anja Volk Sound and Music Technoloy, 14 Dec,

2 Outline What is transcription? basic representations of musical content Application: query by Hummin Audio and symbolic representations in folk son (multiple) F0 estimation Melody transcription Chord transcription 2

3 Recap Audio features for corpus analysis Why do we undertake corpus analysis Examples: What makes a chorus? What makes a hook? What type of audio features: Psycho-acoustic audio features Corpus-relative features Evolution of Contemporary Western Popular music Pitch, timbre, loudness Million Son Database Graphs constructed Results Pitch variety remains constant Timbre homoenizes Loudness increases 3

4 Transcription: Basic representations of musical content musical content example compare imae compare text structure convert to above convert to below Diital audio (MP3, Wav) level 1: primitive features speech none - hard Timestamped events (MIDI) level 2: objects text little easy fairly hard (OK job) Music notation (Finale, Sibelius, MusicXML) level 2: compound objects text + markup much easy (OK job) - 4

5 Transcription: Basic representations of musical content transcription musical content example compare imae compare text structure convert to above convert to below Diital audio (MP3, Wav) level 1: primitive features speech none - hard Timestamped events (MIDI) level 2: objects text little easy fairly hard (OK job) Music notation (Finale, Sibelius, MusicXML) level 2: compound objects text + markup much easy (OK job) - 5

6 Audio transcription reconstruction of sound events, or even music notation, from audio sinals sound events is what we perceive music notation can be seen as an approximation of our perception but other symbolic representations of sound events may be equally useful audio transcription problem is a major obstacle lots of research usable solutions exist for controlled situations 6

7 What does audio transcription involve? many different tasks, often covered by MIREX (multiple) F0 estimation onset detection melody extraction audio chord estimation instrument reconition 7

8 Applications Cover son detection Query by hummin Audio notation alinment (score followin) Example: PHENICX project on enhancin experience of concert audience: Performance analysis not all of these require a full transcription 8

9 9 Questionnaire: musicoloy challenes for MIR

10 Where it started Ghias et al., Query By Hummin: Musical Information Retrieval in an Audio Database (1995) actually, searchin MIDI files but the query is user-enerated audio matchin after audio transcription 11

11 Query by Hummin How (Ghias et al. 1995)? F0 estimation human voice has peculiar resonance properties formants (hih frequency ranes that we perceive as vowels) autocorrelation based method worked best output turned into 3-letter alphabet a.k.a. Parson s code U, D, S (up, down, same) approximate strin matchin results sequences of pitch transitions were sufficient to discriminate 90% of the sons hih hopes for MIR on the Internet 12

12 QBH as a research paradim attractive scenario with very stron impact in the next 10 years (and even up to now) audio-to-symbolic matchin audio-to-audio matchin how realistic is it? MIREX Query By Sinin/Hummin (QBSH) run since 2006, led by Roer Jan several datasets, includin Jan s MIR-SBSH corpus 4431 audio queries 48 MIDI round truths noise from Essen Folk Sons evaluation: Mean Reciprocal Rank scores up to (overfittin?) 13

13 SoundHound currently, the most visible commercial application of QBH (from 1:27) SoundHound documentation doesn t explain much nd2sound vaue compact and flexible Crystal representation matches multiple aspects of the user's rendition (includin melody, rhythm, and lyrics) with millions of user recordins from midomi.com) Midomi database consists of monophonic melodies linked to metadata unclear whether and to what extent melodies are transcribed matchin is monophonic 14

14 Human performance in QBH constraints on QBH input sound production (i.e. sinin) melody accuracy usually, beinnin PhD research Micheline Lesaffre (Ghent U.) Music Information Retrieval: Conceptual Framework, Annotation and User Behavior (2005) experiment: what musical queries do people create when iven complete freedom? rane is lare 15

15 F0 estimation determinin the perceptual pitch from the acoustic sinal the monophonic case is often considered solved see e.. hih scores for QBSH query by sinin hummin no separate MIREX task in practice, pitch transcription is far from easy sinal vs. perception, octave errors recordin quality musical competence of performers 16

16 The WITCHCRAFT case task: build a content-based search enine for 7000 Dutch folkson recordins initial plan: use MAMI transcriber (Ghent University) state-of-the-art in 2005 based on computational model of chochlea produces neural firin patterns selection of pitch candidates sementation (division into notes) based on sun lyrics outpreformed many other methods with realistic input 17

17 The challene lots of sons were just too hard to be transcribed reliably son OGL is just an averae case beinnin end that s why we chose to use the existin paper transcriptions and encode these instead 18

18 symbolic-audio alinment Meertens Tune Collections audio AND encodin AND notation AND metadata AND annotations useful e.. as round truth for F0 estimation example: research Müller, Grosche & Wierin 2009 MIDI is 1 strophe, audio contains many strophes matchin miht produce sementation chroma from MIDI chroma from audio 19

19 dealin with pitch drift chroma shifted over 1 / 24 octave distances sementation performance 0,9-0,95 (F measure) may support further variation / music performance research 20

20 Audio folk son retrieval in the end, we did experiment with audio retrieval (PhD thesis Peter van Kranenbur, 2010) F0 estimation usin Yin (De Cheveiné & Kawahara) autocorrelation based; error prevention adaptations: smoothin over 11 frames, usin median initial retrieval experiment with sements: OK results 21

21 Melody transcription 2 steps: (1) estimate when the melody is present and when it is not (2) estimate the correct pitch of the melody when it is present Considered as a still unsolved problem Issue 1: separatin components of polyphonic mixture of complex sounds very difficult Issue 2: estimatin pitch from the separated stream is still a challene in itself Issue 3: determinin note boundaries challenin for several instruments, includin voice (onset and transition blurred) 22

22 Melody transcription Example a male siner (melody) accompaniment (drums, piano, uitar) M. Mueller: Fundamentals of Music Processin P

23 Melody transcription Example Spectroram isolated sinin voice melody M. Mueller: Fundamentals of Music Processin P

24 Melody transcription Example Spectroram full recordin Siner is active M. Mueller: Fundamentals of Music Processin P

25 Melody transcription YIN Cheveine & Kawahara (2001) pitch-estimation alorithm operatin in the time domain (no Fourier transformation) Works with autocorrelation function (ACF) of the sinal 26

26 Melody transcription YIN Cheveine & Kawahara (2001) pitch-estimation alorithm operatin in the time domain (no Fourier transformation) Works with autocorrelation function (ACF) of the sinal 27

27 Melody transcription YIN Cheveine & Kawahara (2001) pitch-estimation alorithm operatin in the time domain (no Fourier transformation) Works with autocorrelation function (ACF) of the sinal the sinals on which YIN is typically used are larely monophonic, with only one, or one very dominant pitch present 28

28 Melody transcription Melodia: Justin Salamon and Emilia Gomez. Melody Extraction From Polyphonic Music Sinals Usin Pitch Contour Characteristics. IEEE Transactions on Audio, Speech, and Lanuae Processin, 20(6): ,

29 Melody transcription Melodia: Justin Salamon and Emilia Gomez. Melody Extraction From Polyphonic Music Sinals Usin Pitch Contour Characteristics. IEEE Transactions on Audio, Speech, and Lanuae Processin, 20(6): , Which frequencies are present in audio sinal at every point in time? 30

30 Melody transcription Melodia: Justin Salamon and Emilia Gomez. Melody Extraction From Polyphonic Music Sinals Usin Pitch Contour Characteristics. IEEE Transactions on Audio, Speech, and Lanuae Processin, 20(6): , search for a harmonic series of frequencies that would contribute to our perception of this pitch Salience = (weihted) sum of eneries of these harmonic frequencies 31

31 Melody transcription Melodia: Justin Salamon and Emilia Gomez. Melody Extraction From Polyphonic Music Sinals Usin Pitch Contour Characteristics. IEEE Transactions on Audio, Speech, and Lanuae Processin, 20(6): , pitch contour = a series of consecutive pitch values which are continuous in both time and frequency 32

32 Melody transcription Melodia: Justin Salamon and Emilia Gomez. Melody Extraction From Polyphonic Music Sinals Usin Pitch Contour Characteristics. IEEE Transactions on Audio, Speech, and Lanuae Processin, 20(6): , By studyin the distribution of these characteristics for contours that belon to melodies and contours that belon to accompaniments, we were able to devise a set of rules for filterin out non-melodic contours! calculation of contour characteristics. 33

33 Melody transcription Melodia main melody extraction from polyphonic sinals start from a frequency representation of the sinal, and assess, the salience of all possible candidate pitches based on a hih-resolution STFT, from which peaks are found Estimate of exact location: instantaneous frequency is found by considerin the manitudes of the spectrum in each frame, andinterpolatin between the phases of peaks in consecutive frames of the STFT Justin Salamon and Emilia Gomez. Melody Extraction From Polyphonic Music Sinals Usin Pitch Contour Characteris- tics. IEEE Transactions on Audio, Speech, and Lanuae Processin, 20(6): ,

34 Melody transcription Melodia Then: harmonic summarization on the set of peaks peak candidates are rouped in time-varyin melodic contours usin a set of heuristics based on perceptual streamin cues candidate contours are then iven a score based on their total salience and shape, and post-processed selects the set of contours that most likely constitutes the melody. Justin Salamon and Emilia Gomez. Melody Extraction From Polyphonic Music Sinals Usin Pitch Contour Characteris- tics. IEEE Transactions on Audio, Speech, and Lanuae Processin, 20(6): ,

35 Melody transcription Melodia Demo: Justin Salamon and Emilia Gomez. Melody Extraction From Polyphonic Music Sinals Usin Pitch Contour Characteris- tics. IEEE Transactions on Audio, Speech, and Lanuae Processin, 20(6): ,

36 Melody transcription Data-driven and source separation-based systems extract the melody by separatin it from the rest of the mix E..: use a trained timbre model to describe each of two sources (melody and accompaniment) timbre models can be Gaussian mixture models (GMM s), in which each source is seen as a weihted sum of a finite set of multidimensional Gaussians, each describin a particular spectral shape, or hidden Markov models (HMM), a eneralisation of GMM s Models are trained on on source-separated round truth data usin expectation maximization Graham E. Poliner and Dan Ellis. A Classification Approach to Music Transcription. Proc. 6th International Society for Music Information Retrieval Conference,

37 The matter of onset detection most transcriptions so far present a pitch contour onset-offset detection missin Audio onset detection is (aain) MIREX task difficulty of subtasks is very different 38

38 Multiple F0 estimation many approaches tried Deep Learnin (neural networks) Non-neative Matrix Factorization (NMF) for example, in NMF the audio sinal is rendered as a matrix, which is the a product of a matrix of activations (sound events) a matrix of (learned) note templates aim is to calculate the matrix of activations that has the best match with the audio sinal activations correspond to notes automatic transcription of piano piece (black) and round truth (white) 39

39 Chord reconition Melody vs Chords: Note: a sinle soundin tone with a pitch (heiht) Melody: a sequence of monophonic notes 40

40 Chord reconition Melody vs Chords: Note: a sinle soundin tone with a pitch (heiht) Melody: a sequence of monophonic notes Chords: soundin of simultaneous (at least two) notes Chord sequence: a sequence of chords 41

41 From Score to Audio Score Easy Audio recordin 42

42 From Audio to Score Score Hard Audio recordin 43

43 Pipeline: chord reconition Sound Chromaram Chord candidates Key chanes Model based selection Chords Match a the beat synchronised chroma features with chord templates Different approaches: Use knowlede-based templates Match by Euclidean distance Learn an averae profile from the data Learn 12 dimensional Gaussian distribution for all chords 44

44 Case study: chord reconition Sound Chromaram Chord candidates Key chanes Model based selection Chords Some models require information about the Key General approach Krumhansl-Kessler profiles Match by Pearson correlation with chroma feature Variations exist Approach is similar to chord candidate selection 45

45 Short excerpt: What is a key? pitch is enerally not equally distributed within a piece of music if it is, you et atonal music e.. Webern s piano variations when we use only a subset, music enerally sounds much more structured 46

46 Short excerpt: What is a key? subsets are often visualised as musical scales perceptually, they help us identify tonality music hovers around certain pitch, the final or tonic most common scales: major and minor 7 different pitches within octave most audible difference: third note of scale chane can be quite dramatic major minor 47 ex. Gustav Mahler, 1 st symphony, 3 rd mvt

47 Case study: chord reconition Sound Chromaram Chord candidates Key chanes Model based selection Chords Some models require information about the Key General approach Krumhansl-Kessler profiles Match by Pearson correlation with chroma feature Variations exist Approach is similar to chord candidate selection 48

48 Case study: chord reconition Sound Chromaram Chord candidates Key chanes Model based selection Chords Two kind of approaches: Knowlede driven: Musical knowlede is modelled and used to select a plausible chord sequence Data driven: The Transition probabilities between chords are learned from a lare corpus of chords 49

49 Case study: chord reconition Sound Chromaram Chord candidates Key chanes Model based selection Chords Knowlede driven: HarmTrace model (de Haas): Analyses the function of chord Needs key information Robust aainst noisy data Flexible Based on functional prorammin 50

50 Case study: chord reconition Sound Chromaram Chord candidates Key chanes Model based selection Chords Data Driven: Hidden Markov models: Estimate the probability of a chord-transition by countin the chord transitions in a corpus Use chroma features to estimate the probability of a chord candidate Goal: to find the most likely sequence of chords that results on the curren chromaram: Viterbi alorithm Many variants exist 51

51 Case study: chord reconition Sound Chromaram Chord candidates Key chanes Model based selection Chords 52

52 Practical approaches based on consideration of (western) predominant musical structures melody + accompaniment is common complex polyphony is relatively scarce, especially in popular music counterpoint between the inner voices does matter in classical works such as fuues (or the ex. of the Shostakovich strin quartet) pramatic approaches are thus focussin on melody extraction chord reconition 53

53 Summary transcription = derivin a symbolic representation from musical audio F0 estimation: derivin perceived pitch from audio Query by Hummin as early MIR paradim F0 estimation for monophony considered solved but in practice multiple F0 estimation: very hard task practical restrictions: melody extraction, chord labellin onset detection: yet another important task no functional systems yet that combine F0 extraction and onset detection 54

Transcription of the Singing Melody in Polyphonic Music

Transcription of the Singing Melody in Polyphonic Music Transcription of the Singing Melody in Polyphonic Music Matti Ryynänen and Anssi Klapuri Institute of Signal Processing, Tampere University Of Technology P.O.Box 553, FI-33101 Tampere, Finland {matti.ryynanen,

More information

Audio Feature Extraction for Corpus Analysis

Audio Feature Extraction for Corpus Analysis Audio Feature Extraction for Corpus Analysis Anja Volk Sound and Music Technology 5 Dec 2017 1 Corpus analysis What is corpus analysis study a large corpus of music for gaining insights on general trends

More information

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University Week 14 Query-by-Humming and Music Fingerprinting Roger B. Dannenberg Professor of Computer Science, Art and Music Overview n Melody-Based Retrieval n Audio-Score Alignment n Music Fingerprinting 2 Metadata-based

More information

Tempo and Beat Analysis

Tempo and Beat Analysis Advanced Course Computer Science Music Processing Summer Term 2010 Meinard Müller, Peter Grosche Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Tempo and Beat Analysis Musical Properties:

More information

Topic 10. Multi-pitch Analysis

Topic 10. Multi-pitch Analysis Topic 10 Multi-pitch Analysis What is pitch? Common elements of music are pitch, rhythm, dynamics, and the sonic qualities of timbre and texture. An auditory perceptual attribute in terms of which sounds

More information

THE importance of music content analysis for musical

THE importance of music content analysis for musical IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 1, JANUARY 2007 333 Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With

More information

Music Radar: A Web-based Query by Humming System

Music Radar: A Web-based Query by Humming System Music Radar: A Web-based Query by Humming System Lianjie Cao, Peng Hao, Chunmeng Zhou Computer Science Department, Purdue University, 305 N. University Street West Lafayette, IN 47907-2107 {cao62, pengh,

More information

MELODY EXTRACTION FROM POLYPHONIC AUDIO OF WESTERN OPERA: A METHOD BASED ON DETECTION OF THE SINGER S FORMANT

MELODY EXTRACTION FROM POLYPHONIC AUDIO OF WESTERN OPERA: A METHOD BASED ON DETECTION OF THE SINGER S FORMANT MELODY EXTRACTION FROM POLYPHONIC AUDIO OF WESTERN OPERA: A METHOD BASED ON DETECTION OF THE SINGER S FORMANT Zheng Tang University of Washington, Department of Electrical Engineering zhtang@uw.edu Dawn

More information

Singer Recognition and Modeling Singer Error

Singer Recognition and Modeling Singer Error Singer Recognition and Modeling Singer Error Johan Ismael Stanford University jismael@stanford.edu Nicholas McGee Stanford University ndmcgee@stanford.edu 1. Abstract We propose a system for recognizing

More information

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function EE391 Special Report (Spring 25) Automatic Chord Recognition Using A Summary Autocorrelation Function Advisor: Professor Julius Smith Kyogu Lee Center for Computer Research in Music and Acoustics (CCRMA)

More information

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes hello Jay Biernat Third author University of Rochester University of Rochester Affiliation3 words jbiernat@ur.rochester.edu author3@ismir.edu

More information

Piano Transcription MUMT611 Presentation III 1 March, Hankinson, 1/15

Piano Transcription MUMT611 Presentation III 1 March, Hankinson, 1/15 Piano Transcription MUMT611 Presentation III 1 March, 2007 Hankinson, 1/15 Outline Introduction Techniques Comb Filtering & Autocorrelation HMMs Blackboard Systems & Fuzzy Logic Neural Networks Examples

More information

A CHROMA-BASED SALIENCE FUNCTION FOR MELODY AND BASS LINE ESTIMATION FROM MUSIC AUDIO SIGNALS

A CHROMA-BASED SALIENCE FUNCTION FOR MELODY AND BASS LINE ESTIMATION FROM MUSIC AUDIO SIGNALS A CHROMA-BASED SALIENCE FUNCTION FOR MELODY AND BASS LINE ESTIMATION FROM MUSIC AUDIO SIGNALS Justin Salamon Music Technology Group Universitat Pompeu Fabra, Barcelona, Spain justin.salamon@upf.edu Emilia

More information

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS Andrew N. Robertson, Mark D. Plumbley Centre for Digital Music

More information

Multiple instrument tracking based on reconstruction error, pitch continuity and instrument activity

Multiple instrument tracking based on reconstruction error, pitch continuity and instrument activity Multiple instrument tracking based on reconstruction error, pitch continuity and instrument activity Holger Kirchhoff 1, Simon Dixon 1, and Anssi Klapuri 2 1 Centre for Digital Music, Queen Mary University

More information

Data-Driven Solo Voice Enhancement for Jazz Music Retrieval

Data-Driven Solo Voice Enhancement for Jazz Music Retrieval Data-Driven Solo Voice Enhancement for Jazz Music Retrieval Stefan Balke1, Christian Dittmar1, Jakob Abeßer2, Meinard Müller1 1International Audio Laboratories Erlangen 2Fraunhofer Institute for Digital

More information

Topic 4. Single Pitch Detection

Topic 4. Single Pitch Detection Topic 4 Single Pitch Detection What is pitch? A perceptual attribute, so subjective Only defined for (quasi) harmonic sounds Harmonic sounds are periodic, and the period is 1/F0. Can be reliably matched

More information

Effects of acoustic degradations on cover song recognition

Effects of acoustic degradations on cover song recognition Signal Processing in Acoustics: Paper 68 Effects of acoustic degradations on cover song recognition Julien Osmalskyj (a), Jean-Jacques Embrechts (b) (a) University of Liège, Belgium, josmalsky@ulg.ac.be

More information

Introductions to Music Information Retrieval

Introductions to Music Information Retrieval Introductions to Music Information Retrieval ECE 272/472 Audio Signal Processing Bochen Li University of Rochester Wish List For music learners/performers While I play the piano, turn the page for me Tell

More information

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES 12th International Society for Music Information Retrieval Conference (ISMIR 2011) A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES Erdem Unal 1 Elaine Chew 2 Panayiotis Georgiou

More information

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC G.TZANETAKIS, N.HU, AND R.B. DANNENBERG Computer Science Department, Carnegie Mellon University 5000 Forbes Avenue, Pittsburgh, PA 15213, USA E-mail: gtzan@cs.cmu.edu

More information

Statistical Modeling and Retrieval of Polyphonic Music

Statistical Modeling and Retrieval of Polyphonic Music Statistical Modeling and Retrieval of Polyphonic Music Erdem Unal Panayiotis G. Georgiou and Shrikanth S. Narayanan Speech Analysis and Interpretation Laboratory University of Southern California Los Angeles,

More information

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications Matthias Mauch Chris Cannam György Fazekas! 1 Matthias Mauch, Chris Cannam, George Fazekas Problem Intonation in Unaccompanied

More information

Computational Modelling of Harmony

Computational Modelling of Harmony Computational Modelling of Harmony Simon Dixon Centre for Digital Music, Queen Mary University of London, Mile End Rd, London E1 4NS, UK simon.dixon@elec.qmul.ac.uk http://www.elec.qmul.ac.uk/people/simond

More information

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval DAY 1 Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval Jay LeBoeuf Imagine Research jay{at}imagine-research.com Rebecca

More information

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng Introduction In this project we were interested in extracting the melody from generic audio files. Due to the

More information

Automatic music transcription

Automatic music transcription Music transcription 1 Music transcription 2 Automatic music transcription Sources: * Klapuri, Introduction to music transcription, 2006. www.cs.tut.fi/sgn/arg/klap/amt-intro.pdf * Klapuri, Eronen, Astola:

More information

MUSI-6201 Computational Music Analysis

MUSI-6201 Computational Music Analysis MUSI-6201 Computational Music Analysis Part 9.1: Genre Classification alexander lerch November 4, 2015 temporal analysis overview text book Chapter 8: Musical Genre, Similarity, and Mood (pp. 151 155)

More information

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES Vishweshwara Rao and Preeti Rao Digital Audio Processing Lab, Electrical Engineering Department, IIT-Bombay, Powai,

More information

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC Vishweshwara Rao, Sachin Pant, Madhumita Bhaskar and Preeti Rao Department of Electrical Engineering, IIT Bombay {vishu, sachinp,

More information

A CLASSIFICATION APPROACH TO MELODY TRANSCRIPTION

A CLASSIFICATION APPROACH TO MELODY TRANSCRIPTION A CLASSIFICATION APPROACH TO MELODY TRANSCRIPTION Graham E. Poliner and Daniel P.W. Ellis LabROSA, Dept. of Electrical Engineering Columbia University, New York NY 127 USA {graham,dpwe}@ee.columbia.edu

More information

2 2. Melody description The MPEG-7 standard distinguishes three types of attributes related to melody: the fundamental frequency LLD associated to a t

2 2. Melody description The MPEG-7 standard distinguishes three types of attributes related to melody: the fundamental frequency LLD associated to a t MPEG-7 FOR CONTENT-BASED MUSIC PROCESSING Λ Emilia GÓMEZ, Fabien GOUYON, Perfecto HERRERA and Xavier AMATRIAIN Music Technology Group, Universitat Pompeu Fabra, Barcelona, SPAIN http://www.iua.upf.es/mtg

More information

NOTE-LEVEL MUSIC TRANSCRIPTION BY MAXIMUM LIKELIHOOD SAMPLING

NOTE-LEVEL MUSIC TRANSCRIPTION BY MAXIMUM LIKELIHOOD SAMPLING NOTE-LEVEL MUSIC TRANSCRIPTION BY MAXIMUM LIKELIHOOD SAMPLING Zhiyao Duan University of Rochester Dept. Electrical and Computer Engineering zhiyao.duan@rochester.edu David Temperley University of Rochester

More information

Content-based music retrieval

Content-based music retrieval Music retrieval 1 Music retrieval 2 Content-based music retrieval Music information retrieval (MIR) is currently an active research area See proceedings of ISMIR conference and annual MIREX evaluations

More information

Music Similarity and Cover Song Identification: The Case of Jazz

Music Similarity and Cover Song Identification: The Case of Jazz Music Similarity and Cover Song Identification: The Case of Jazz Simon Dixon and Peter Foster s.e.dixon@qmul.ac.uk Centre for Digital Music School of Electronic Engineering and Computer Science Queen Mary

More information

Subjective Similarity of Music: Data Collection for Individuality Analysis

Subjective Similarity of Music: Data Collection for Individuality Analysis Subjective Similarity of Music: Data Collection for Individuality Analysis Shota Kawabuchi and Chiyomi Miyajima and Norihide Kitaoka and Kazuya Takeda Nagoya University, Nagoya, Japan E-mail: shota.kawabuchi@g.sp.m.is.nagoya-u.ac.jp

More information

Efficient Vocal Melody Extraction from Polyphonic Music Signals

Efficient Vocal Melody Extraction from Polyphonic Music Signals http://dx.doi.org/1.5755/j1.eee.19.6.4575 ELEKTRONIKA IR ELEKTROTECHNIKA, ISSN 1392-1215, VOL. 19, NO. 6, 213 Efficient Vocal Melody Extraction from Polyphonic Music Signals G. Yao 1,2, Y. Zheng 1,2, L.

More information

A COMPARISON OF MELODY EXTRACTION METHODS BASED ON SOURCE-FILTER MODELLING

A COMPARISON OF MELODY EXTRACTION METHODS BASED ON SOURCE-FILTER MODELLING A COMPARISON OF MELODY EXTRACTION METHODS BASED ON SOURCE-FILTER MODELLING Juan J. Bosch 1 Rachel M. Bittner 2 Justin Salamon 2 Emilia Gómez 1 1 Music Technology Group, Universitat Pompeu Fabra, Spain

More information

A probabilistic framework for audio-based tonal key and chord recognition

A probabilistic framework for audio-based tonal key and chord recognition A probabilistic framework for audio-based tonal key and chord recognition Benoit Catteau 1, Jean-Pierre Martens 1, and Marc Leman 2 1 ELIS - Electronics & Information Systems, Ghent University, Gent (Belgium)

More information

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller) Topic 11 Score-Informed Source Separation (chroma slides adapted from Meinard Mueller) Why Score-informed Source Separation? Audio source separation is useful Music transcription, remixing, search Non-satisfying

More information

Chroma-based Predominant Melody and Bass Line Extraction from Music Audio Signals

Chroma-based Predominant Melody and Bass Line Extraction from Music Audio Signals Chroma-based Predominant Melody and Bass Line Extraction from Music Audio Signals Justin Jonathan Salamon Master Thesis submitted in partial fulfillment of the requirements for the degree: Master in Cognitive

More information

Lecture 9 Source Separation

Lecture 9 Source Separation 10420CS 573100 音樂資訊檢索 Music Information Retrieval Lecture 9 Source Separation Yi-Hsuan Yang Ph.D. http://www.citi.sinica.edu.tw/pages/yang/ yang@citi.sinica.edu.tw Music & Audio Computing Lab, Research

More information

Pattern Recognition in Music

Pattern Recognition in Music Pattern Recognition in Music SAMBA/07/02 Line Eikvil Ragnar Bang Huseby February 2002 Copyright Norsk Regnesentral NR-notat/NR Note Tittel/Title: Pattern Recognition in Music Dato/Date: February År/Year:

More information

A Survey of Audio-Based Music Classification and Annotation

A Survey of Audio-Based Music Classification and Annotation A Survey of Audio-Based Music Classification and Annotation Zhouyu Fu, Guojun Lu, Kai Ming Ting, and Dengsheng Zhang IEEE Trans. on Multimedia, vol. 13, no. 2, April 2011 presenter: Yin-Tzu Lin ( 阿孜孜 ^.^)

More information

Timing In Expressive Performance

Timing In Expressive Performance Timing In Expressive Performance 1 Timing In Expressive Performance Craig A. Hanson Stanford University / CCRMA MUS 151 Final Project Timing In Expressive Performance Timing In Expressive Performance 2

More information

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods Kazuyoshi Yoshii, Masataka Goto and Hiroshi G. Okuno Department of Intelligence Science and Technology National

More information

Query By Humming: Finding Songs in a Polyphonic Database

Query By Humming: Finding Songs in a Polyphonic Database Query By Humming: Finding Songs in a Polyphonic Database John Duchi Computer Science Department Stanford University jduchi@stanford.edu Benjamin Phipps Computer Science Department Stanford University bphipps@stanford.edu

More information

ON FINDING MELODIC LINES IN AUDIO RECORDINGS. Matija Marolt

ON FINDING MELODIC LINES IN AUDIO RECORDINGS. Matija Marolt ON FINDING MELODIC LINES IN AUDIO RECORDINGS Matija Marolt Faculty of Computer and Information Science University of Ljubljana, Slovenia matija.marolt@fri.uni-lj.si ABSTRACT The paper presents our approach

More information

Singer Traits Identification using Deep Neural Network

Singer Traits Identification using Deep Neural Network Singer Traits Identification using Deep Neural Network Zhengshan Shi Center for Computer Research in Music and Acoustics Stanford University kittyshi@stanford.edu Abstract The author investigates automatic

More information

Evaluating Melodic Encodings for Use in Cover Song Identification

Evaluating Melodic Encodings for Use in Cover Song Identification Evaluating Melodic Encodings for Use in Cover Song Identification David D. Wickland wickland@uoguelph.ca David A. Calvert dcalvert@uoguelph.ca James Harley jharley@uoguelph.ca ABSTRACT Cover song identification

More information

CALCULATING SIMILARITY OF FOLK SONG VARIANTS WITH MELODY-BASED FEATURES

CALCULATING SIMILARITY OF FOLK SONG VARIANTS WITH MELODY-BASED FEATURES CALCULATING SIMILARITY OF FOLK SONG VARIANTS WITH MELODY-BASED FEATURES Ciril Bohak, Matija Marolt Faculty of Computer and Information Science University of Ljubljana, Slovenia {ciril.bohak, matija.marolt}@fri.uni-lj.si

More information

Automatic Piano Music Transcription

Automatic Piano Music Transcription Automatic Piano Music Transcription Jianyu Fan Qiuhan Wang Xin Li Jianyu.Fan.Gr@dartmouth.edu Qiuhan.Wang.Gr@dartmouth.edu Xi.Li.Gr@dartmouth.edu 1. Introduction Writing down the score while listening

More information

Analysing Musical Pieces Using harmony-analyser.org Tools

Analysing Musical Pieces Using harmony-analyser.org Tools Analysing Musical Pieces Using harmony-analyser.org Tools Ladislav Maršík Dept. of Software Engineering, Faculty of Mathematics and Physics Charles University, Malostranské nám. 25, 118 00 Prague 1, Czech

More information

SINGING VOICE MELODY TRANSCRIPTION USING DEEP NEURAL NETWORKS

SINGING VOICE MELODY TRANSCRIPTION USING DEEP NEURAL NETWORKS SINGING VOICE MELODY TRANSCRIPTION USING DEEP NEURAL NETWORKS François Rigaud and Mathieu Radenen Audionamix R&D 7 quai de Valmy, 7 Paris, France .@audionamix.com ABSTRACT This paper

More information

Classification of Timbre Similarity

Classification of Timbre Similarity Classification of Timbre Similarity Corey Kereliuk McGill University March 15, 2007 1 / 16 1 Definition of Timbre What Timbre is Not What Timbre is A 2-dimensional Timbre Space 2 3 Considerations Common

More information

MELODY EXTRACTION BASED ON HARMONIC CODED STRUCTURE

MELODY EXTRACTION BASED ON HARMONIC CODED STRUCTURE 12th International Society for Music Information Retrieval Conference (ISMIR 2011) MELODY EXTRACTION BASED ON HARMONIC CODED STRUCTURE Sihyun Joo Sanghun Park Seokhwan Jo Chang D. Yoo Department of Electrical

More information

/$ IEEE

/$ IEEE 564 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 3, MARCH 2010 Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals Jean-Louis Durrieu,

More information

Music Database Retrieval Based on Spectral Similarity

Music Database Retrieval Based on Spectral Similarity Music Database Retrieval Based on Spectral Similarity Cheng Yang Department of Computer Science Stanford University yangc@cs.stanford.edu Abstract We present an efficient algorithm to retrieve similar

More information

Algorithms for melody search and transcription. Antti Laaksonen

Algorithms for melody search and transcription. Antti Laaksonen Department of Computer Science Series of Publications A Report A-2015-5 Algorithms for melody search and transcription Antti Laaksonen To be presented, with the permission of the Faculty of Science of

More information

Voice & Music Pattern Extraction: A Review

Voice & Music Pattern Extraction: A Review Voice & Music Pattern Extraction: A Review 1 Pooja Gautam 1 and B S Kaushik 2 Electronics & Telecommunication Department RCET, Bhilai, Bhilai (C.G.) India pooja0309pari@gmail.com 2 Electrical & Instrumentation

More information

Music Information Retrieval

Music Information Retrieval CTP 431 Music and Audio Computing Music Information Retrieval Graduate School of Culture Technology (GSCT) Juhan Nam 1 Introduction ü Instrument: Piano ü Composer: Chopin ü Key: E-minor ü Melody - ELO

More information

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

Music Emotion Recognition. Jaesung Lee. Chung-Ang University Music Emotion Recognition Jaesung Lee Chung-Ang University Introduction Searching Music in Music Information Retrieval Some information about target music is available Query by Text: Title, Artist, or

More information

Improving Frame Based Automatic Laughter Detection

Improving Frame Based Automatic Laughter Detection Improving Frame Based Automatic Laughter Detection Mary Knox EE225D Class Project knoxm@eecs.berkeley.edu December 13, 2007 Abstract Laughter recognition is an underexplored area of research. My goal for

More information

Music Information Retrieval

Music Information Retrieval Music Information Retrieval When Music Meets Computer Science Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Berlin MIR Meetup 20.03.2017 Meinard Müller

More information

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound Pitch Perception and Grouping HST.723 Neural Coding and Perception of Sound Pitch Perception. I. Pure Tones The pitch of a pure tone is strongly related to the tone s frequency, although there are small

More information

MUSIC CONTENT ANALYSIS : KEY, CHORD AND RHYTHM TRACKING IN ACOUSTIC SIGNALS

MUSIC CONTENT ANALYSIS : KEY, CHORD AND RHYTHM TRACKING IN ACOUSTIC SIGNALS MUSIC CONTENT ANALYSIS : KEY, CHORD AND RHYTHM TRACKING IN ACOUSTIC SIGNALS ARUN SHENOY KOTA (B.Eng.(Computer Science), Mangalore University, India) A THESIS SUBMITTED FOR THE DEGREE OF MASTER OF SCIENCE

More information

CSC475 Music Information Retrieval

CSC475 Music Information Retrieval CSC475 Music Information Retrieval Monophonic pitch extraction George Tzanetakis University of Victoria 2014 G. Tzanetakis 1 / 32 Table of Contents I 1 Motivation and Terminology 2 Psychacoustics 3 F0

More information

Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification

Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification 1138 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 16, NO. 6, AUGUST 2008 Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification Joan Serrà, Emilia Gómez,

More information

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST) Computational Models of Music Similarity 1 Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST) Abstract The perceived similarity of two pieces of music is multi-dimensional,

More information

A Note Based Query By Humming System using Convolutional Neural Network

A Note Based Query By Humming System using Convolutional Neural Network INTERSPEECH 2017 August 20 24, 2017, Stockholm, Sweden A Note Based Query By Humming System using Convolutional Neural Network Naziba Mostafa, Pascale Fung The Hong Kong University of Science and Technology

More information

Sparse Representation Classification-Based Automatic Chord Recognition For Noisy Music

Sparse Representation Classification-Based Automatic Chord Recognition For Noisy Music Journal of Information Hiding and Multimedia Signal Processing c 2018 ISSN 2073-4212 Ubiquitous International Volume 9, Number 2, March 2018 Sparse Representation Classification-Based Automatic Chord Recognition

More information

Addressing user satisfaction in melody extraction

Addressing user satisfaction in melody extraction Addressing user satisfaction in melody extraction Belén Nieto MASTER THESIS UPF / 2014 Master in Sound and Music Computing Master thesis supervisors: Emilia Gómez Julián Urbano Justin Salamon Department

More information

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900) Music Representations Lecture Music Processing Sheet Music (Image) CD / MP3 (Audio) MusicXML (Text) Beethoven, Bach, and Billions of Bytes New Alliances between Music and Computer Science Dance / Motion

More information

Automatic Rhythmic Notation from Single Voice Audio Sources

Automatic Rhythmic Notation from Single Voice Audio Sources Automatic Rhythmic Notation from Single Voice Audio Sources Jack O Reilly, Shashwat Udit Introduction In this project we used machine learning technique to make estimations of rhythmic notation of a sung

More information

Analysis of local and global timing and pitch change in ordinary

Analysis of local and global timing and pitch change in ordinary Alma Mater Studiorum University of Bologna, August -6 6 Analysis of local and global timing and pitch change in ordinary melodies Roger Watt Dept. of Psychology, University of Stirling, Scotland r.j.watt@stirling.ac.uk

More information

AN ACOUSTIC-PHONETIC APPROACH TO VOCAL MELODY EXTRACTION

AN ACOUSTIC-PHONETIC APPROACH TO VOCAL MELODY EXTRACTION 12th International Society for Music Information Retrieval Conference (ISMIR 2011) AN ACOUSTIC-PHONETIC APPROACH TO VOCAL MELODY EXTRACTION Yu-Ren Chien, 1,2 Hsin-Min Wang, 2 Shyh-Kang Jeng 1,3 1 Graduate

More information

Detecting Musical Key with Supervised Learning

Detecting Musical Key with Supervised Learning Detecting Musical Key with Supervised Learning Robert Mahieu Department of Electrical Engineering Stanford University rmahieu@stanford.edu Abstract This paper proposes and tests performance of two different

More information

Tempo and Beat Tracking

Tempo and Beat Tracking Tutorial Automatisierte Methoden der Musikverarbeitung 47. Jahrestagung der Gesellschaft für Informatik Tempo and Beat Tracking Meinard Müller, Christof Weiss, Stefan Balke International Audio Laboratories

More information

Outline. Why do we classify? Audio Classification

Outline. Why do we classify? Audio Classification Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification Implementation Future Work Why do we classify

More information

A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models

A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models A System for Automatic Chord Transcription from Audio Using Genre-Specific Hidden Markov Models Kyogu Lee Center for Computer Research in Music and Acoustics Stanford University, Stanford CA 94305, USA

More information

Melody Retrieval On The Web

Melody Retrieval On The Web Melody Retrieval On The Web Thesis proposal for the degree of Master of Science at the Massachusetts Institute of Technology M.I.T Media Laboratory Fall 2000 Thesis supervisor: Barry Vercoe Professor,

More information

Take a Break, Bach! Let Machine Learning Harmonize That Chorale For You. Chris Lewis Stanford University

Take a Break, Bach! Let Machine Learning Harmonize That Chorale For You. Chris Lewis Stanford University Take a Break, Bach! Let Machine Learning Harmonize That Chorale For You Chris Lewis Stanford University cmslewis@stanford.edu Abstract In this project, I explore the effectiveness of the Naive Bayes Classifier

More information

Studying the effects of bass estimation for chord segmentation in pop-rock music

Studying the effects of bass estimation for chord segmentation in pop-rock music Studying the effects of bass estimation for chord segmentation in pop-rock music Urbez Capablo Riazuelo MASTER THESIS UPF / 2014 Master in Sound and Music Computing Master thesis supervisor: Dr. Perfecto

More information

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM A QUER B EAMPLE MUSIC RETRIEVAL ALGORITHM H. HARB AND L. CHEN Maths-Info department, Ecole Centrale de Lyon. 36, av. Guy de Collongue, 69134, Ecully, France, EUROPE E-mail: {hadi.harb, liming.chen}@ec-lyon.fr

More information

Automatic Construction of Synthetic Musical Instruments and Performers

Automatic Construction of Synthetic Musical Instruments and Performers Ph.D. Thesis Proposal Automatic Construction of Synthetic Musical Instruments and Performers Ning Hu Carnegie Mellon University Thesis Committee Roger B. Dannenberg, Chair Michael S. Lewicki Richard M.

More information

Rhythm related MIR tasks

Rhythm related MIR tasks Rhythm related MIR tasks Ajay Srinivasamurthy 1, André Holzapfel 1 1 MTG, Universitat Pompeu Fabra, Barcelona, Spain 10 July, 2012 Srinivasamurthy et al. (UPF) MIR tasks 10 July, 2012 1 / 23 1 Rhythm 2

More information

Soundprism: An Online System for Score-Informed Source Separation of Music Audio Zhiyao Duan, Student Member, IEEE, and Bryan Pardo, Member, IEEE

Soundprism: An Online System for Score-Informed Source Separation of Music Audio Zhiyao Duan, Student Member, IEEE, and Bryan Pardo, Member, IEEE IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, VOL. 5, NO. 6, OCTOBER 2011 1205 Soundprism: An Online System for Score-Informed Source Separation of Music Audio Zhiyao Duan, Student Member, IEEE,

More information

Robert Alexandru Dobre, Cristian Negrescu

Robert Alexandru Dobre, Cristian Negrescu ECAI 2016 - International Conference 8th Edition Electronics, Computers and Artificial Intelligence 30 June -02 July, 2016, Ploiesti, ROMÂNIA Automatic Music Transcription Software Based on Constant Q

More information

WE ADDRESS the development of a novel computational

WE ADDRESS the development of a novel computational IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 3, MARCH 2010 663 Dynamic Spectral Envelope Modeling for Timbre Analysis of Musical Instrument Sounds Juan José Burred, Member,

More information

The song remains the same: identifying versions of the same piece using tonal descriptors

The song remains the same: identifying versions of the same piece using tonal descriptors The song remains the same: identifying versions of the same piece using tonal descriptors Emilia Gómez Music Technology Group, Universitat Pompeu Fabra Ocata, 83, Barcelona emilia.gomez@iua.upf.edu Abstract

More information

Week 14 Music Understanding and Classification

Week 14 Music Understanding and Classification Week 14 Music Understanding and Classification Roger B. Dannenberg Professor of Computer Science, Music & Art Overview n Music Style Classification n What s a classifier? n Naïve Bayesian Classifiers n

More information

Probabilist modeling of musical chord sequences for music analysis

Probabilist modeling of musical chord sequences for music analysis Probabilist modeling of musical chord sequences for music analysis Christophe Hauser January 29, 2009 1 INTRODUCTION Computer and network technologies have improved consequently over the last years. Technology

More information

Music Genre Classification and Variance Comparison on Number of Genres

Music Genre Classification and Variance Comparison on Number of Genres Music Genre Classification and Variance Comparison on Number of Genres Miguel Francisco, miguelf@stanford.edu Dong Myung Kim, dmk8265@stanford.edu 1 Abstract In this project we apply machine learning techniques

More information

REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation

REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 21, NO. 1, JANUARY 2013 73 REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation Zafar Rafii, Student

More information

CS229 Project Report Polyphonic Piano Transcription

CS229 Project Report Polyphonic Piano Transcription CS229 Project Report Polyphonic Piano Transcription Mohammad Sadegh Ebrahimi Stanford University Jean-Baptiste Boin Stanford University sadegh@stanford.edu jbboin@stanford.edu 1. Introduction In this project

More information

Proc. of NCC 2010, Chennai, India A Melody Detection User Interface for Polyphonic Music

Proc. of NCC 2010, Chennai, India A Melody Detection User Interface for Polyphonic Music A Melody Detection User Interface for Polyphonic Music Sachin Pant, Vishweshwara Rao, and Preeti Rao Department of Electrical Engineering Indian Institute of Technology Bombay, Mumbai 400076, India Email:

More information

Chord Classification of an Audio Signal using Artificial Neural Network

Chord Classification of an Audio Signal using Artificial Neural Network Chord Classification of an Audio Signal using Artificial Neural Network Ronesh Shrestha Student, Department of Electrical and Electronic Engineering, Kathmandu University, Dhulikhel, Nepal ---------------------------------------------------------------------***---------------------------------------------------------------------

More information

A Study of Synchronization of Audio Data with Symbolic Data. Music254 Project Report Spring 2007 SongHui Chon

A Study of Synchronization of Audio Data with Symbolic Data. Music254 Project Report Spring 2007 SongHui Chon A Study of Synchronization of Audio Data with Symbolic Data Music254 Project Report Spring 2007 SongHui Chon Abstract This paper provides an overview of the problem of audio and symbolic synchronization.

More information

Classification of Musical Instruments sounds by Using MFCC and Timbral Audio Descriptors

Classification of Musical Instruments sounds by Using MFCC and Timbral Audio Descriptors Classification of Musical Instruments sounds by Using MFCC and Timbral Audio Descriptors Priyanka S. Jadhav M.E. (Computer Engineering) G. H. Raisoni College of Engg. & Mgmt. Wagholi, Pune, India E-mail:

More information

Lecture 10 Harmonic/Percussive Separation

Lecture 10 Harmonic/Percussive Separation 10420CS 573100 音樂資訊檢索 Music Information Retrieval Lecture 10 Harmonic/Percussive Separation Yi-Hsuan Yang Ph.D. http://www.citi.sinica.edu.tw/pages/yang/ yang@citi.sinica.edu.tw Music & Audio Computing

More information