Binning based algorithm for Pitch Detection in Hindustani Classical Music

Similar documents
Raga Identification by using Swara Intonation

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

Topic 4. Single Pitch Detection

CSC475 Music Information Retrieval

Categorization of ICMR Using Feature Extraction Strategy And MIR With Ensemble Learning

International Journal of Computer Architecture and Mobility (ISSN ) Volume 1-Issue 7, May 2013

Outline. Why do we classify? Audio Classification

Pitch Based Raag Identification from Monophonic Indian Classical Music

AUTOMATICALLY IDENTIFYING VOCAL EXPRESSIONS FOR MUSIC TRANSCRIPTION

Automatic Commercial Monitoring for TV Broadcasting Using Audio Fingerprinting

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Available online at ScienceDirect. Procedia Computer Science 46 (2015 )

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Voice & Music Pattern Extraction: A Review

SINGING PITCH EXTRACTION BY VOICE VIBRATO/TREMOLO ESTIMATION AND INSTRUMENT PARTIAL DELETION

DISTINGUISHING MUSICAL INSTRUMENT PLAYING STYLES WITH ACOUSTIC SIGNAL ANALYSES

Proc. of NCC 2010, Chennai, India A Melody Detection User Interface for Polyphonic Music

Efficient Vocal Melody Extraction from Polyphonic Music Signals

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

CS229 Project Report Polyphonic Piano Transcription

Query By Humming: Finding Songs in a Polyphonic Database

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

Study of White Gaussian Noise with Varying Signal to Noise Ratio in Speech Signal using Wavelet

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

Effects of acoustic degradations on cover song recognition

Robert Alexandru Dobre, Cristian Negrescu

Music Radar: A Web-based Query by Humming System

AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY (F0) MARKERS IN SPEECH

Automatic Music Clustering using Audio Attributes

ACCURATE ANALYSIS AND VISUAL FEEDBACK OF VIBRATO IN SINGING. University of Porto - Faculty of Engineering -DEEC Porto, Portugal

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

Automatic music transcription

Music Information Retrieval with Temporal Features and Timbre

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam

IMPROVED MELODIC SEQUENCE MATCHING FOR QUERY BASED SEARCHING IN INDIAN CLASSICAL MUSIC

Objective Assessment of Ornamentation in Indian Classical Singing

Topic 10. Multi-pitch Analysis

DATA COMPRESSION USING THE FFT

Automatic Rhythmic Notation from Single Voice Audio Sources

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Semi-automated extraction of expressive performance information from acoustic recordings of piano music. Andrew Earis

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

Article Music Melodic Pattern Detection with Pitch Estimation Algorithms

Automated extraction of motivic patterns and application to the analysis of Debussy s Syrinx

Classification of Musical Instruments sounds by Using MFCC and Timbral Audio Descriptors

Evaluating Melodic Encodings for Use in Cover Song Identification

Audio Compression Technology for Voice Transmission

Analyzing & Synthesizing Gamakas: a Step Towards Modeling Ragas in Carnatic Music

Speech and Speaker Recognition for the Command of an Industrial Robot

A prototype system for rule-based expressive modifications of audio recordings

Transcription of the Singing Melody in Polyphonic Music

Doubletalk Detection

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

THE importance of music content analysis for musical

Upgrading E-learning of basic measurement algorithms based on DSP and MATLAB Web Server. Milos Sedlacek 1, Ondrej Tomiska 2

Melody transcription for interactive applications

TANSEN: A QUERY-BY-HUMMING BASED MUSIC RETRIEVAL SYSTEM. M. Anand Raju, Bharat Sundaram* and Preeti Rao

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high.

HST 725 Music Perception & Cognition Assignment #1 =================================================================

Automatic Music Transcription: The Use of a. Fourier Transform to Analyze Waveform Data. Jake Shankman. Computer Systems Research TJHSST. Dr.

Transcription An Historical Overview

ADDING (INJECTING) NOISE TO IMPROVE RESULTS.

Automatic Piano Music Transcription

Topics in Computer Music Instrument Identification. Ioanna Karydi

Available online at International Journal of Current Research Vol. 9, Issue, 08, pp , August, 2017

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

Auto-Tune. Collection Editors: Navaneeth Ravindranath Tanner Songkakul Andrew Tam

Design of a Speaker Recognition Code using MATLAB

TERRESTRIAL broadcasting of digital television (DTV)

Music Similarity and Cover Song Identification: The Case of Jazz

LOUDNESS EFFECT OF THE DIFFERENT TONES ON THE TIMBRE SUBJECTIVE PERCEPTION EXPERIMENT OF ERHU

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC

MELODY EXTRACTION BASED ON HARMONIC CODED STRUCTURE

BBN ANG 141 Foundations of phonology Phonetics 3: Acoustic phonetics 1

Automatic characterization of ornamentation from bassoon recordings for expressive synthesis

Digital Signal. Continuous. Continuous. amplitude. amplitude. Discrete-time Signal. Analog Signal. Discrete. Continuous. time. time.

Music Database Retrieval Based on Spectral Similarity

A NEW LOOK AT FREQUENCY RESOLUTION IN POWER SPECTRAL DENSITY ESTIMATION. Sudeshna Pal, Soosan Beheshti

PHYSICS OF MUSIC. 1.) Charles Taylor, Exploring Music (Music Library ML3805 T )

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication

Pitch correction on the human voice

ON FINDING MELODIC LINES IN AUDIO RECORDINGS. Matija Marolt

Pattern Recognition in Music

Tempo and Beat Analysis

Reduction of Noise from Speech Signal using Haar and Biorthogonal Wavelet

AN ACOUSTIC-PHONETIC APPROACH TO VOCAL MELODY EXTRACTION

Pitch Detection/Tracking Strategy for Musical Recordings of Solo Bowed-String and Wind Instruments

Detection and demodulation of non-cooperative burst signal Feng Yue 1, Wu Guangzhi 1, Tao Min 1

MELODY EXTRACTION FROM POLYPHONIC AUDIO OF WESTERN OPERA: A METHOD BASED ON DETECTION OF THE SINGER S FORMANT

CZT vs FFT: Flexibility vs Speed. Abstract

A PERPLEXITY BASED COVER SONG MATCHING SYSTEM FOR SHORT LENGTH QUERIES

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

Chord Classification of an Audio Signal using Artificial Neural Network

Statistical Modeling and Retrieval of Polyphonic Music

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function

Swept-tuned spectrum analyzer. Gianfranco Miele, Ph.D

Author Index. Absolu, Brandt 165. Montecchio, Nicola 187 Mukherjee, Bhaswati 285 Müllensiefen, Daniel 365. Bay, Mert 93

Transcription:

1 Binning based algorithm for Pitch Detection in Hindustani Classical Music Malvika Singh, BTech 4 th year, DAIICT, 201401428@daiict.ac.in Abstract Speech coding forms a crucial element in speech communications. An important area concerning it lies in feature extraction which can be used for analyzing Hindustani Classical Music. An important feature in this respect is the fundamental frequency often referred to as the pitch. In this work, the terms pitch and its acoustical sensation, the frequency is used interchangeably. There exists numerous pitch detection algorithms which detect the main/ fundamental frequency in a given musical piece, but we have come up with a unique algorithm for pitch detection using the binning method as described in the paper using appropriate bin size. In particular, the paper [1] provides light on pitch identification for Hindustani Classical Music. Pitch Class Distribution has been employed in this work. It can be used to identify pitches in Hindustani Classical Music which is based on suitable intonations and swaras. It follows a particular ratio pattern which is a tuning for diatonic scale proposed by Ptolemy [8] and confirmed by Zarlino [9] is explored in this paper. We have also given our estimated of these ratios and compared the error with the above. The error produced by varying the bin size in our algorithm is investigated and an estimate for an appropriate bin size is suggested and tested. The binning algorithm thus helps to segregate the important pitches in a given musical piece. Index Terms pitch detection, raga, intonation, binning method. I. INTRODUCTION Raga is a fundamental idea central to Hindustani Classical Music. They express characteristic moods and express various emotions in the way they are sung and performed. A veteran Hindustani Classical Vocalist has the perfect acoustical sense of timing, duration and frequency of the swaras and a high tonal quality, even though this art form is not documented and has been transferred through generations by oral recitation. In work [8], the authors have exhaustively discussed throughout the length and breadth of the paper the pitch detection for Hindustani Classical Music. However, this method utilizes pitch extractor, Pitch-class Distributions (PCDs) and Pitchclass Dyad Distributions (PCDDs) which have been used as features for raga identification. Our present work is based on developing a binning based algorithm which helps to identify the important pitches in Hindustani Classical Music which can be used to find a few interesting songs among the millions available in music retrieval systems for Hindustani Classical Music. As a consequence of the easy access to music, the field of music information retrieval (MIR) has emerged. Various pitch detection algorithms exists like autocorrelation method [1], HPS [2], RAPT [3], AMDF method [4], CPD [5], SIFT [6], DFE [7]. Although the algorithms are effectively robust, yet their accuracy deteriorates as the noise component increases. Pitch detection algorithms can be classified into the following categories: 1) Time domain 2) Frequency domain 3) Time and frequency domain jointly In this paper, we discuss the method to identify the important pitches in Hindustani Classical Music by employing the binning method which helps in the process provided an accurate estimate of the bin size is made. It uses time and frequency domain jointly. II. MOTIVATION FOR DEVELOPMENT OF THE ALGORITHM As demonstrated in [1], the paper explores various pitch class distribution and intonations for Hindustani Classical Music. It employs raga identification and performs pitch extraction using polyphonic melody extractor and obtaining pitch contours. In our experiment, however, we use the binning method to extract the pitches important in Hindustani Classical Music. In the research work of [1], it has been noticed in various performances like those of Pandit Bhimsen Joshi, one of the very notable artistes, Hindustani Classical Music vocalists in general are meticulous about the particular position on the music scale in which they intone a specific swara within its pitch interval. They have also observed and verified that these positions on the music scale are such that their frequencies are in ratios of small integers. It is important that the artiste adjusts his tone and swara so that it is in consonance in a musical performance with respect to the sequence of the notes in a particular musical piece. Professional performers would closely stick to these ratios. In work of [9], the variation in the frequencies of each swara for many ragas has been shown. In our work, we have calculated the these frequency ratios and compared them to the already existent Ptolemy and Gioseffo Zarlinos ratios along with the errors as depicted in Table 1 on the last page. III. ALGORITHM The binning method focused on separating frequencies on the basis of how frequently they occurred during the music piece. This was done recursively by creating bins of appropriate sizes which then shortlisted the most frequently occurring frequency n the bin by comparison of frequencies within the bin. In this manner, frequencies were obtained locally which were then merged with similar frequencies to finally give the top ten frequencies. Here, care had to be taken so that we ignore very small frequencies outside the audible range even though they were the local maximums.

2 Finding the top 10 frequencies: In the first attempt, findpeaks method in MATLAB was used to find the peaks in the fft of the signal and the audio files. In this method, the peaks based on the amplitude magnitude were identified. This gave the frequencies which were very close,like, for example, for a given audio file, 168 and 168.2 Hz etc. which would practically be indistinguishable by the normal adult human ear. So, the need for an appropriate bin size was felt so as to group these closely related frequencies. Figure 1 shows the graph plotted based on the frequency distribution of the signal, and so the top ten peaks can be manually found by visual inspection by observing the plot on looking at the number of times a particular frequency appears in the whole audio recording. performed. Hence, we considered the komal rishabh and the ratio comes out to be 15/16 for it. In the above method, bins of an appropriate (described later), window size were made, and then, as soon as a frequency lying in that particular frequency appeared, we had to increment the bin count by 1 unit. The bin count was initially initialized with zeros. In this way we could find the top ten windows. Since the number of samples of the audio files was huge, in order to increase efficiency of the algorithm, we had to appropriately scale the signal. The bin size for classical Hindustani MUsic will depend on the frequency distribution and the ratios involved in this form of music. The following section describes as to why have we considered ratios and frequency distribution in order to figure out a good estimate of the bin size to be used in our algorithm. A. Bin Size Estimation The bin size needs to be chosen in accordance to the inclination of most of the frequencies in the top ten frequency spectrum obtained earlier. The factors will vary depending upon the position of the majority of the frequencies falling as per the spectrum. The music scale can have the following divisions as per Hindustani Classical Music. Figure below illustrates these ratios: Fig 1. Probability distribution of top 10 frequencies Algorithm 1 Binning algorithm 0: procedure BINMETHOD 0: 0: music arrayof audiodata fs samplingf requencyof audiodata 0: freq f f t(music) 0: binsize EstimateBinSize 0: cnt 0 0: distribution P robabilitydistribution(f req) 0: top: 0: for i 1tolength(distribution) do 0: bin[cnt] f req[i]tof req[i + binsize] 0: F indmaximum(bini) 0: if maximum(bini threshold) then 0: accept cnt cnt + 1 0: else 0: reject 0: end if 0: end for 0: end procedure=0 In the algorithm described above, bin size of 2Hz is taken into account. However, if the bin size is too small then the computation suffers and if it is too large then we miss out on the fundamental frequency. So a tradeoff needs to be Fig 1. Music Scale in Hindustani Classical Music with ratios Now, we performed the experiment by taking into consideration each of the ratio in the above music scale and used the bin size estimate as 15/16 * Maximum Frequency*factor chosen. We plotted the original spectrum as below and the drone frequency obtained by using the appropriate bin size(the factor which gave the best estimate) was the factor where maximum frequencies lay in the original spectrum. Our dataset included classical Hindustani music like raag behagra, Meera Kabir Bhajan, Raag Puria Kalyan etc. Here are the results of the samples and the percentage error of the drone frequency with the original frequency obtained by xcorr function in MATLAB:

3 Fig 5. Probability distribution of frequencies for sample3 Fig 2. Error Percentage for bin estimate factor which gives minimum error for drone frequency for 8 music samples As indicated by the plot above, the minimum error percentages are for those factors where the maximum number of frequency samples lie. The following 8 plots demostrate the frequency spectrums of which the error is plotted in order. Fig 6. Probability distribution of frequencies for sample4 Fig 3. Probability distribution of frequencies for sample1 Fig 7. Probability distribution of frequencies for sample5 Fig 4. Probability distribution of frequencies for sample2

4 Fig 8. Probability distribution of frequencies for sample6 Fig 9. Probability distribution of frequencies for sample7 Fig 10. Probability distribution of frequencies for sample8 I V. COMPARISON OF ERROR WITH DIFFERENT BIN SIZES The experiment was performed to evaluate the error in binning algorithm when different bin sizes were considered. The experiment was performed using the classical Hindustani ragas bhoopali, music on the harmonium, the sitar and the Meera-Kabir Bhajan as well as some noise-free sound signal with the following specifications generated in MATLAB: partial1 = cos(omega1*t + phi)*amplitude; %sinusoidal 1 partial2 = cos(omega2*t + phi)*amplitude; %sinusoidal 2 partial3 = cos(omega3*t + phi)*amplitude; %sinusoidal 3 signal = (partial1 + partial2 + partial3)/3; and varying frequencies between 100Hz to 500 Hz to generate different sounds. Figure 3 below shows the observation made in terms of the error percentage: Fig 11. Comparing error percentage by varying bin size for sounds generated in MATLAB As per the results obtained, the 2Hz bin size provides better performance than the 5Hz and 10Hz bins because the frequency taken into consideration is high, ranging from 100Hz to 500Hz. So the bin size has to be s mall so as to be more sensitive to frequency changes. V. CONCLUSION Thus, in this work, we introduced a new method for obtaining the pitch of a music sound of moderate frequencies using matlab generated sound signals. We further observed the effect of varying the bin size for these sound signals. In this process, we provided an estimate of an appropriate bin size. In addition to this, the error analysis shows the percentage error when bin size is varied for different sound signals. This helps in identifying the ragas in Hindustani Classical Music which is unique in its own way. It does not have accurate documented musical notes like in case of Western Music, rather every artiste adjusts the intonation slightly to suit his/her voice while adhering to the ratios, but via an acoustical feel and not a precise measurement. The work contributes to get a measurement for the fundamental frequency so that learning of this form of music can be learnt even by novice musicians by knowing the frequency they need to achieve along with the tacit ACKNOWLEDGMENT The authors would like to mention Miss Ahana Pradhan, PHD, IITBombay and Miss Sobha Singh, MTech IIT -Bombay for their invaluable suggestions in the process.

5 Musical Note Ptolemy s ratios O ur calculation of ratios Gioseffo Zarlinos ratios Diff.(Ptolemy& our ratios) Diff.(Zarlino&our ratio) C 1/1 1/1 1/1 0 0 C# 16/15 256/243 25/24 0.01316 0.01183 B 15/8 243/128 15/8 0.02343 0.02343 B# 9/5 9/5 9/5 0 0 E 5/4 81/64 5/4 0.01562 0.01562 F 4/3 4/3 4/3 0 0 F# 45/32 729/512 45/32 0.01757 0.01757 G 3/2 3/2 3/2 0 0 G# 8/5 128/81 25/16 0.01975 0.01774 A 5/3 27/16 5/3 0.02083 0.02083 A# 9/5 16/9 16/9 0.02222 0.02222 C 2/1 2/1 2/1 0 0 TABLE I THE RATIOS FOR DIFFERENT TONES FROM PTOLEMY S WORK, GIOSEFFO ZARLINO S WORK AND OUR CALCULATIONS REFERENCES [1] Shreyas Belle, Rushikesh Joshi, and Preeti Rao. Raga identification by using swara intonation. Journal of ITC Sangeet Research Academy, 23, 2009. [2] Ahmet M Kondoz. Digital speech: coding for low bit rate communication systems. John Wiley & Sons, 2005. [3] Wolfgang Hess. Pitch determination of speech signals: algorithms and devices, volume 3. Springer Science & Business Media, 2012. [4] David Talkin. A robust algorithm for pitch tracking (rapt). Speech coding and synthesis, 495:518, 1995. [5] M Ross, H Shaffer, Andrew Cohen, Richard Freudberg, and H Manley. Average magnitude difference function pitch extractor. IEEE Transactions on Acoustics, Speech, and Signal Processing, 22(5):353 362, 1974. [6] Lawrence Rabiner, Md Cheng, A Rosenberg, and C McGonegal. A comparative performance study of several pitch detection algorithms. IEEE Transactions on Acoustics, Speech, and Signal Processing, 24(5):399 418, 1976. [7] Hyn ek Bo řil and P etr Pollá k. Di r ect time do main fund a m ent al fr equ en cy estimation of speech in noisy conditions. In Signal Processing Conference, 2004 12th European, pages 1003 1006. IEEE, 2004. [8] DONALD McLEAN. The science behind europe s music scale. Interdisciplinary Science Reviews, 26(3):211 221, 2001. [9] GD Halsey and Edwin Hewitt. More on the superparticular ratios in music. The American Mathematical Monthly, 79(10):1096 1100, 1972.