Classification of Different Indian Songs Based on Fractal Analysis

Similar documents
FRACTAL BEHAVIOUR ANALYSIS OF MUSICAL NOTES BASED ON DIFFERENT TIME OF RENDITION AND MOOD

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Automatic Music Clustering using Audio Attributes

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

A NON LINEAR APPROACH TOWARDS AUTOMATED EMOTION ANALYSIS IN HINDUSTANI MUSIC

About Giovanni De Poli. What is Model. Introduction. di Poli: Methodologies for Expressive Modeling of/for Music Performance

Analysis of local and global timing and pitch change in ordinary

Comparison Parameters and Speaker Similarity Coincidence Criteria:

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Pitch-Synchronous Spectrogram: Principles and Applications

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

Outline. Why do we classify? Audio Classification

Simple Harmonic Motion: What is a Sound Spectrum?

VCE VET MUSIC INDUSTRY: SOUND PRODUCTION

2. AN INTROSPECTION OF THE MORPHING PROCESS

Chord Classification of an Audio Signal using Artificial Neural Network

Predicting the immediate future with Recurrent Neural Networks: Pre-training and Applications

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC

A Matlab toolbox for. Characterisation Of Recorded Underwater Sound (CHORUS) USER S GUIDE

Hidden melody in music playing motion: Music recording using optical motion tracking system

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

Automatic Rhythmic Notation from Single Voice Audio Sources

Query By Humming: Finding Songs in a Polyphonic Database

Dither Explained. An explanation and proof of the benefit of dither. for the audio engineer. By Nika Aldrich. April 25, 2002

Measurement of overtone frequencies of a toy piano and perception of its pitch

Arts, Computers and Artificial Intelligence

EXPLORING THE USE OF ENF FOR MULTIMEDIA SYNCHRONIZATION

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

Subjective evaluation of common singing skills using the rank ordering method

CSC475 Music Information Retrieval

Chapter 1. Introduction to Digital Signal Processing

Automatic Construction of Synthetic Musical Instruments and Performers

Department of Electrical & Electronic Engineering Imperial College of Science, Technology and Medicine. Project: Real-Time Speech Enhancement

PulseCounter Neutron & Gamma Spectrometry Software Manual

Music Genre Classification and Variance Comparison on Number of Genres

Tempo and Beat Analysis

Noise. CHEM 411L Instrumental Analysis Laboratory Revision 2.0

Real-Time Spectrogram (RTS tm )

Voice & Music Pattern Extraction: A Review

Topics in Computer Music Instrument Identification. Ioanna Karydi

Music Radar: A Web-based Query by Humming System

Acoustic Measurements Using Common Computer Accessories: Do Try This at Home. Dale H. Litwhiler, Terrance D. Lovell

MIE 402: WORKSHOP ON DATA ACQUISITION AND SIGNAL PROCESSING Spring 2003

UNIVERSAL SPATIAL UP-SCALER WITH NONLINEAR EDGE ENHANCEMENT

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

Characterisation of the far field pattern for plastic optical fibres

Physical Modelling of Musical Instruments Using Digital Waveguides: History, Theory, Practice

Similarity matrix for musical themes identification considering sound s pitch and duration

EE373B Project Report Can we predict general public s response by studying published sales data? A Statistical and adaptive approach

THE EFFECT OF PERFORMANCE STAGES ON SUBWOOFER POLAR AND FREQUENCY RESPONSES

Topic 10. Multi-pitch Analysis

ACTIVE SOUND DESIGN: VACUUM CLEANER

MUSI-6201 Computational Music Analysis

Swept-tuned spectrum analyzer. Gianfranco Miele, Ph.D

Hidden Markov Model based dance recognition

Lecture 17 Microwave Tubes: Part I

Investigation of Aesthetic Quality of Product by Applying Golden Ratio

Fast Ethernet Consortium Clause 25 PMD-EEE Conformance Test Suite v1.1 Report

Automatic LP Digitalization Spring Group 6: Michael Sibley, Alexander Su, Daphne Tsatsoulis {msibley, ahs1,

jsymbolic 2: New Developments and Research Opportunities

On time: the influence of tempo, structure and style on the timing of grace notes in skilled musical performance

Speech and Speaker Recognition for the Command of an Industrial Robot

Implementation of a turbo codes test bed in the Simulink environment

Music Source Separation

Precise Digital Integration of Fast Analogue Signals using a 12-bit Oscilloscope

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 21, NO. 4, APRIL

International Journal of Computer Architecture and Mobility (ISSN ) Volume 1-Issue 7, May 2013

For the SIA. Applications of Propagation Delay & Skew tool. Introduction. Theory of Operation. Propagation Delay & Skew Tool

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

Quarterly Progress and Status Report. Musicians and nonmusicians sensitivity to differences in music performance

Proc. of NCC 2010, Chennai, India A Melody Detection User Interface for Polyphonic Music

Analyzing & Synthesizing Gamakas: a Step Towards Modeling Ragas in Carnatic Music

Application Note AN-708 Vibration Measurements with the Vibration Synchronization Module

Experiment PP-1: Electroencephalogram (EEG) Activity

Perceptual Evaluation of Automatically Extracted Musical Motives

Full Disclosure Monitoring

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Jazz Melody Generation and Recognition

Crash Course in Digital Signal Processing

A few white papers on various. Digital Signal Processing algorithms. used in the DAC501 / DAC502 units

Detection of Panoramic Takes in Soccer Videos Using Phase Correlation and Boosting

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

Analysis, Synthesis, and Perception of Musical Sounds

Sound Measurement. V2: 10 Nov 2011 WHITE PAPER. IMAGE PROCESSING TECHNIQUES

Take a Break, Bach! Let Machine Learning Harmonize That Chorale For You. Chris Lewis Stanford University

Automatic Piano Music Transcription

1 Introduction to PSQM

Work Package 9. Deliverable 32. Statistical Comparison of Islamic and Byzantine chant in the Worship Spaces

Removal of Decaying DC Component in Current Signal Using a ovel Estimation Algorithm

Adaptive Resampling - Transforming From the Time to the Angle Domain

Analysis of Different Pseudo Noise Sequences

Figure 2: Original and PAM modulated image. Figure 4: Original image.

A repetition-based framework for lyric alignment in popular songs

A TEXT RETRIEVAL APPROACH TO CONTENT-BASED AUDIO RETRIEVAL

Advance Certificate Course In Audio Mixing & Mastering.

Guide to Analysing Full Spectrum/Frequency Division Bat Calls with Audacity (v.2.0.5) by Thomas Foxley

Music Understanding and the Future of Music

THE DIGITAL DELAY ADVANTAGE A guide to using Digital Delays. Synchronize loudspeakers Eliminate comb filter distortion Align acoustic image.

DISTINGUISHING MUSICAL INSTRUMENT PLAYING STYLES WITH ACOUSTIC SIGNAL ANALYSES

Transcription:

Classification of Different Indian Songs Based on Fractal Analysis Atin Das Naktala High School, Kolkata 700047, India Pritha Das Department of Mathematics, Bengal Engineering and Science University, Shibpur, Howrah 711103, India In this paper some of the prevailing classifications of Indian songs are quantified by measuring their fractal dimension. Samples were collected from three categories: Classical, Semiclassical, and Light. After appropriate processing, the samples were converted into time series datasets and their fractal dimension was computed. Based on these results, an online method for classification is offered. 1. Introduction Even though mathematical analysis of music is not new there is no standard method and different analysts apply different tools. In this paper we try to apply nonlinear tools to quantify some of the prevailing classifications in Indian songs. Our method can be generalized and applied to other areas of music. We confine our discussion to music as a purely acoustic phenomenon. The basic point is that the way music is composed makes it fall into three categories: Classical, Semiclassical, and Light. This categorization is largely from popular perception and is not very rigorous in nature. A detailed study of the origin and classification of Indian music can be found in [1]. Here we consider only vocal performances; that is, songs, more particularly, the melodic lines from each of the categories. Classical songs are composed by strictly following a set of compositional rules that are of fundamental importance for this type of composition. This type of song is quite difficult to learn and is supposed to be the foundation for all other types of songs. The procedure results in a complex melodic line that is a characteristic for a variety of songs in this category. In Light music, in contrast, more stress is given to lyrics, the melodic line is less complex, and the song is sung more smoothly. Electronic mail address: dasatin@yahoo.co.in. ; 2005 Complex Systems Publications, Inc.

254 A. Das and P. Das Semiclassical songs lie in between. See [2, 3] for a brief discussion on Indian songs. See [4, 5] for the addresses of some Internet sites to listen to songs. Here, we shall attempt to find if the mentioned categories can be mathematically defined using fractal dimension (D) as a measure. Voss and Clark [6, 7] determined that music exhibits 1/f -power spectra at low frequencies. This fact allows us to consider music as a time series and analyze the fractal dimension of a particular piece of music. Bigerelle and Iost [8] found the global D to be an invariant for different types of music. In another work [9], D in the music of Mozart and Bach was calculated. Hsu and Hsu [10] discussed the application of D to music in detail and for a work of Bach, found D to be 2.418. We collected several data samples of each category as discussed in section 2. We discuss the tools in section 3. We then apply the tools in section 4 to calculate D of proposed excerpts and compare the result derived from examples of different types of melodic lines. Some figures are drawn for frequency analyses. Based on the results obtained, some conclusions are made in section 5. 2. Data We have selected three samples, each from a different category of song. The original soundtracks are in MP3 format and we extracted a roughly 11 second clip from each song. The selection had to contain the least amount of usual accompanying musical instruments since they are not a subject of our analysis and act as unwanted noise. Also, spectrogram analysis shows that even a highly trained voice produces several frequencies simultaneously due to the very nature of human voice production. For more details on this see, for example, [11]. We extracted the waveforms of the musical excerpts and converted them to ASCII data; that is, in text form to produce a data file for each song. While recording and converting audio files on a computer, we used the following parameters: single channel, 8 bit, and and a sampling rate of 11 kbps. These parameters were chosen in order to keep the data file size small (each file still has nearly 120000 data points). These data files are fed to a computer program that plays back the original soundtracks and makes a real-time calculation of D simultaneously. We may add that all of the musical excerpts, regardless of their complexity, were performed by great singers. Our selections were not intended to reflect any aspect of the performance of the songs in question, they only present a particular category of song. 3. Analysis of the data The repetition of frequency in the time scale may lead to some selfsimilarity in the time series plot. To investigate this property, standard

Classification of Different Indian Songs Based on Fractal Analysis 255 statistical tools are not sufficient and a more appropriate tool seems to be the fractal dimension. 3.1 Fractal dimension (D) D is a measure of the extent to which trajectories on the attractor fill a region in the phase space, a strange attractor has a noninteger dimension. There are at least five different well-established definitions of D, although their interrelations are by no means completely understood [12]. If we try to cover, suppose, a line segment with squares of some finite side, say R, then let N be the number of squares of that size required to cover the set. Now let us make the square small enough so that the curve (whose D is being estimated) is approximated well. If we plot in phase space, each point represents a state of the system. In the limiting case where scaling R makes each square contain approximately a single point, then N represents the number of states. So, for one-dimensional objects with a finite set, in the general form, we can write D lim log N. (1) R 0 log R Since the number of data points is so large for each sample, we used a real-time analysis program to find D which takes a chunk of 20000 points at a time from the input file as described in section 2. Details of D and the program used have been discussed in our earlier work [13]. We also used the software Dataplore 2.0-6 and Nlyzer 3.2. In this case, we have six D for each file and present the minimum (Min. D) and maximum (Max. D) of them for each sample. Details of the results are given in Table 1 and the values of parameters used are given in Table 2. The same parameter values are used to calculate D for each sample. 3.2 Frequency histograms Every song, being a musical sound, has its own harmonics and subharmonics repeated over time. So frequencies are repeated in each sample following the rules depending on which category of song the sample belongs to. To visualize this feature, we have drawn frequency histograms for each sample. 4. Numerical results From the results, we can see that Classical songs have higher values of maximum D (well above three) than Light songs (below three) while D from Semiclassical songs lie in between the other two types. This is well expected as D measures how kinky a curve is. On the other hand, a curve, which has several folds that are self-similar, has higher D. This fact is also evident in Figure 1 where we have plotted two time series,

256 A. Das and P. Das Figure 1. Comparing Classical (above) song sample Josraj to Light (below) song sample Hemanta to show the more complex nature of the former. Only 600 datapoints of each sample are plotted. Number Sample Maximum D Minimum D A. Light A.1 Hemanta 2.45 0.79 A.2 Nachiketa 1.83 0.53 A.3 Bhupen 0.81 0.51 B. Semiclassical B.1 Manna 2.90 1.63 B.2 Firoza 3.14 0.53 B.3 Sandhya 2.21 0.93 C. Classical C.1 Rashid 4.09 0.51 C.2 Josraj 3.14 0.66 C.3 Ajoy 3.46 1.88 Table 1. Sample name, type of sample, and maximum and minimum value of D calculated for each. Embedding Dimension Delay Reference Points Data Points 8 4 512 20000 Table 2. Parameter values (with usual meaning) used in calculating D.

Classification of Different Indian Songs Based on Fractal Analysis 257 Figure 2. Frequency histograms for all of the samples. The sample number shown in the right-hand corner of each box is taken from Table 1. The vertical scale for A.2, B.2, and C.2 is 1000 and for all others is 10000. one each from a Classical and a Light song. To have a closer look, we used only 600 points for each series. In a Classical song the singer stresses producing a wide range of frequencies or repetition of frequencies over a short period. Naturally, this produces a more fractal curve associated with higher D. Also, from the frequency histograms in Figure 2, one can see that the data range is much wider for Classical songs than the other two types. 5. Discussion The analysis presented here can be generalized to categorize different types of songs. Taking larger samples, in both number and time-scale, can give a more accurate analysis. We further propose to build up a fullfledged online system to classify Indian songs whose type is unknown. The scheme can be represented by Figure 3. For this purpose, huge computational resources are needed to handle more samples of longer duration. Samples can be chosen from playing a prerecorded song or directly from the recorder device. Samples would be filtered to remove sounds from accompanying musical instruments to get only the sound of the voice. In the present case this was done manually. For a large number of samples a more efficient vocal-extracting program

258 A. Das and P. Das Figure 3. Showing how an online Indian song categorization scheme can be implemented. is needed. applicable. But in any case, the method demonstrated here is quite References [1] J. L. Martinez, Semiotics and the Art Music of India, Music Theory Online, 6(1) (2000); http://www.societymusictheory.org/mto/issues/issues.html. [2] P. Abraham, Einstein Lived Here (Oxford University Press, New York, 1994). [3] Theoretical discussion on Indian music can be found at http://www.lsi.usp.br/usp/rod/sounds/indian/indian.html. [4] Listen to any kind of Indian music, Classical or Light at http://www.musicindiaonline.com or http://www.indianmelody.com. [5] Freely downloadable vocal and instrumental Classical Indian music clips are available at http://homepage.mac.com/patrickmoutal/macmoutal/rag.html. [6] R. F. Voss and J. Clarke, 1/f Noise in Music and Speech, Nature, 258 (1975) 317 318. [7] R. F. Voss and J. Clarke, 1/f Noise in Music: Music from 1/f Noise, Journal of Acoustical Society of America, 63 (1978) 258 263. [8] M. Bigerelle and A. Iost, Fractal Dimension and Classification of Music, Chaos, Solitons, and Fractals, 11 (2000) 2179 2192. [9] J. Hemenway, Fractal Dimensions in the Music of Mozart and Bach, The Nonlinear Journal, 2 (2000) 86 90. [10] K. J. Hsu and A. J. Hsu, Fractal Geometry of Music, Proceedings of The National Academy of Sciences USA, 87 (1990) 938 941.

Classification of Different Indian Songs Based on Fractal Analysis 259 [11] R. SenGupta, N. Dey, and D. Nag, Comparative Study of Fractal Behaviour in Quasi-random and Quasi-periodic Speech Wave Map, Fractal, 9(4) (2001) 403 414. [12] G. C. W. Sabin and D. Summers, Chaos in a Periodically Forced Predatorprey Ecosystem Model, Mathematical Bioscience, 113 (1993) 91 113. [13] A. Das, Pritha Das, and A. B. Roy, Nonlinear Data Analysis: A Comparison Between Experimental [EEG] Data and Theoretical [ANN] Data, Complexity, 7(3) (2002) 30 40.