Rhythmic Structure based segmenta3on for Hindustani music

Similar documents
Tempo and Beat Analysis

Musicological perspective. Martin Clayton

STRUCTURAL SEGMENTATION AND VISUALIZATION OF SITAR AND SAROD CONCERT AUDIO

Hindustani Music: Appreciating its grandeur. Dr. Lakshmi Sreeram

IMPROVED MELODIC SEQUENCE MATCHING FOR QUERY BASED SEARCHING IN INDIAN CLASSICAL MUSIC

Rhythm related MIR tasks

IndianRaga Certification

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

Automatic Labelling of tabla signals

AUTOMATICALLY IDENTIFYING VOCAL EXPRESSIONS FOR MUSIC TRANSCRIPTION

A BEAT TRACKING APPROACH TO COMPLETE DESCRIPTION OF RHYTHM IN INDIAN CLASSICAL MUSIC

Automatic Notes Generation for Musical Instrument Tabla

TOWARDS IMPROVING ONSET DETECTION ACCURACY IN NON- PERCUSSIVE SOUNDS USING MULTIMODAL FUSION

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Methods for the automatic structural analysis of music. Jordan B. L. Smith CIRMMT Workshop on Structural Analysis of Music 26 March 2010

A MID-LEVEL REPRESENTATION FOR CAPTURING DOMINANT TEMPO AND PULSE INFORMATION IN MUSIC RECORDINGS

CS 591 S1 Computational Audio

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Automatic Piano Music Transcription

Tempo and Beat Tracking

HUMAN PERCEPTION AND COMPUTER EXTRACTION OF MUSICAL BEAT STRENGTH

HINDUSTANI MUSIC VOCAL (Code 034) Examination Structure for Assessment Class IX

Raga Identification by using Swara Intonation

Unit 5. Lesson 2 By Sudha Sahgal

MITOCW watch?v=u9givsc2m8y

MUSI-6201 Computational Music Analysis

Musicians Adjustment of Performance to Room Acoustics, Part III: Understanding the Variations in Musical Expressions

Automatic music transcription

6.5 Percussion scalograms and musical rhythm

The purpose of this essay is to impart a basic vocabulary that you and your fellow

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

MidiFind: Fast and Effec/ve Similarity Searching in Large MIDI Databases

Robert Alexandru Dobre, Cristian Negrescu

Music Segmentation Using Markov Chain Methods

An Examination of Foote s Self-Similarity Method

Automatic Rhythmic Notation from Single Voice Audio Sources

DISTINGUISHING MUSICAL INSTRUMENT PLAYING STYLES WITH ACOUSTIC SIGNAL ANALYSES

PULSE-DEPENDENT ANALYSES OF PERCUSSIVE MUSIC

MODAL ANALYSIS AND TRANSCRIPTION OF STROKES OF THE MRIDANGAM USING NON-NEGATIVE MATRIX FACTORIZATION

DISTINGUISHING RAGA-SPECIFIC INTONATION OF PHRASES WITH AUDIO ANALYSIS

The best selling band in history they have sold over 600 million records! All in only 10 years!

2 2. Melody description The MPEG-7 standard distinguishes three types of attributes related to melody: the fundamental frequency LLD associated to a t

EFFICIENT MELODIC QUERY BASED AUDIO SEARCH FOR HINDUSTANI VOCAL COMPOSITIONS

Music Structure Analysis

However, in studies of expressive timing, the aim is to investigate production rather than perception of timing, that is, independently of the listene

STRUCTURAL CHANGE ON MULTIPLE TIME SCALES AS A CORRELATE OF MUSICAL COMPLEXITY

Onset Detection and Music Transcription for the Irish Tin Whistle

University of Mauritius. Mahatma Gandhi Institute

Music Radar: A Web-based Query by Humming System

The authors objective has been to provide every material support to the teacher, to leave you free to teach your subject.

Music Representations. Beethoven, Bach, and Billions of Bytes. Music. Research Goals. Piano Roll Representation. Player Piano (1900)

Statistical Modeling and Retrieval of Polyphonic Music

Listening: choose the best answer and circle the letter.

AUTOREGRESSIVE MFCC MODELS FOR GENRE CLASSIFICATION IMPROVED BY HARMONIC-PERCUSSION SEPARATION

10 Visualization of Tonal Content in the Symbolic and Audio Domains

The song remains the same: identifying versions of the same piece using tonal descriptors

Estimating the Time to Reach a Target Frequency in Singing

Karnataka state women s university. Bijapur. Hindustani Music Syllabus. UG semester. I to VI. Hindustani Music syllabus.

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function

ON FINDING MELODIC LINES IN AUDIO RECORDINGS. Matija Marolt

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

UNIVERSITY OF MAURITIUS MAHATMA GANDHI INSTITUTE

CLASSIFICATION OF INDIAN CLASSICAL VOCAL STYLES FROM MELODIC CONTOURS

Appendix A Types of Recorded Chords

The Intervalgram: An Audio Feature for Large-scale Melody Recognition

Music. Curriculum Glance Cards

Transcription of the Singing Melody in Polyphonic Music

SINGING PITCH EXTRACTION BY VOICE VIBRATO/TREMOLO ESTIMATION AND INSTRUMENT PARTIAL DELETION

Research Article Multiple Scale Music Segmentation Using Rhythm, Timbre, and Harmony

MUSIC is a ubiquitous and vital part of the lives of billions

Rhythm analysis. Martin Clayton, Barış Bozkurt

Week 14 Query-by-Humming and Music Fingerprinting. Roger B. Dannenberg Professor of Computer Science, Art and Music Carnegie Mellon University

Automatic characterization of ornamentation from bassoon recordings for expressive synthesis

Citation for published version (APA): Jensen, K. K. (2005). A Causal Rhythm Grouping. Lecture Notes in Computer Science, 3310,

Music Mood Classification - an SVM based approach. Sebastian Napiorkowski

Timing In Expressive Performance

Choir Scope and Sequence Grade 6-12

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Categorization of ICMR Using Feature Extraction Strategy And MIR With Ensemble Learning

An honourable title conferred on an eminent musician. dlineation, elaboration, unfolding of the Raga.

Proc. of NCC 2010, Chennai, India A Melody Detection User Interface for Polyphonic Music

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

Measurement of overtone frequencies of a toy piano and perception of its pitch

A prototype system for rule-based expressive modifications of audio recordings

TOWARDS AUTOMATED EXTRACTION OF TEMPO PARAMETERS FROM EXPRESSIVE MUSIC RECORDINGS

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

International Journal of Computer Architecture and Mobility (ISSN ) Volume 1-Issue 7, May 2013

Audio Structure Analysis

La Salle University. I. Listening Answer the following questions about the various works we have listened to in the course so far.

Cambridge International Examinations Cambridge International General Certificate of Secondary Education. Published

Curriculum Standard One: The student will listen to and analyze music critically, using the vocabulary and language of music.

Outline. Why do we classify? Audio Classification

Music Information Retrieval

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

Query By Humming: Finding Songs in a Polyphonic Database

TANSEN: A QUERY-BY-HUMMING BASED MUSIC RETRIEVAL SYSTEM. M. Anand Raju, Bharat Sundaram* and Preeti Rao

Subjective Similarity of Music: Data Collection for Individuality Analysis

Audio Structure Analysis

B.A. General Music V & VI Semesters w.e.f

Transcription:

Rhythmic Structure based segmenta3on for Hindustani music T.P.Vinutha Prof. Preeti Rao Department of Electrical Engineering Indian Institute of Technology Bombay

OUTLINE Introduc0on Hindustani Music concert Bases of segmenta0on, our approach Problem addressed, mo0va0on Rhythm Percussion rhythmic structure as cue Implementa0on Database Audio segmenta0on system Conclusion & Future scope References

INTRODUCTION Hindustani music concert is highly structured polyphonic music, with percussive and melodic accompaniments Khyal style of Hindustani music is a predominantly improvised music tradi0on opera0ng within a well- defined raga (melodic) and tala (rhythmic) framework Raag Deshkar, PiyaJaag

Bases for segmenta0on Musical Structure Repe00ons, contrast and homogeneity of musical aspects such as Melody Steady in Alap and rendered around sa, few gamaks and spread throughout the octave in sthayi, traversing up to upper octave in Antara, more dynamic in Tan and Sargam Timbre Instrument change or change in singer Vowels are used in Alap, Aakar vistar and Tans, but consonants in Bol- alap, Sargam sec0ons Rhythm Rhythm refers to all aspects of musical 0me parerns the inherent tempo of the melodic piece, the way syllables of the lyrics are sung or the way strokes in instruments are played Rhythm is used as cue for segmentabon As it is more explicitly changing in most of the secbons

Problem addressed, Mo0va0on Problem addressed Automa0cally locate the points of significant change in the rhythmic structure of the Hindustani music concert by analyzing the local self- similarity of a rhythm representa0on computed with a sliding window over the audio recording Mo0va0on Efficient naviga0on of audio recordings to different sec0ons

Different temporal scales of segmenta0on Khyal vocal concert Major segments are, Alap, Bada khyal and Chota khyal Bada khayal or chota khyal can be further segmented into Tabla solo Sthayi, Antara, Vistar, Bol- alap, Sargam, Tan, Bol- baat sec0ons Major segments are, Peshkar, Kaidas, Gats, Relas and Rouns Each of these can have slow and fast sec0ons within it

RHYTHM OR LAYA Rhythmic parern represents the periodici0es of events in the audio and their rela0ve strengths These events could be percussion strokes, syllable onsets in lyrics, note onsets of melodic instruments Percussion rhythmic structure is taken as cue for segmenta3on Hindustani concerts use tabla as an accompaniment an explicit representa0on of the rhythm is available in terms of the parern and 0ming of tabla strokes. Eg., no tabla in alap, tempo is either slow or medium in bada khyal sec0on and increases dras0cally in chota khyal sec0on

Tal in Hindustani music Tal Is the cyclically recurring metrical parern of fixed length imposed by the accompanying percussion instrument Theka is the defined structure of a tal represen0ng par0cular parern of strokes (iden0fied by bols or names) The Theka for Tintal is, x 2 0 3 dha dhin dhin dha dha dhin dhin dha dha 3n 3n ta ta dhin dhin dha..................... Three metrical levels of tal Matra equivalent to beat Vibhag Sec3on or measure consis3ng of 2or5 matras Avart Tal cycle, theka is repeated in each avart Matra Vibhag Avart In a performance, the tabla player will deviate from the theka

Prakars of Kaherva tal Prakar is an altera0on of the basic form of tala Baisc theka of Kaherva tal Prakar1 Prakar2 2 pause bols Prakar3 Matra 2 with 2bols Prakar4 Matra 2 & 6 with 2bols X 0 1 2 3 4 5 6 7 8 dhā ge na ti na ka dhin na X 0 1 2 3 4 5 6 7 8 dhā ti ti Tā tā dhin dhin dhā X 0 1 2 3 4 5 6 7 8 gha ti Tā ga dhin nā X 0 1 2 3 4 5 6 7 8 dhā ti ra ki ti tā ge dhin nā X 0 1 2 3 4 5 6 7 8 dhā ti ra ki ti tā ti ra ghi dhin

Laya Rela0onship of Laya with tempo Tempo Represents the rate of succession of one of the pulse rates, selected according to the human percep0on. Expressed in beats per minute (bpm) Tempo in Hindustani music Tempo (bpm) A0 Vilambit Very slow 8 Vilambit Slow 4 MadhyaVilambit Medium slow Madhya Medium 2 MadhyaDrut Medium fast Drut fast 1 Vibhag dura3on (in sec) 3 1.5 A0Drut Very fast 0.75 GoRlieb, Robert S. Solo tabla drumming of North India: its repertoire, styles, and performance pracbces, Mo0lal Banarsidass Publishers, 1993. Indian music has tradi0onally three main tempos or laya» vilambit (slow)» madhya (medium)» drut (fast) matra rate defines the tempo in madhya laya Surface Rhythm In a performance, pulse rate deviates from madhya laya the listener perceives the surface rhythm but within the context of tal framework

Rhythm structure change Tala change In some concerts, bada khyal sec0on will be in a par0cular tal, while chota khyal sec0on will be in different tal Eg., In raag Marwa performance by RK, tal changes from Ektal to Tintal Tempo change In a performance, bada khyal sec0on will be presented in either madhaya laya or vilambit laya and chota khyal in drut laya. ImprovisaBon Increasing the rhythmic density Replacing some bols in theka by other bols Introducing pause bols

Audio signal characteris0cs Wave form, Spectrogram features Cycle length Tabla strokes Onsets are wide band, transient events represen0ng the loca0on of tabla stroke. Rhythmic structure from the audio can be derived by the panern of derived onsets

IMPLEMENTATION Database Prakaras of Kaherva tal (used for bhajans) To do signal dependent parameter selec0on and to have controlled variability in rhythm Tabla solo performances Having lead instrument as tabla having highest rhythmic improvisa0on Khyal Vocal Concerts in Tintal and Ektal Complexity due to Polyphonic audio ensuring that the performances are rendered in different laya Ground truth segment boundaries in these performances have been marked in PRAAT with careful hearing

Audio Segmenta0on system

General scheme suggested by Bello [7] Audio Feature Extrac0on Detec0on Func0on Post processing Onsets Or Novelty func0on Onsets detec0on Acous0c features (suggested by Bello[7]) Temporal Features Full band energy Sub band energy Spectral features (preferred by Dixon[8]) Sub band spectral amplitude Spectral amplitude Detec0on Func0on Deriva0ve Smoothed Differen3ator (suggested by Hermes[7]) Post Processing Normaliza3on Thresholding

Detec0on func0on As a deriva0ve of adjacent frames SF(n) represents the rec0fied spectral flux that is summed for all the bins. SF( n) = K H k = 1 [( X ( n, k) X ( n 1, k) ) W( k) ] As a bi- phasic func0on involves mul0- frame smoothing & differencing is less suscep0ble to rapid local fluctua0ons

Bi- phasic func0on as smoothed differen0ator Hermes[15] suggested the parameters of bi- phasic func3on as τ 1 =0.015s; τ 2 =0.02s; d 1 =0.025s; d 2 = 0.05s to simulate short term adapta0on characteris0c of human ear That emphasizes recent inputs while masking rapid modula0ons (τ 2 > τ 1 ) Parameters based on acous3cs of tabla strokes τ 1 =0.02s; τ 2 =0.0333s; d 1 =0.0289s; d 2 = 0.0067s Stroke dura0ons are ranging from 400ms for tonal strokes to 30ms for impulsive strokes. P( t) = τ 1 exp 2π τ 2 1 ( t + d2) exp 2 2π τ 2 2 ( t d1) 2 1 2τ 1 2 2

Post - Processing Normaliza0on by subtrac0ng the mean so that the average will be zero dividing by the max absolute devia0on, so that the func0on will be in the interval [- 1,1]. Fixed threshold method peaks where the detec0on func0on exceed the threshold δ(n), a posi0ve constant are considered as onsets to separate the event- related and non event- related onsets AdapBve threshold method based on the local mean is implemented as, ~ δ [n] = δ + mean( d(n - M),, d(n + M) )

Audio Segmenta0on system

Compu3ng rhythmic representa3on Autocorrela0on method avoids explicit extrac0on of onsets by comparing the novelty curve with the 0me shined copies of it to analyse the periodicity in the rhythmic parern Here, autocorrela0on of the spectral flux feature is computed. Rhythmic feature of block n can be expressed as, r n ( k) = N 1 k [ m= 0 sf ( n + m) w ' ( m)][ sf ( n + m + k) w ( k + m)] where, n is the block index, N is the block size and k is the lag. Only the informa0on between zero and 3s lag is retained as rhythmic feature. '

Acous0c descrip0on of Segments with devia0on from basic structure of tal Few cycles of 3 prakaras are concatenated and the resul0ng audio is analyzed. Here, prakaras are of nearly same tempo but of different rhythmic pakern. The cycle length of each prakara is 4.027s, 3.98s and 3.99s respec0vely. Prakar1 & Prakar2 boundary

Novelty func3on and ACF Segments with devia3on from basic structure of tal at Pr1 and Pr2 boundary & Pr2 and Pr3 boundary 4s of rhythmic window, hop 2s 3me in sec 3me- lag in sec

16s of rhythmic window, hop 0.5s Rhythmogram Prakar1, Prakar2, Prakar3 concatenated A two dimensional 3me- pulse representa3on with lag- 3me on y- axis, 3me posi3on on the x- axis and the autocorrela3on values visualized as intensity, that displays the Progression of rhythm with 3me Pr1 Pr2 Pr3 Bright peaks are at the interval of 0.5s, represen3ng the Inter Stroke Interval (ISI) Pr1, Pr2, Pr3 are of same tempo, hence the similarity in the interval between bright peaks Due to pause bols, acf peaks of Pr2 are of reduced strength Increased surface rhythm in prakar3 is due to the half beats at matra2 of the cycle. This has contributed to feeble peaks at the mul3ples of 0.25s in the rhythmogram

Audio Segmenta0on system

Loca0ng boundaries Self similarity matrix Each rhythmic frame can be compared with all other frames in a pair wise fashion to compute the Self Distance matrix(sdm) D(i,j)=d(x i,x j ) for i,j є {1, 2..N}, where the distance func0on d specifies the correla0on distance between two feature vectors x i and x j represented as, d cor = 1 ( x x )( x x ) j ( x x )( x x ) ' ( x x )( ) ' j x x j i i i i i i j j t j

Self Distance matrix at the boundary of pr1 - pr2 and pr2 - pr3 Pr1 boundary1 Pr2 Boundary2 Pr3 Pr1 Frame 1 2 3 4 5 6 7 8 9 10 1 0.0000 0.0984 0.2622 0.1840 0.1461 0.1824 0.1469 0.2125 0.0373 0.0304 boundary1 2 0.0984 0.0000 0.1613 0.1414 0.1554 0.1407 0.1565 0.1022 0.0782 0.1005 3 0.2622 0.1613 0.0000 0.1056 0.0834 0.1052 0.0834 0.1945 0.2857 0.2808 4 0.1840 0.1414 0.1056 0.0000 0.0435 0.0000 0.0437 0.1191 0.1602 0.1549 Pr2 5 0.1461 0.1554 0.0834 0.0435 0.0000 0.0431 0.0000 0.1565 0.1707 0.1520 6 0.1824 0.1407 0.1052 0.0000 0.0431 0.0000 0.0433 0.1190 0.1593 0.1542 boundary2 7 0.1469 0.1565 0.0834 0.0437 0.0000 0.0433 0.0000 0.1574 0.1719 0.1532 8 0.2125 0.1022 0.1945 0.1191 0.1565 0.1190 0.1574 0.0000 0.1432 0.1746 Pr3 9 0.0373 0.0782 0.2857 0.1602 0.1707 0.1593 0.1719 0.1432 0.0000 0.0137 10 0.0304 0.1005 0.2808 0.1549 0.1520 0.1542 0.1532 0.1746 0.0137 0.0000 Boundary1 is appearing in fr2. Fr3 is having 2 pause bols and hence the distance between fr1 (which is in Pr1) and fr3 (Which is in Pr2) is more. Boundary2 is appearing in fr8.

Similarity Matrix 16s of rhythmic window, hop 0.5s Audio of 36 cycles of Pr1, Pr2 and Pr3 Cycle length in each Prakar is about 4s Choice of window length : larger than the expected periodicity (suggested as 4 0mes the periodicity in [18]) to capture tactus (matra) periodicity to capture measure (vibhag) periodicity to capture periodicity pa@erns in a tala cycle Similarity matrix

Loca0ng boundaries Novelty Score Structure of the Self Distance matrix, D is an indica0on of the novelty measure that will have high values at the segment boundaries Correla3ng the checker board kernel C along the diagonal of the matrix D will yield the measure of novelty, novelty score. A simple 4 4 checker board parern represented by the kernel matrix C 4 with kernel length 4 as, C 4 1 1 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Larger kernels are formed by kronecker product of C with the matrix of ones Extrema in the novelty score correspond to large changes in the rhythmic structure and thus indicate the boundaries

Loca0ng the boundaries Audio of 36 cycles of Pr1, Pr2 and Pr3 Novelty Score indica0ng the boundaries Got by correla0ng the Kernel of width 16 along the diagonal of the matrix The width of the checker board kernel decides the scale of the novelty measure Similarity matrix and Novelty score

Repe33ve structure of Prakars Audio of 36 cycles of Pr1, Pr2 and Pr3 repeated Pr1 Pr2 Pr3 Pr4 Pr1 Pr2 Pr3 Pr4 Pr1 Pr2 Pr3 Pr4 Pr1 Pr2 Pr3 Pr4 Prakar compared with repeated Prakar is represented by dark region

Tabla Solo performance Tabla Melodic accompaniment

Segments in a tabla solo performance Segments usually appearing in a tabla solo performance are, Peshkar: Introductory sec0on like alap Kaida: The overall structure of a kaida can be divided into three sec0ons, an opening theme, a series of varia0ons based on the opening theme and a concluding Bhai In these secbons the tempo will be maintained same even though rhythmic density will increase amer the opening of kaida. Gat: very dis0nct composi0onal structure, which is pre- composed Tempo of the performance increases from this secbon From here, it is drut laya Rela is a fast and flowing composi0onal form Raun is a fast composi0onal form characterized by droning effect.

Segment (dura3on) Peshkar (8min) Slow Fast Kaidas Kaida1 S (4.1min) F Kaida2 S (2.15min) F Kaida3 S (2.23min) F Kaida4 S (3.6min) F Acous0c descrip0ons of Tabla solo by Zakir Hussain Dura3on of one cycle 6.8min 26s to 24.6s 1.2min Strokes per cycle 2.7min 1.4min 0.83min 28.3s to 24.9s 27.3s to 25.2s 128 strokes in slower sec3on and doubled in faster sec3on 1.72min 1.37min 28.6s to 25.3s 0.86min 2.2min 1.4min 21.7s to 20.9s 96 strokes in slower sec3on and increased in mul3ples of 1.5, 3 6 and 8 3mes in faster sec3ons Remarks No change in tempo Kaida5 (7.2min) 50.6s 16s to 15s Tempo increased 6.3min slightly Gat 1(2.2min) 4s Tempo has Rela (1.08min) 4s increased much. Gat2 3.7s A3- drut sec3on Gat3 2.98s

Kaida : Theme & Varia0on Kaida Rules: The main focus during a kaida is the thema3c development that is achieved through a series of varia3ons Kaida1 theme 1 2 3 1 2 3 1 2 3 4 1 2 3 4 5 6 Dha Ti Ta Dha Ti Ta Dha Dha Ti Ta Dha Ge Ti Na Ge Na Kaida1 varia3on 1 2 3 Dha Ti Ta 1 2 3 4 Dha DhaTi Ta 1 2 3 Dha Ti Ta 1 2 3 4 5 6 Dha Ge Ti Na Ge Na One must use only the bols in the original theme of the kaida in varia3ons Kaida2 theme 1 2 Dha Ti 1 2 Dha Ge 1 2 Na Dha 1 2 Tire kita 1 2 Dha Ti 1 2 Dha Ge 1 2 Ti Na 1 2 Ge Na Theme of Kaida2 cons3tuted through bols and grouping of bols is different from that of kaida1

Spectrogram of Kaida1 slower sec0on In slower sec3ons of kaida1, stroke density is 128 strokes per cycle with IOI of 0.22s In faster sec3ons of kaida1, kaida2 & kaida3, strokes density has doubled with IOI of 0.1s. 35

Spectrogram of Kaida4 slower sec0on Dedichand kaida- indicates 3 matras in 2 beats Stroke density of 96 strokes/cycle in the slower sec3on with the IOI of 0.23s 36

Rhythmogram Tabla solo by ZH, KolkaRa (Kaidas sec0on, 13 min) Kaida1 Kaida2 Kaida3 S F S F S F Kaida4 S F 37

Ideal Similarity matrix tabla solo by ZH,KolkaRa (Kaidas sec0on, 13 min) K1 K2 K3 K4 Peshkar

Similarity Matrix & Novelty Score Tabla solo by ZH,KolkaRa (Kaidas sec0on, 13 min) Rhythmic window of 28s & hop of 0.5s is used Dissimilarity of faster sec3on of kaida with that of slower sec3on, due to difference in surface rhythm (higher rhythmic density) K1 K2 K3 K4 S F S F S F S F

Segments in a Khyal Vocal Concert Khyal vocal performance is a polyphonic audio with simultaneous presence of many instruments with voice Aalap: Unmetered melodic improvisabon (no tabla). Bada Khyal : Will be rendered either in vilambit or madhya laya. Sthayi: rendered in lower octave Vistar Aakar Vistar Bol- Vistar Antara: Rendered in upper octave Bol- baat: Importance is there for lyrics Sargam: Swaras are used, matras in lyrics will increase. So, fillers appear in percussion Tan: Basic theka will be played by tabla. Dynamic in melodic aspect Aakar- tan: Intensity will display great regularity Bol- tan: Lyrics of composi0on will be used in improvisa0on. Miscellaneous Chota Khyal : Drut laya or a0- drut laya. SomeBmes this may be in different tal. Will have subsecbons of sthayi, antara, vistar, tan and sargam

Database Raga Vocalist Bada khyal Chota Khyal Bhupali Rashid Khan (RK) Bandish Tal Karo ge tum paar Tintal Laya (Dur of 1 cycle) Madhya (10.2s) Bandish Tal Laya (Dur of 1 cycle) Tu kari Tintal Drut (3.67s) Deshkar Kishori Amonkar (KA) PiyaJaag Tintal Vilambit (23s) HotoTore Tintal Drut (6.8s) Miyan ki Todi Prabha Atre (PA) Mana Panchi Ektal Vilambit (50.3s) Ja..re Parigama Ektal Drut (3.01s) Miyan ki Todi Kaivalya Kumar (KK) Sa..i.. Ektal Vilambit (67s) Ab more Ektal Drut (3.98s) Marwa Rashid Khan (RK) Piya more Ektal Vilambit (72.4s) Kavu ki rit Tintal Drut (4.58s)

Rhythmogram of Bhoopali, RK Bada Khyal in Madhya laya,tintal Chota khyal in Drut laya,tintal Alap Bada Khyal Drut bandish Bandish & Vistar Sargam Tan Bandish Tan 42

Similarity Matrix of Bhoopali, RK Bada Khyal in Madhya laya,tintal Chota khyal in Drut laya,tintal Bada Khyal Tan Drut bandish Tan

Rhythmogram of Raag Deshkar by Kishori Amonkar Bada Khyal in Vilambit laya, Tintal & Chota khyal in Drut laya, Tintal Alaap Bada Khyal Drut bandish Bandish & Aakar Vistar Bol- Vistaar 44

Conclusions Spectral features combined with the auditory processing mo0vated bi- phasic func0on achieved good 0me localiza0on of extracted onsets ACF has proved to be tough enough to reveal the inherent periodici0es and strengths of accents of complicate polyphonic music Audio segmenta0on algorithm has returned the boundaries between major sec0ons of alap, bada khyal and chota khyal sec0ons in vocal concerts and also the boundaries of sec0ons like tan, sargam even within drut segment of the performance Boundaries of the segments in the intricate rhythmic repertoire like tabla solo has also been iden0fied

Future work In similarity matrix, Kaida1 slower sec0on is showing more dis- similarity with faster sec0on of kaida1, compared to Kaida2 slower sec0on. Tempo due to surface rhythm has become a prominent criteria in this analysis. Novelty func0on has to be refined to capture the stroke level changes ie., when a bol of basic theka is replaced by other bol. For this, 0mbral dis- similarity of strokes has to be captured. Along with the percussive onsets, non- percussive onsets have to be extracted in the novelty detecbon stage so as to capture the complete rhythmic structure Efficiency of autocorrelabon approach of rhythmic pakern representabon has to be compared with other prevalent approaches like combinabon of DFT and frequency mapped autocorrelabon method, that relies on the dominant metrical level of the segment Other aspects of music like Bmbre and melody have to be combined with rhythm to extract the boundaries of segments within bada khyal other than tan and sargam

REFERENCES [1] M. Cooper and J. Foote, Summarizing popular music via structural similarity analysis, in Proceedings of IEEE Workshop on Applica0ons of Signal Processing to Audio and Acous0cs (WASPAA 03), pp. 127 130, New Paltz, NY, USA, October 2003. [2] J. Foote, Automa0c audio segmenta0on using a measure of audio novelty, in Proceedings of IEEE Interna0onal Conference on Mul0media and Expo (ICME 00), vol. 1, pp. 452 455, New York, NY, USA, July- August 2000. [3] K. Jensen, J. Xu, and M. Zachariasen, Rhythm- based segmenta0on of popular chinese music, in Proc. Int. Conf. Music Inf. Retrieval (ISMIR), London, U.K., 2005. [4] M. Miron, Automa0c Detec0on of Hindustani Talas, Master s thesis, Universitat Pompeu Fabra, Barcelona, Spain, 2011. [5] M. Clayton, Time in Indian Music : Rhythm, Metre and Form in North Indian Rag Performance, Oxford University Press, 2000. [6] K. Jensen, Mul0ple Scale Music Segmenta0on Using Rhythm, Timbre, and Harmony, EURASIP Journal on Advances in Signal Processing, Aug 2006. [7] J. P. Bello, L. Daudet, S. Abdallah, C. Duxbury, M. Davies, and M. B. Sandler: A Tutorial on Onset Detec0on in Music Signals, IEEE transac0ons on Speech and Audio, vol. 13, no. 5, pp. 1035 1047, 2005. [8] S. Dixon : Onset Detec0on Revisited, In Proceedings of the Interna0onal Conference on Digital Audio Effects (DAFx 06), Montreal, Canada. 2006 [9] P. Grosche and M. M uller, Extrac0ng predominant local pulse informa0on from music recordings, IEEE Transac0ons on Audio, Speech, and Language Processing, 19(6): 1688 1701, 2011.

REFERENCES Contd. [10] S. Gula0 and P. Rao, Rhythm parern representa0on for tempo detec0on in music, Proc. of the First Interna0onal Conference on Intelligent Interac0ve Technologies and Mul0media, Dec, 2010, Allahabad, India. [11] V.Rao, P.Rao, Vocal melody detec0on in the presence of pitched accompaniment using harmonic matching methods, Proc. of the 11th Int. Conference on Digital Audio Effects (DAFx- 08), Espoo, Finland, September 1-4, 2008 [12] A.Bapat and P.Rao, Pitch tracking of voice in tabla background by the two- way mismatch method, Proc. of the 13th Int. Conf. on Advanced Compu0ng and Communica0ons, 2005, Coimbatore, India. [13] J. Cheri Ross, Vinutha T.P. and P.Rao, Detec0ng melodic mo0fs from audio for Hindustani classical music, Proc. of ISMIR 2012. [14] P. Pradeep Kumar., P. Rao, and S. D. Roy, "Note onset detec0on in natural humming", Conference on Computa0onal Intelligence and Mul0media Applica0ons, 2007, Interna0onal Conference on. Vol. 4. IEEE, 2007. [15] D. J. Hermes, Vowel onset detec0on, J. Acoust. Soc. of America, 87(2), pp.866-873, 1990. [16] C. Ros ao, R. Ribeiro, and D. Mar0ns de Matos Influence of Peak selec0on methods on Onset detec0on,13th Interna0onal Society for Music Informa0on Retrieval Conference (ISMIR 2012) [17]T. Pr atzlich, A Cross- Version Approach for Novelty Detec0on inmusic Recordings, Dec 2011, Mast.Thesis, Saraland Univ. [18]G.Peeters, Template based Es0ma0on of Time- varying Tempo

Thank You!!