The Comparison of Selected Audio Features and Classification Techniques in the Task of the Musical Instrument Recognition

Size: px
Start display at page:

Download "The Comparison of Selected Audio Features and Classification Techniques in the Task of the Musical Instrument Recognition"

Transcription

1 POSTER 206, PRAGUE MAY 24 The Comarison of Selected Audio Features and Classification Techniques in the Task of the Musical Instrument Recognition Miroslav MALÍK, Richard ORJEŠEK Det. of Telecommunications and Multimedia, University of Žilina, Univerzitná, Žilina, Slovakia Abstract. We resent a comarative evaluation of classification of 3 tyes of Euroean orchestral musical instruments by classification methods k-nearest Neighbors, Gaussian mixture model, Artificial neural network and its imrovement, the Droout ANN. The main objective was to investigate recognition caabilities of these methods with an alication of several audio features, namely MFCC, LPC, LSP and derived features, which has been tested indeendently and in double combinations of the best resulting features. Using the mentioned features, the best ercentage of 92% has been achieved. Keywords Recognition, classification, musical instruments, knn, gmm, ann, audio features.. Introduction Perhas, the humankind was accomanied with music throughout its history. This miscellaneous sequence of tones with various rhythm, temo and style can abstractly exress author's feelings. Music also involves characteristic cultural and locale elements. Musical instruments are various in the similar way, they have a big art in musical creation, where a significant fraction cannot exist without them. Nowadays, there is a huge amount of digital musical records available and still increases, so the automatic musical instrument recognition is one of tools which can hel with data search. More ossibilities of alication are in the automatic annotation of audio content, structural coding, or some software for musicians. In the sixties of the last century, scientists began to analyze audio roerties of musical instruments. First attemts in the musical instrument recognition showed u in the nineties, mainly due to the wider availability and develoment of informational technologies. These early systems was caable to recognize only small number of musical instruments, also with limited tonal range. Later, K. Martin and Y. Kim created a system working with isolated tones of 4 instruments with their full tonal range. The k-nearest neighbor algorithm roved itself as best classification technique, accomanied with Fisher's discriminant analysis for data reduction and a hierarchic classification architecture, which select the arent class first and then the algorithm continues to selecting a articular instrument. The dataset was slitted with ratio of 70/30 for training/testing samles. For 5 instrument classes, the system reached the accuracy of 93% and 72% accuracy for articular instruments [2]. The other bigger research in the recognition of musical instruments and audio features belongs to A. Eronen [3]. His system worked with 30 orchestral instruments and reaches the accuracy of 94% for instrument class. However, in the commercial area the musical instrument recognition remains behind its affiliated sector, the seech recognition. This situation can by caused by the fact, that the seech contains much valuable information, often more simly describable than music. 2. Audio features It is necessary to describe an audio signal by the certain grou of arameters which sufficiently recise reresents roerties of an audio signal for a ossible analysis of an audio record from the view of its content. This arameters are called the audio features and in general, we could divide this features into 3 basic grous: temoral sectral statistical From inut audio signal samles, we could directly obtain temoral features, such as coefficients of FIR, IIR, LPC, or zero crossing. To obtain the sectral information, we must transform the audio signal by one of many transformations, often the Fourier transform, discrete cosine transform or wavelet transform. By this manner we can acquire sectral features, such as FFT coefficients, cestral coefficients or the sectral centroid. The mean

2 2 M. MALÍK, R. ORJEŠEK, MUSICAL INSTRUMENT RECOGNITION value of the signal energy, the skewness coefficient and the sectral sloe can be mentioned as tyical reresentatives of statistical arameters which describes the signal roerties in the term of statistics. Nowadays, there is a big amount of audio features, the ercentage of classification deends on the discrimination caability of selected audio features. For our exeriment have been selected secies of features mentioned below. The arameterization of audio recordings has been erformed using the freely available tool oensmile [5] and it has been alied on frames of the length of 30 ms with the half overlaing, using the Hamming window function. 2. MFCC The biggest emloyment in the field of audio rocessing surely belongs to Mel frequency cestral coefficients. The main area of usage of the MFCC is seech rocessing (seech and seaker recognition, authentication and verification of seaker, etc.), but MFCC is widely used in the semantic analysis of audio content, such as recognition of common sound events, musical genre recognition and lenty other alications. The algorithm of the MFCC consists from following stes. The inut signal is at first filtered using re-emhasis filter which corrects ossible weakened higher frequencies in the signal caused by signal ath. Then is the signal transformed into the sectral domain, tyically by the FFT, and then the transformed signal enters the filter bank with the Mel frequency division. Using the logarithm oeration, the signal of each Mel frequency bands is non-linearly transformed to decrease dynamic range of values, so it reduces the sensitivity of frequency estimations. Finally, the MFCC are obtained by the inverse transformation, the discrete cosine transform can be used as well as the inverse Fourier transform. To the MFCC are also added the dynamical features of signal describing temoral changes of the sectrum, which are very imortant in the human ercetion of sound - the coefficients and the coefficients. 2.2 LPC An another method originally develoed for seech rocessing alications that accomlishes very good results also in the musical instruments recognition is the method of linear rediction coefficients. The LPC are obtained using linear rediction, which is based on the simle assumtion, that the n-th samle of the audio signal can be relaced by the linear combination of Q revious samles: s(n) = Q i = a i s( n i). () For a calculation of rediction coefficients is most frequently used the autocorrelation method and Levinson- Durbin's algorithm. With sufficiency high order of the redictor, it can be ossible to very accurately describe the sectral enveloe of the audio signal. Over the years, there have been successively roosed many modifications of the linear rediction which take into account the human ercetion of sound. For this urose was designed new features such as ercetual linear rediction coefficients PLP, the linear rediction cestral coefficients LPCC or the ercetual linear rediction cestral coefficients PLPCC. The PLP are derived from the LPC by an alication of the Mel scaled filter bank, similar to MFCC. By the cestral transformation as in the case of the MFCC, the LPCC are obtained from the LPC. Using combination of Mel scaled filter bank together with cestral analysis alied on the LPC we obtain the PLPCC. Another derived grou of features tyically used in the seech coding are LSP coefficients (Line Sectral Pairs), which are gained by decomosing of the redictor to a symmetric and an asymmetric art signifying zeros and oles of the LP filter. 2.3 Formants A formant is a concentration of acoustic energy around a articular frequency in the audio wave. In the connection with the seech signal, formants reresent an indication of the resonant frequencies of the vocal tract model, the similar analogy can be alied for musical instruments too. With formants, the sectral enveloe can be described like using the LPC. 2.4 Sectral coefficients As mentioned above, by alication of suitable sectral transformation the original temoral signal can be transferred into the sectral domain. In our tests, the FFT coefficients has been used, filtered in octave bands sulemented by following coefficients derived from FFT: Sectral centroid - indicates a region with the biggest density of frequency reresentation in the audio signal, often called "brightness". Sectral flux - reorts temoral changes in the signal sectra. Sectral kurtosis - describes sharness of the signal sectra. Sectral roll-off - determines a threshold bellow which the biggest art of the signal energy resides. Sectral skewness - statistical measure of the asymmetry of the robability distribution of the audio signal sectrum. Sectral sloe - characterize the loss of signal's energy at higher frequencies.

3 POSTER 206, PRAGUE MAY 24 3 Sectral sread - reresents instantaneous bandwidth around signal's sectral centroid. 3. Classification There are several classification techniques used for classification of audio content. They are based on comaring the similarities between unknown inut audio files and known sounds. In the ast, the sound rocessing used the intuitive comarison of functional vector atterns. Current studies of acoustical roerties favor statistical models because they rovides more flexible robabilistic results. The most common methods of classification are based on the Gaussian Mixture Model (GMM), Hidden Markov Model (HMM), k-nearest Neighbors (knn), Artificial Neural Networks (ANNs), Vector Quantization (VQ) and Suort Vector Machines (SVM). 3. Artificial Neural Networks The main concet of artificial neural networks has been insired by biological neural networks. ANN is created with mathematical neurons, rimitive units, where each unit rocessed weighted inut signals and generates the outut. The neural network is a toological arrangement of individual neurons into the structure communicated with oriented weighted interconnections. Each artificial neural network is defined by the tye of neurons, toological arrangements and the strategy of the adatation in the training rocedure. Basic ideas of the concet and arrangements of the neural network are shown on the most used feed-forward neural network in the Fig. : A schematic model of a single neuron is shown in Figure 2. We can slit the neuron into several arts as synases rated with weights w, which feed the inut x into the neuron, the body where an inner otential of neuron z is obtained, the block with a transfer function f, and finally, the outut of the neuron y. Into the body of the neuron also enters a value b called the bias. Fig. 2. Schematic model of a single neuron. The inner otential and the outut signal of the simle neuron can be comuted by following equation: z = n i= w x + b i i (2) y = f (z) (3) where n is the number of inuts. Multilayer neural networks have at least one hidden layer excet for the inut and the outut layer. The number of neurons in hidden layers can be different and it is selected according to the character of a solving task. Inut and outut layers are defined in the same way as for the single layer neural network, but the outut is involved by hidden layers: n k k y = f ( w x + b ) (4) j i= 0 ji i j where n reresents not only the dimension of the inut vector, but in general, for the k-th layer it is defined as the number of neurons in the revious (k-)-th layer. Fig.. Multilayer feed-forward neural network. The Fig. shows that individual neurons are arranged to several layers. Neurons of the same layer are not connected between each other, but they are usually fully connected to all neurons of neighbor layers. Connections between neurons reresent aths for the signal's roagation. They are oriented and rated with weights which modify the intensity of the assing signal. A zero weight reresents that the connection does not exist. The first layer of the neural network is called the inut layer and the last layer is called the outut layer, other layers of the ANN are called hidden layers. 3.2 Neural network training The term of neural network's training rocess reresents adatation of network's arameters and synatic weights to minimize the error function. This error function constitutes differences between the outut of the ANN and the exected outut. A basic algorithm used in neural network learning is the back roagation (BP). The term "back" means that comared to the inut signal, which moves through the ANN from the inut layer to the outut layer, the error moves through the ANN in the oosite direction, and can be comuted as: e = y y (5) d where ē reresents the error vector. Squared error for the ANN is obtained by:

4 4 M. MALÍK, R. ORJEŠEK, MUSICAL INSTRUMENT RECOGNITION = N ( j e ) j= 2 ε. (6) This squared error comutation is based on a single inut. It would be otimal if we were able to exress the mean square error for each of ossible inuts, but we do not have these inuts. We have only the training set of training airs. In this case we can exress the mean square error as: ε = E{ ε } (7) for each ε. By minimization of this function, for examle, with the gradient descent method we would have obtained the best ossible classification in a meaning of the mean squared error for the training set. The mean squared error is a function of weights w and via adatation of this weights can be realized a minimization rocess of the mean squared error. For weights modifications we can write: ( t + ) = w( t) µ ε w, (8) where µ reresents a learning rate. The gradient ε can be written as: ε = w ( t) ε. (9) The calculation of the gradient is not ractical due to the large number of elements in the training set and a large number of network weights, unfortunately, very difficult also for less extensive networks. Therefore, it is relaced by calculation of the sequence of artial gradients when each artial gradient obtained in one ste learning network aroximates the value of the gradient over the whole training set: ε w ε ( t) ( t) w. (0) One of comlications in the BP algorithm is that it can be traed in a local minimum. There have been develoed some imrovements of the BP that hels avoiding of this issue. The modification of the BP algorithm is the imlementation of the arameter η into the adatation rocess, where the arameter η is initially chosen close to and then it decreases the value during the adatation rocess to 0. In the case of increasing the error rate, it is recommended modify the arameter η: w ( t + ) = w( t) + ( ) w( t) + η w( t ) η. () For η = 0, the algorithm is the same as the basic BP algorithm. To revent overfitting, it can be used a method called droout described in [9]. 3.3 Gaussian mixture model The density in comonent models is exressed as a linear combination of density functions. A model with M comonents can be written in the form: M ( x) = P( j) ( x j) j=, (2) where arameters P(j) are called mixture coefficients which reresent robabilities of j-th comonent. Part (x j) reresents arameters of density functions which tyically fluctuate around j. The condition is that the function must not be negative and should integrate to over its entire definition field. A limitation of comonent s coefficients M j= P ( j) = ( j) 0, (3) P, (4) ensures that the model will reresent the robability function. The comonent model is generative and is useful as a rocess of generating samles from reresented density. The first of j-comonents is randomly selected with robability P(j). It just deends on choosing the form of density comonents. To maximize the similarity of the GMM data is used the Exectation Maximization algorithm (EM). EM is aroriate for rerocessing of roblem with the equivalence of minimizing of negative record of similarity in data set using the relation: E = Θ = N n= n ( x ) log, (5) which is regarded as an error function [6]. Θ reresents the set of arameters of GMM which is needed to find. After the a roer training of the model, the GMM method becomes very efficient and fast tool for classification, which is comutationally inexensive. A disadvantage may be the absence of a higher order signal information. 3.4 k-nearest Neighbors The main rincile of the knn algorithm is very simle and it is based on a comarison of data distances in selected feature s sace. A more similar data are closer together than a less similar data. An inut data of algorithm are divided into a training and a test data. The training data are data sets which are divided into grous, each grou characterize one class. The KNN seeks certain distance surroundings for each element of the test data the neighborhood, containing k training elements, resectively the reference data, and on the basis of certain criteria most often the majority rule algorithm assigns the tested element to one of classes. Desite the simlicity of the algorithm, this method gives good results and is used as well as a verification method. Additional advantages of the knn include ease of imlementation and flexibility. As the main disadvantage is

5 POSTER 206, PRAGUE MAY 24 5 considered the fact that the training data must be stored in memory and that the knn do not create a comlex model from the training data, thus saves some time, but very comarison is time deendant on the size of the database of training elements [6]. formants and sectral coefficients didn't reach good results, also the combination of MFCC with LPC didn't, so we mentioned only the others remaining audio features and combination of LPC with LSP in the Table 2. Informational results of absent features for knn and GMM can be found in []. 4. Dataset The dataset of audio recordings of musical instruments used for our exeriments is based rimarily on the database of Electronic music studios at the University of Iowa. It contains recordings of articular string and woodwind musical instruments that lay individual notes of the chromatic scale across the full range of the instrument including various laying techniques tyical for some instruments, for examle arco, izzicato, or vibrato. These audio recordings also involves various dynamics of layed tone,, mf and ff, and thus sound samles reresent the entire dynamic structure of selected musical instruments. Audio samles were recorded mostly by the condenser microhone Neumann KM 84 with cardioid characteristics, in the anechoic chamber in Wendell Johnson Seech and Hearing Centre. The samle frequency is 44, khz and bit deth is 6 bit. Overall, 3 western orchestral musical instrument classes has been used for training and testing, with its durations, the number of clis and the number of classes listed in Tab.. Tab. 2. The classification results. Determining of success was realized by the F- measure. The F-measure (also F-score or F-score) is frequently used statistical recision, for examle, in a data retrieval or machine learning. It is defined by the equation: 2. P. R F = P + R, (6) where P (Precision) is the number of correct ositive results divided by the number of all ositive results, R (Recall) reresents the number of correct ositive results divided by the number of ositive results that should have been returned and F is then a weighted average of P and R. 6. Conclusion Tab.. Comosition of the musical instruments dataset. 5. Results The entire database was divided into two arts similar as in [2], 70 to 30 %, where the bigger art was used as the training data and the smaller art for testing. We tested all audio features mentioned above indeendently and in combinations of the MFCC with the LPC and the LPC with the LSP, by four classification algorithms - the knn, the GMM, simle multilayer ANN and the Droout ANN. The solo features PLP, PLPCC, The aim of the exeriment was ut to the test the ability of recognition of selected orchestral musical instruments using four classification methods - knn, GMM, ANN and the Droout ANN. These methods achieve the best final ercentage for the combination of audio features LPC with LSP. These features overcome in ractice the most commonly used MFCC coefficients, and therefore they aear to be suitable for use in the rocessing of not only seech but also musical instruments. The ANN and its imrovement, the Droout ANN roved themselves as the best methods for musical instruments recognition. Also in time-deendency of training rocedure, the ANNs score the best results. In the future, the recognition model could by extended by hierarchical distribution of musical instruments and also other imroved ANN algorithms could be emloyed. References [] MALÍK, M., CHMULÍK, M., TICHÁ, D. Musical Instrument Recognition Using Selected Audio Features. In Proc. of the th Conf. Transcom, Žilina (Slovakia), 205,

6 6 M. MALÍK, R. ORJEŠEK, MUSICAL INSTRUMENT RECOGNITION [2] MARTIN, K. D., KIM, Y. E. Musical Instrument Identification: A Pattern-recognition Aroach. In Proc. of the 36th meeting of the Acoustical Society of America. USA, 998. [3] ERONEN, A., KLAPURI, A. Musical Instrument Recognition Using Cestral Coefficients and Temoral Features. In Proc. of Int. Conf. on Acoustics, Seech, and Signal Processing. Istambul (Turkey), 2000, [4] ERONEN, A. Comarison of Features for Musical Instrument Recognition. In Proc. of Int. Worksho on Alications of Signal Processing to Audio and Acoustics. NY (USA), 200, [5] The Munich Versatile and Fast Oen-Source Audio Feature Extractor. Online: htt:// [6] THEODORIDIS, S., KOUTROUMBAS, K. Pattern Recognition. 2 nd ed. Elsevier Academic Press, [7] BENADE, A. H. Fundamentals of Musical Acoustics. 2 nd ed. Dover, 990. [8] KOSTEK, B. Percetion Based Data Processing in Acoustics: Alications to Music Information Retrieval and Psychohysiology of Hearing. Sringer, [9] SRIVASTAVA, N., HINTON, G., KRIZHEVSKY, A. Droout: A Simle Way to Prevent Neural Networks from Overfitting. In Journal of Machine Learning Research, 204, vol. 5, About Authors... Miroslav MALÍK was born in Žilina, Slovakia, in 990. He finished MSc. degree at the University of Žilina, Faculty of Electrical Engineering, Deartment of Telecommunications and Multimedia in 204. Currently he studies doctoral degree at the above mentioned deartment. His research area involves acoustics, audio features, machine learning techniques for musical instrument recognition and emotion in music detection. Richard ORJEŠEK was born in Banská Bystrica, Slovakia, in 99. He finished MSc. degree at the University of Žilina, Faculty of Electrical Engineering, Deartment of Telecommunications and Multimedia in 205. Currently he studies doctoral degree at the above mentioned deartment and 2 nd MSc. at the Faculty of Management Science and Informatics. His research area includes machine learning techniques for the audio and image rocessing uroses.

Research on the optimization of voice quality of network English teaching system

Research on the optimization of voice quality of network English teaching system Available online www.ocr.com Journal of Chemical and Pharmaceutical Research, 2014, 6(6):654-660 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 Research on the otimization of voice quality of network

More information

The Use of the Attack Transient Envelope in Instrument Recognition

The Use of the Attack Transient Envelope in Instrument Recognition PAGE 489 The Use of the Attack Transient Enveloe in Instrument Recognition Benedict Tan & Dee Sen School of Electrical Engineering & Telecommunications University of New South Wales Sydney Australia Abstract

More information

Automatic Chord Recognition with Higher-Order Harmonic Language Modelling

Automatic Chord Recognition with Higher-Order Harmonic Language Modelling First ublished in the Proceedings of the 26th Euroean Signal Processing Conference (EUSIPCO-2018) in 2018, ublished by EURASIP. Automatic Chord Recognition with Higher-Order Harmonic Language Modelling

More information

Quantitative Evaluation of Violin Solo Performance

Quantitative Evaluation of Violin Solo Performance Quantitative Evaluation of Violin Solo Performance Yiju Lin, Wei-Chen Chang and Alvin WY Su SCREAM Lab, Deartment of Comuter Science and Engineering, ational Cheng-Kung University, Tainan, Taiwan, ROC

More information

Music Plus One and Machine Learning

Music Plus One and Machine Learning Christoher Rahael School of Informatics and Comuting, Indiana University, Bloomington crahael@indiana.edu Abstract A system for musical accomaniment is resented in which a comuter-driven orchestra follows

More information

The Informatics Philharmonic By Christopher Raphael

The Informatics Philharmonic By Christopher Raphael The Informatics Philharmonic By Christoher Rahael doi:10.1145/1897852.1897875 Abstract A system for musical accomaniment is resented in which a comuter-driven orchestra follows and learns from a soloist

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 27 CALCULATION OF INTERAURAL CROSS-CORRELATION COEFFICIENT (IACC) OF ANY MUSIC SIGNAL CONVOLVED WITH IMPULSE RESPONSES BY USING THE IACC

More information

Convention Paper Presented at the 132nd Convention 2012 April Budapest, Hungary

Convention Paper Presented at the 132nd Convention 2012 April Budapest, Hungary Audio Engineering Society Convention Paer Presented at the nd Convention 0 Aril 6 9 Budaest, Hungary This aer was eer-reviewed as a comlete manuscrit for resentation at this Convention. Additional aers

More information

Analysis of Technique Evolution and Aesthetic Value Realization Path in Piano Performance Based on Musical Hearing

Analysis of Technique Evolution and Aesthetic Value Realization Path in Piano Performance Based on Musical Hearing Abstract Analysis of Technique Evolution and Aesthetic Value Realization Path in Piano Performance Based on Musical Hearing Lina Li Suzhou University Academy of Music, Suzhou 234000, China Piano erformance

More information

A Chance Constraint Approach to Multi Response Optimization Based on a Network Data Envelopment Analysis

A Chance Constraint Approach to Multi Response Optimization Based on a Network Data Envelopment Analysis Journal of Otimization in Industrial Engineering 1 (013) 49-59 A Chance Constraint Aroach to Multi Resonse Otimization Based on a Network ata Enveloment Analysis Mahdi Bashiri a* Hamid Reza Rezaei b a

More information

On Some Topological Properties of Pessimistic Multigranular Rough Sets

On Some Topological Properties of Pessimistic Multigranular Rough Sets I.J. Intelligent Systems Alications, 2012,, 10-17 ublished Online July 2012 in MES (htt://www.mecs-ress.org/) DOI: 10.515/ijisa.2012.0.02 On Some Toological roerties of essimistic Multigranular Rough Sets

More information

Classification of Musical Instruments sounds by Using MFCC and Timbral Audio Descriptors

Classification of Musical Instruments sounds by Using MFCC and Timbral Audio Descriptors Classification of Musical Instruments sounds by Using MFCC and Timbral Audio Descriptors Priyanka S. Jadhav M.E. (Computer Engineering) G. H. Raisoni College of Engg. & Mgmt. Wagholi, Pune, India E-mail:

More information

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC Scientific Journal of Impact Factor (SJIF): 5.71 International Journal of Advance Engineering and Research Development Volume 5, Issue 04, April -2018 e-issn (O): 2348-4470 p-issn (P): 2348-6406 MUSICAL

More information

Automatic Rhythmic Notation from Single Voice Audio Sources

Automatic Rhythmic Notation from Single Voice Audio Sources Automatic Rhythmic Notation from Single Voice Audio Sources Jack O Reilly, Shashwat Udit Introduction In this project we used machine learning technique to make estimations of rhythmic notation of a sung

More information

DATA COMPRESSION USING NEURAL NETWORKS IN BIO-MEDICAL SIGNAL PROCESSING

DATA COMPRESSION USING NEURAL NETWORKS IN BIO-MEDICAL SIGNAL PROCESSING DATA COMPRESSION USING NEURAL NETWORKS IN BIO-MEDICAL SIGNAL PROCESSING Mandavi 1, Prasannjit 2, Nilotal Mrinal 3, Kalyan Chatterjee 4 and S. Dasguta 5 Deartment of Information Technology, Bengal College

More information

Predicting when to Laugh with Structured Classification

Predicting when to Laugh with Structured Classification ITERSEECH 04 redicting when to Laugh with Structured Classification Bilal iot, Olivier ietquin, Matthieu Geist SUELEC IMS-MaLIS research grou and UMI 958 (GeorgiaTech - CRS) University Lille, LIFL (UMR

More information

MUSI-6201 Computational Music Analysis

MUSI-6201 Computational Music Analysis MUSI-6201 Computational Music Analysis Part 9.1: Genre Classification alexander lerch November 4, 2015 temporal analysis overview text book Chapter 8: Musical Genre, Similarity, and Mood (pp. 151 155)

More information

Dynamics and Relativity: Practical Implications of Dynamic Markings in the Score

Dynamics and Relativity: Practical Implications of Dynamic Markings in the Score Dynamics and Relativity: Practical Imlications o Dynamic Markings in the Score Katerina Kosta 1, Oscar F. Bandtlow 2, Elaine Chew 1 1. Centre or Digital Music, School o Electronic Engineering and Comuter

More information

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION ULAŞ BAĞCI AND ENGIN ERZIN arxiv:0907.3220v1 [cs.sd] 18 Jul 2009 ABSTRACT. Music genre classification is an essential tool for

More information

Speech and Speaker Recognition for the Command of an Industrial Robot

Speech and Speaker Recognition for the Command of an Industrial Robot Speech and Speaker Recognition for the Command of an Industrial Robot CLAUDIA MOISA*, HELGA SILAGHI*, ANDREI SILAGHI** *Dept. of Electric Drives and Automation University of Oradea University Street, nr.

More information

Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons

Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons Róisín Loughran roisin.loughran@ul.ie Jacqueline Walker jacqueline.walker@ul.ie Michael O Neill University

More information

IMPROVED SUBSTITUTION FOR ERRONEOUS LTP-PARAMETERS IN A SPEECH DECODER. Jari Makinen, Janne Vainio, Hannu Mikkola, Jani Rotola-Pukkila

IMPROVED SUBSTITUTION FOR ERRONEOUS LTP-PARAMETERS IN A SPEECH DECODER. Jari Makinen, Janne Vainio, Hannu Mikkola, Jani Rotola-Pukkila IMPROVED SUBSTITUTION FOR ERRONEOUS LTP-PARAMETERS IN A SPEECH DECODER Jari Makinen, Janne Vainio, Hannu Mikkola, Jani Rotola-Pukkila Seech and Audio Systems Laboratory, Nokia Research Center Tamere, Finland,

More information

Appendix A. Strength of metric position. Line toward next core melody tone. Scale degree in the melody. Sonority, in intervals above the bass

Appendix A. Strength of metric position. Line toward next core melody tone. Scale degree in the melody. Sonority, in intervals above the bass Aendi A Schema Protot y es the convenience of reresenting music rotot y es in standard music notation has no doubt made the ractice common. Yet standard music notation oversecifies a rototye s constituent

More information

A Fractal Video Communicator. J. Streit, L. Hanzo. Department of Electronics and Computer Sc., University of Southampton, UK, S09 5NH

A Fractal Video Communicator. J. Streit, L. Hanzo. Department of Electronics and Computer Sc., University of Southampton, UK, S09 5NH A Fractal Video Communicator J. Streit, L. Hanzo Deartment of Electronics and Comuter Sc., University of Southamton, UK, S09 5NH Abstract The image quality and comression ratio trade-os of ve dierent 176

More information

Chord Classification of an Audio Signal using Artificial Neural Network

Chord Classification of an Audio Signal using Artificial Neural Network Chord Classification of an Audio Signal using Artificial Neural Network Ronesh Shrestha Student, Department of Electrical and Electronic Engineering, Kathmandu University, Dhulikhel, Nepal ---------------------------------------------------------------------***---------------------------------------------------------------------

More information

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes hello Jay Biernat Third author University of Rochester University of Rochester Affiliation3 words jbiernat@ur.rochester.edu author3@ismir.edu

More information

MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES

MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES PACS: 43.60.Lq Hacihabiboglu, Huseyin 1,2 ; Canagarajah C. Nishan 2 1 Sonic Arts Research Centre (SARC) School of Computer Science Queen s University

More information

YSP-900. Digital Sound Projector OWNER S MANUAL

YSP-900. Digital Sound Projector OWNER S MANUAL AB -900 Digital Sound Projector OWNER S MANUAL CAUTION: READ THIS BEFORE OPERATING THIS UNIT. CAUTION: READ THIS BEFORE OPERATING THIS UNIT. 1 To assure the finest erformance, lease read this manual carefully.

More information

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

Research Article. ISSN (Print) *Corresponding author Shireen Fathima Scholars Journal of Engineering and Technology (SJET) Sch. J. Eng. Tech., 2014; 2(4C):613-620 Scholars Academic and Scientific Publisher (An International Publisher for Academic and Scientific Resources)

More information

Hidden Markov Model based dance recognition

Hidden Markov Model based dance recognition Hidden Markov Model based dance recognition Dragutin Hrenek, Nenad Mikša, Robert Perica, Pavle Prentašić and Boris Trubić University of Zagreb, Faculty of Electrical Engineering and Computing Unska 3,

More information

A guide to the new. Singing Syllabus. What s changing in New set songs and sight-singing

A guide to the new. Singing Syllabus. What s changing in New set songs and sight-singing A guide to the new Singing Syllabus What s changing in 2009 New set songs and sight-singing Singing Syllabus from 2009 New set songs and sight-singing The Associated Board s Singing Syllabus for 2009 onwards

More information

Classification of Gamelan Tones Based on Fractal Analysis

Classification of Gamelan Tones Based on Fractal Analysis IOP Conference Series: Materials Science and Engineering PAPER OPEN ACCESS Classification of Gamelan Tones Based on Fractal Analysis To cite this article: A Wintarti et al 2018 IOP Conf. Ser.: Mater. Sci.

More information

Singer Identification

Singer Identification Singer Identification Bertrand SCHERRER McGill University March 15, 2007 Bertrand SCHERRER (McGill University) Singer Identification March 15, 2007 1 / 27 Outline 1 Introduction Applications Challenges

More information

Art and Technology- A Timeline. Dr. Gabriela Avram

Art and Technology- A Timeline. Dr. Gabriela Avram Art and Technology- A Timeline Dr. Gabriela Avram This week We are talking about the relationshi between: Society and technology Art and technology How social, olitical and cultural values affect scientific

More information

YSP-500. Digital Sound Projector TM OWNER S MANUAL

YSP-500. Digital Sound Projector TM OWNER S MANUAL AB -500 Digital Sound Projector TM OWNER S MANUAL CAUTION: READ THIS BEFORE OPERATING THIS UNIT. Caution: Read this before oerating this unit. 1 To assure the finest erformance, lease read this manual

More information

Theseus and the Minotaur

Theseus and the Minotaur Butler University Digital Commons @ Butler University Graduate Thesis Collection Graduate Scholarshi 016 Theseus and the Minotaur Ben Lutterbach Butler University, blutterb@butler.edu Follow this and additional

More information

Supervised Learning in Genre Classification

Supervised Learning in Genre Classification Supervised Learning in Genre Classification Introduction & Motivation Mohit Rajani and Luke Ekkizogloy {i.mohit,luke.ekkizogloy}@gmail.com Stanford University, CS229: Machine Learning, 2009 Now that music

More information

Automatic Laughter Detection

Automatic Laughter Detection Automatic Laughter Detection Mary Knox Final Project (EECS 94) knoxm@eecs.berkeley.edu December 1, 006 1 Introduction Laughter is a powerful cue in communication. It communicates to listeners the emotional

More information

Automatic Laughter Detection

Automatic Laughter Detection Automatic Laughter Detection Mary Knox 1803707 knoxm@eecs.berkeley.edu December 1, 006 Abstract We built a system to automatically detect laughter from acoustic features of audio. To implement the system,

More information

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 AN HMM BASED INVESTIGATION OF DIFFERENCES BETWEEN MUSICAL INSTRUMENTS OF THE SAME TYPE PACS: 43.75.-z Eichner, Matthias; Wolff, Matthias;

More information

UBTK YSP-1. Digital Sound Projector OWNER'S MANUAL

UBTK YSP-1. Digital Sound Projector OWNER'S MANUAL UBTK YSP-1 Digital Sound Projector OWNER'S MANUAL IMPORTANT SAFETY INSTRUCTIONS CAUTION RISK OF ELECTRIC SHOCK DO NOT OPEN CAUTION: TO REDUCE THE RISK OF ELECTRIC SHOCK, DO NOT REMOVE COVER (OR BACK).

More information

Comparison Parameters and Speaker Similarity Coincidence Criteria:

Comparison Parameters and Speaker Similarity Coincidence Criteria: Comparison Parameters and Speaker Similarity Coincidence Criteria: The Easy Voice system uses two interrelating parameters of comparison (first and second error types). False Rejection, FR is a probability

More information

Semi-supervised Musical Instrument Recognition

Semi-supervised Musical Instrument Recognition Semi-supervised Musical Instrument Recognition Master s Thesis Presentation Aleksandr Diment 1 1 Tampere niversity of Technology, Finland Supervisors: Adj.Prof. Tuomas Virtanen, MSc Toni Heittola 17 May

More information

Violin Timbre Space Features

Violin Timbre Space Features Violin Timbre Space Features J. A. Charles φ, D. Fitzgerald*, E. Coyle φ φ School of Control Systems and Electrical Engineering, Dublin Institute of Technology, IRELAND E-mail: φ jane.charles@dit.ie Eugene.Coyle@dit.ie

More information

UAB YSP-900. Digital Sound Projector OWNER S MANUAL

UAB YSP-900. Digital Sound Projector OWNER S MANUAL UAB -900 Digital Sound Projector OWNER S MANUAL IMPORTANT SAFETY INSTRUCTIONS IMPORTANT SAFETY INSTRUCTIONS CAUTION RISK OF ELECTRIC SHOCK DO NOT OPEN CAUTION: TO REDUCE THE RISK OF ELECTRIC SHOCK, DO

More information

Transcribing string music for saxophone: a presentation of Claude Debussy's Cello Sonata for baritone saxophone

Transcribing string music for saxophone: a presentation of Claude Debussy's Cello Sonata for baritone saxophone University of Iowa Iowa Research Online Theses and Dissertations Sring 2013 Transcribing string music for saxohone: a resentation of Claude Debussy's Cello Sonata for baritone saxohone Nathan Bancroft

More information

UAB YSP Digital Sound Projector OWNER S MANUAL

UAB YSP Digital Sound Projector OWNER S MANUAL UAB -1100 Digital Sound Projector OWNER S MANUAL IMPORTANT SAFETY INSTRUCTIONS IMPORTANT SAFETY INSTRUCTIONS CAUTION RISK OF ELECTRIC SHOCK DO NOT OPEN CAUTION: TO REDUCE THE RISK OF ELECTRIC SHOCK, DO

More information

Subjective Similarity of Music: Data Collection for Individuality Analysis

Subjective Similarity of Music: Data Collection for Individuality Analysis Subjective Similarity of Music: Data Collection for Individuality Analysis Shota Kawabuchi and Chiyomi Miyajima and Norihide Kitaoka and Kazuya Takeda Nagoya University, Nagoya, Japan E-mail: shota.kawabuchi@g.sp.m.is.nagoya-u.ac.jp

More information

COGNITION AND VOLITION

COGNITION AND VOLITION COGNITION AND VOLITION A Contribution to a Cybernetic Theory of Subjectivity Gotthard Günther *) Preamble It seems to be beyond controversy that the novel science of Cybernetics involves the roblem of

More information

Automatic Piano Music Transcription

Automatic Piano Music Transcription Automatic Piano Music Transcription Jianyu Fan Qiuhan Wang Xin Li Jianyu.Fan.Gr@dartmouth.edu Qiuhan.Wang.Gr@dartmouth.edu Xi.Li.Gr@dartmouth.edu 1. Introduction Writing down the score while listening

More information

A NOVEL CEPSTRAL REPRESENTATION FOR TIMBRE MODELING OF SOUND SOURCES IN POLYPHONIC MIXTURES

A NOVEL CEPSTRAL REPRESENTATION FOR TIMBRE MODELING OF SOUND SOURCES IN POLYPHONIC MIXTURES A NOVEL CEPSTRAL REPRESENTATION FOR TIMBRE MODELING OF SOUND SOURCES IN POLYPHONIC MIXTURES Zhiyao Duan 1, Bryan Pardo 2, Laurent Daudet 3 1 Department of Electrical and Computer Engineering, University

More information

Deep Neural Networks Scanning for patterns (aka convolutional networks) Bhiksha Raj

Deep Neural Networks Scanning for patterns (aka convolutional networks) Bhiksha Raj Deep Neural Networks Scanning for patterns (aka convolutional networks) Bhiksha Raj 1 Story so far MLPs are universal function approximators Boolean functions, classifiers, and regressions MLPs can be

More information

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM A QUER B EAMPLE MUSIC RETRIEVAL ALGORITHM H. HARB AND L. CHEN Maths-Info department, Ecole Centrale de Lyon. 36, av. Guy de Collongue, 69134, Ecully, France, EUROPE E-mail: {hadi.harb, liming.chen}@ec-lyon.fr

More information

Exploring Principles-of-Art Features For Image Emotion Recognition

Exploring Principles-of-Art Features For Image Emotion Recognition Exloring Princiles-of-Art Features For Image Emotion Recognition Sicheng Zhao, Yue Gao, iaolei Jiang, Hongxun Yao, Tat-Seng Chua, iaoshuai Sun School of Comuter Science and Technology, Harbin Institute

More information

CS229 Project Report Polyphonic Piano Transcription

CS229 Project Report Polyphonic Piano Transcription CS229 Project Report Polyphonic Piano Transcription Mohammad Sadegh Ebrahimi Stanford University Jean-Baptiste Boin Stanford University sadegh@stanford.edu jbboin@stanford.edu 1. Introduction In this project

More information

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication Proceedings of the 3 rd International Conference on Control, Dynamic Systems, and Robotics (CDSR 16) Ottawa, Canada May 9 10, 2016 Paper No. 110 DOI: 10.11159/cdsr16.110 A Parametric Autoregressive Model

More information

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods Kazuyoshi Yoshii, Masataka Goto and Hiroshi G. Okuno Department of Intelligence Science and Technology National

More information

Voice Controlled Car System

Voice Controlled Car System Voice Controlled Car System 6.111 Project Proposal Ekin Karasan & Driss Hafdi November 3, 2016 1. Overview Voice controlled car systems have been very important in providing the ability to drivers to adjust

More information

CAS LX 502 Semantics. Meaning as truth conditions. Recall the trick we can do. How do we arrive at truth conditions?

CAS LX 502 Semantics. Meaning as truth conditions. Recall the trick we can do. How do we arrive at truth conditions? CAS LX 502 Semantics 2a. Reference, Comositionality, Logic 2.1-2.3 Meaning as truth conditions! We know the meaning of if we know the conditions under which is true.! conditions under which is true = which

More information

EPSON PowerLite 5550C/7550C. User s Guide

EPSON PowerLite 5550C/7550C. User s Guide EPSON PowerLite 5550C/7550C User s Guide Coyright Notice All rights reserved. No art of this ublication may be reroduced, stored in a retrieval system, or transmitted in any form or by any means, electronic,

More information

MUSICAL NOTE AND INSTRUMENT CLASSIFICATION WITH LIKELIHOOD-FREQUENCY-TIME ANALYSIS AND SUPPORT VECTOR MACHINES

MUSICAL NOTE AND INSTRUMENT CLASSIFICATION WITH LIKELIHOOD-FREQUENCY-TIME ANALYSIS AND SUPPORT VECTOR MACHINES MUSICAL NOTE AND INSTRUMENT CLASSIFICATION WITH LIKELIHOOD-FREQUENCY-TIME ANALYSIS AND SUPPORT VECTOR MACHINES Mehmet Erdal Özbek 1, Claude Delpha 2, and Pierre Duhamel 2 1 Dept. of Electrical and Electronics

More information

MUSICAL INSTRUMENT IDENTIFICATION BASED ON HARMONIC TEMPORAL TIMBRE FEATURES

MUSICAL INSTRUMENT IDENTIFICATION BASED ON HARMONIC TEMPORAL TIMBRE FEATURES MUSICAL INSTRUMENT IDENTIFICATION BASED ON HARMONIC TEMPORAL TIMBRE FEATURES Jun Wu, Yu Kitano, Stanislaw Andrzej Raczynski, Shigeki Miyabe, Takuya Nishimoto, Nobutaka Ono and Shigeki Sagayama The Graduate

More information

THE importance of music content analysis for musical

THE importance of music content analysis for musical IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 1, JANUARY 2007 333 Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With

More information

Music Information Retrieval with Temporal Features and Timbre

Music Information Retrieval with Temporal Features and Timbre Music Information Retrieval with Temporal Features and Timbre Angelina A. Tzacheva and Keith J. Bell University of South Carolina Upstate, Department of Informatics 800 University Way, Spartanburg, SC

More information

Advanced Scalable Hybrid Video Coding

Advanced Scalable Hybrid Video Coding Politechnika Poznańska Wydział Elektryczny Instytut Elektroniki i Telekomunikacji Zakład Telekomunikacji Multimedialnej i Radioelektroniki ul. Piotrowo 3A, 6-965 Poznań Łukasz Błaszak Advanced Scalable

More information

Robert Alexandru Dobre, Cristian Negrescu

Robert Alexandru Dobre, Cristian Negrescu ECAI 2016 - International Conference 8th Edition Electronics, Computers and Artificial Intelligence 30 June -02 July, 2016, Ploiesti, ROMÂNIA Automatic Music Transcription Software Based on Constant Q

More information

LIFESTYLE VS 1. Video Expander

LIFESTYLE VS 1. Video Expander LIFESTYLE VS 1 Video Exander Imortant Safety Information 1. Read these instructions for all comonents before using this roduct. 2. Kee these instructions for future reference. 3. Heed all warnings on the

More information

Singer Traits Identification using Deep Neural Network

Singer Traits Identification using Deep Neural Network Singer Traits Identification using Deep Neural Network Zhengshan Shi Center for Computer Research in Music and Acoustics Stanford University kittyshi@stanford.edu Abstract The author investigates automatic

More information

Piano Why a Trinity Piano exam? Initial Grade 8. Exams and repertoire books designed to develop creative and confident piano players

Piano Why a Trinity Piano exam? Initial Grade 8. Exams and repertoire books designed to develop creative and confident piano players Piano 0 07 Initial Grade 8 Exams and reertoire books designed to develo creative and confident iano layers The 0 07 Piano syllabus from Trinity College London offers the choice and flexibility to allow

More information

When the computer enables freedom from the machine. (On an outline of the work Hérédo-Ribotes)

When the computer enables freedom from the machine. (On an outline of the work Hérédo-Ribotes) When the comuter enables freedom from the machine (On an outline of the work Hérédo-ibotes) abstract abien Lévy 1 comoser In some cases when the musical rocess is sufficiently verbalised and formalized

More information

Sequitur XIII for extended piano and live-electronics (two players)

Sequitur XIII for extended piano and live-electronics (two players) Karlheinz Essl Sequitur XIII for extended iano and live-electronics (two layers) 2009 Dedicated to Tzenka Dianova 2009 by Karlheinz Essl www.essl.at Karlheinz Essl Sequitur XIII for extended iano & live-electronics

More information

Normalized Cumulative Spectral Distribution in Music

Normalized Cumulative Spectral Distribution in Music Normalized Cumulative Spectral Distribution in Music Young-Hwan Song, Hyung-Jun Kwon, and Myung-Jin Bae Abstract As the remedy used music becomes active and meditation effect through the music is verified,

More information

Figure 1: Feature Vector Sequence Generator block diagram.

Figure 1: Feature Vector Sequence Generator block diagram. 1 Introduction Figure 1: Feature Vector Sequence Generator block diagram. We propose designing a simple isolated word speech recognition system in Verilog. Our design is naturally divided into two modules.

More information

ISSN ICIRET-2014

ISSN ICIRET-2014 Robust Multilingual Voice Biometrics using Optimum Frames Kala A 1, Anu Infancia J 2, Pradeepa Natarajan 3 1,2 PG Scholar, SNS College of Technology, Coimbatore-641035, India 3 Assistant Professor, SNS

More information

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication Journal of Energy and Power Engineering 10 (2016) 504-512 doi: 10.17265/1934-8975/2016.08.007 D DAVID PUBLISHING A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations

More information

Music Source Separation

Music Source Separation Music Source Separation Hao-Wei Tseng Electrical and Engineering System University of Michigan Ann Arbor, Michigan Email: blakesen@umich.edu Abstract In popular music, a cover version or cover song, or

More information

Musically Useful Scale Practice

Musically Useful Scale Practice Musically Useul Scale Practice by Jason Siord jiano@gmail.com The study o scales is one o the oundations o iano ractice. They rovide the ianist with a means to ractice coordination, imrove inger dexterity,

More information

Available online at ScienceDirect. Procedia Computer Science 46 (2015 )

Available online at  ScienceDirect. Procedia Computer Science 46 (2015 ) Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 46 (2015 ) 381 387 International Conference on Information and Communication Technologies (ICICT 2014) Music Information

More information

MUSICAL INSTRUMENTCLASSIFICATION USING MIRTOOLBOX

MUSICAL INSTRUMENTCLASSIFICATION USING MIRTOOLBOX MUSICAL INSTRUMENTCLASSIFICATION USING MIRTOOLBOX MS. ASHWINI. R. PATIL M.E. (Digital System),JSPM s JSCOE Pune, India, ashu.rpatil3690@gmail.com PROF.V.M. SARDAR Assistant professor, JSPM s, JSCOE, Pune,

More information

2013 SCHOOLS NOTES. MOZART CLARINET CONCERTO Victoria. Image: Mats Bäcker

2013 SCHOOLS NOTES. MOZART CLARINET CONCERTO Victoria. Image: Mats Bäcker 2013 SCHOOLS NOTES MOZART CLARINET CONCERTO Victoria Image: Mats Bäcker MOZART CLARINET CONCERTO Possible Toics/Units o Study: The Classical Period and It s Inluences; Music or Small Ensembles/ Large Ensembles;

More information

TORCHMATE GROWTH SERIES MINI CATALOG

TORCHMATE GROWTH SERIES MINI CATALOG TORCHMATE GROWTH SERIES MINI CATALOG PLASMA EDUCATIONAL PACKAGE 4X4 table to CNC system with cable carriers Water table for fume control and material suort ACCUMOVE 2 next generation height control (machine

More information

Audio-Based Video Editing with Two-Channel Microphone

Audio-Based Video Editing with Two-Channel Microphone Audio-Based Video Editing with Two-Channel Microphone Tetsuya Takiguchi Organization of Advanced Science and Technology Kobe University, Japan takigu@kobe-u.ac.jp Yasuo Ariki Organization of Advanced Science

More information

Acoustic Scene Classification

Acoustic Scene Classification Acoustic Scene Classification Marc-Christoph Gerasch Seminar Topics in Computer Music - Acoustic Scene Classification 6/24/2015 1 Outline Acoustic Scene Classification - definition History and state of

More information

Investigation of Digital Signal Processing of High-speed DACs Signals for Settling Time Testing

Investigation of Digital Signal Processing of High-speed DACs Signals for Settling Time Testing Universal Journal of Electrical and Electronic Engineering 4(2): 67-72, 2016 DOI: 10.13189/ujeee.2016.040204 http://www.hrpub.org Investigation of Digital Signal Processing of High-speed DACs Signals for

More information

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS Andrew N. Robertson, Mark D. Plumbley Centre for Digital Music

More information

J. HARRY WHALLEY. Mixed Quartet, NGS and EEG 2012

J. HARRY WHALLEY. Mixed Quartet, NGS and EEG 2012 J HARRY WHALLEY CLASP TOGETHER (BETA) Mixed Quartet, NGS and EEG 202 Architecture: The diagrams on these ages demonstrate the architecture of the technology for Clas Together (beta) NGS - Neurogranular

More information

Detecting Musical Key with Supervised Learning

Detecting Musical Key with Supervised Learning Detecting Musical Key with Supervised Learning Robert Mahieu Department of Electrical Engineering Stanford University rmahieu@stanford.edu Abstract This paper proposes and tests performance of two different

More information

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng Introduction In this project we were interested in extracting the melody from generic audio files. Due to the

More information

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC G.TZANETAKIS, N.HU, AND R.B. DANNENBERG Computer Science Department, Carnegie Mellon University 5000 Forbes Avenue, Pittsburgh, PA 15213, USA E-mail: gtzan@cs.cmu.edu

More information

Recognising Cello Performers using Timbre Models

Recognising Cello Performers using Timbre Models Recognising Cello Performers using Timbre Models Chudy, Magdalena; Dixon, Simon For additional information about this publication click this link. http://qmro.qmul.ac.uk/jspui/handle/123456789/5013 Information

More information

Weiss High School Band

Weiss High School Band Creating the Ideal 2017-2018 Woodwind & Brass Audition Information www.weissbands.org 2017-2018 Woodwind & Brass Auditions It is an honor and a rivilege to be a member of the Program. The Weiss Band is

More information

Music Radar: A Web-based Query by Humming System

Music Radar: A Web-based Query by Humming System Music Radar: A Web-based Query by Humming System Lianjie Cao, Peng Hao, Chunmeng Zhou Computer Science Department, Purdue University, 305 N. University Street West Lafayette, IN 47907-2107 {cao62, pengh,

More information

MILLER, TYLER MAXWELL, M.M. Winds of Change (2013) Directed by Dr. Alejandro Rutty. 55pp.

MILLER, TYLER MAXWELL, M.M. Winds of Change (2013) Directed by Dr. Alejandro Rutty. 55pp. MILLER, TYLER MAXWELL, M.M. Winds of Change (201) Directed by Dr. Alejandro Rutty. 55. Winds of Change attemts to build uon the recent tradition of monohonic comositional ractice by combining techniques

More information

Henry Walford Davies. SONATA n r 2. in A major for Violin & Piano. edited by. Rupert Marshall-Luck EMP SP002

Henry Walford Davies. SONATA n r 2. in A major for Violin & Piano. edited by. Rupert Marshall-Luck EMP SP002 Henry Walford Davies SONATA n r 2 in A major for Violin & Piano edited by Ruert Marshall-Luck EMP SP002 I Poco agitato (quasi Allegro) Poco agitato (quasi Allegro) 8 Allegro semlice Allegro semlice 14

More information

Application Of Missing Feature Theory To The Recognition Of Musical Instruments In Polyphonic Audio

Application Of Missing Feature Theory To The Recognition Of Musical Instruments In Polyphonic Audio Application Of Missing Feature Theory To The Recognition Of Musical Instruments In Polyphonic Audio Jana Eggink and Guy J. Brown Department of Computer Science, University of Sheffield Regent Court, 11

More information

Automatic Commercial Monitoring for TV Broadcasting Using Audio Fingerprinting

Automatic Commercial Monitoring for TV Broadcasting Using Audio Fingerprinting Automatic Commercial Monitoring for TV Broadcasting Using Audio Fingerprinting Dalwon Jang 1, Seungjae Lee 2, Jun Seok Lee 2, Minho Jin 1, Jin S. Seo 2, Sunil Lee 1 and Chang D. Yoo 1 1 Korea Advanced

More information

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval DAY 1 Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval Jay LeBoeuf Imagine Research jay{at}imagine-research.com Rebecca

More information

Music Segmentation Using Markov Chain Methods

Music Segmentation Using Markov Chain Methods Music Segmentation Using Markov Chain Methods Paul Finkelstein March 8, 2011 Abstract This paper will present just how far the use of Markov Chains has spread in the 21 st century. We will explain some

More information

Improving Frame Based Automatic Laughter Detection

Improving Frame Based Automatic Laughter Detection Improving Frame Based Automatic Laughter Detection Mary Knox EE225D Class Project knoxm@eecs.berkeley.edu December 13, 2007 Abstract Laughter recognition is an underexplored area of research. My goal for

More information

A Thesis. Submitted to the Faculty. Master of Arts DIGITAL MUSICS. Beau Sievers DARTMOUTH COLLEGE. Hanover, New Hampshire.

A Thesis. Submitted to the Faculty. Master of Arts DIGITAL MUSICS. Beau Sievers DARTMOUTH COLLEGE. Hanover, New Hampshire. Follow te Bouncing Ball: Music, Motion, and Emotion A Tesis Submitted to te Faculty in artial fulfillment of te requirements for te degree of Master of Arts in DIGITAL MUSICS by Beau Sievers DARTMOUTH

More information