Sound Recording Techniques. MediaCity, Salford, Wednesday 26th March, 2014

1 Sound Recording Techniques. MediaCity, Salford, Wednesday 26th March, 2014

2 Perception and automated assessment of recorded audio quality, focussing on user generated content.

3 How distortion affects the perceived quality of music: Psychoacoustic experiments Iain Jackson, Bruno M. Fazenda, Trevor J. Cox, Paul Kendrick, Francis F. Li, Stephen Groves-Kirkby, & Alex Wilson Acoustics Research Centre, University of Salford

4 How does clipping affect the perception of quality in music? Are hard clipping and soft clipping perceived differently in terms of quality? How well does HASQI predict subjective quality ratings of clipped music? How robust is HASQI across different styles of music?

5 What is HASQI? Hearing Aid Speech Quality Index (Kates & Arehart, 2010). Models the effect of degradation on quality. Measures the combined effect of noise, nonlinear distortion, and linear filters, for both normal-hearing and hearing-impaired listeners. Good performance for speech signals (Kressner et al., 2013). What happens when it is applied to music?

6 Arehart, Kates & Anderson (2011). Wide variety of degradation/processing: additive noise, peak clipping, amplitude quantisation, compression, compression + babble, spectral subtraction, high-pass filter, low-pass filter, bandpass filter, positive spectral tilt, negative spectral tilt, single resonance peak, multiple peaks, stationary noise... a total of 112 conditions. But only 3 samples of music: Haydn, jazz, vocalise. Quality ratings were reasonably well predicted by HASQI, but were also significantly affected by genre of music.

7 Experiment 1: The effect of hard clipping on perceptions of quality. In contrast to previous work, we assess the effect of a single type of processing (hard clipping) against a comprehensive range of musical styles.

8 Sample Selection. Aim: select a representative sample of as wide a range of musical styles as possible. Guided by previous work (Rentfrow & Gosling, 2003): 10 prototype songs from each of 14 genres (classical, jazz, blues, folk, alternative, rock, heavy metal, country, pop, religious, rap/hip-hop, soul, funk, and electronica/dance). Final sample library of 140 songs. We obtained CD copies of 117 songs on the list. How to scale down to a manageable number of songs for the test? Sort and cluster by timbre.

9 Sample Selection Why select by timbre, not genre?

10 Genre: intuitively useful but lacking in objectivity. Timbre: apply objective methods to compare songs. Samples clustered using a modified version of the technique used by Aucouturier and Pachet (2002). A Gaussian Mixture Model (GMM) is fitted to the Mel Frequency Cepstrum Coefficients (MFCCs) for 3 sections of each song, which are then clustered by similarity. The total number of clusters is an emergent feature: in this case it was found to be 6.
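The clustering idea on this slide can be sketched in a few lines. This is a simplified stand-in, not the authors' pipeline: each song is summarised by a single diagonal Gaussian rather than a full GMM, songs are compared with a symmetrised KL divergence, and songs closer than a hypothetical threshold are merged, so the number of clusters emerges from the data. The feature frames are synthetic placeholders for MFCC frames, and all function names and the threshold value are assumptions made for illustration.

```python
import random


def fit_diag_gaussian(frames):
    """Return per-dimension (means, variances) for a list of feature frames."""
    n, dims = len(frames), len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dims)]
    variances = [
        sum((f[d] - means[d]) ** 2 for f in frames) / n + 1e-6  # small floor
        for d in range(dims)
    ]
    return means, variances


def symmetric_kl(a, b):
    """KL(a||b) + KL(b||a) for two diagonal Gaussians (log terms cancel)."""
    (m1, v1), (m2, v2) = a, b
    total = 0.0
    for mu1, s1, mu2, s2 in zip(m1, v1, m2, v2):
        diff2 = (mu1 - mu2) ** 2
        total += 0.5 * ((s1 + diff2) / s2 + (s2 + diff2) / s1 - 2.0)
    return total


def cluster_by_threshold(models, threshold):
    """Single-linkage grouping: songs closer than `threshold` share a cluster."""
    parent = list(range(len(models)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(len(models)):
        for j in range(i + 1, len(models)):
            if symmetric_kl(models[i], models[j]) < threshold:
                parent[find(i)] = find(j)
    roots = [find(i) for i in range(len(models))]
    ids = {r: k for k, r in enumerate(dict.fromkeys(roots))}
    return [ids[r] for r in roots]


# Synthetic "songs": two pairs with clearly different feature centres.
rng = random.Random(0)


def fake_song(centre, n_frames=200, dims=4):
    return [[rng.gauss(centre, 1.0) for _ in range(dims)] for _ in range(n_frames)]


songs = [fake_song(0.0), fake_song(0.2), fake_song(5.0), fake_song(5.3)]
models = [fit_diag_gaussian(s) for s in songs]
labels = cluster_by_threshold(models, threshold=10.0)  # hypothetical threshold
print(labels)
```

With real audio, the frames would come from an MFCC extractor and the threshold would need tuning; the point is only that the cluster count is not fixed in advance.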

11 The test set. From each of our 6 timbre clusters we draw two samples. One cluster, number 4, however, contains only one sample. Additionally, we include the three samples used by Arehart et al. (2011) in their previous assessment of HASQI and music (jazz, Haydn, vocalise). Thus the final test set consists of 14 samples.

12 Table 1. The 14 songs the final test samples were taken from, by cluster number.
Cluster  Song Name                                                                   Artist/Composer
1        Riverboat Set: Denis Dillon's Square Dance Polka, Dancing on the Riverboat  John Whelan
1        Crazy Train                                                                 Ozzy Osbourne
1        Haydn *                                                                     *
2        Ave Maria                                                                   Franz Schubert
2        Packin' Truck                                                               Leadbelly
2        vocalise *                                                                  Tierney Sutton
3        Kalifornia                                                                  Fatboy Slim
3        Brown Sugar                                                                 The Rolling Stones
4        The Four Seasons: Spring                                                    Antonio Vivaldi
5        For What It's Worth                                                         Buffalo Springfield
5        The Girl From Ipanema                                                       Stan Getz
6        Spoonful                                                                    Howlin' Wolf
6        Nobody Loves Me But My Mother                                               B.B. King
6        jazz *                                                                      *

13 Method

14 Distortion of samples. HASQI is continuous between 0 and 1. HASQI values were used to define discrete distortion levels. 10 levels per song sample: 9 levels of distortion, spread at equal intervals over the full range of (available) HASQI values, plus the original, clean sample.
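One way to picture the level selection is as a search for the clipping threshold that produces each target score. The sketch below is illustrative only: `toy_quality` is a hypothetical monotonic stand-in for a HASQI score (it is not the HASQI model), and the bisection search, function names and minimum threshold are assumptions rather than the procedure used in the study.

```python
def toy_quality(threshold):
    """Hypothetical stand-in for a HASQI score in [0, 1]; NOT the real model.

    Increases monotonically with the clipping threshold (fraction of peak).
    """
    return threshold ** 0.5


def threshold_for(target, quality, lo=0.0, hi=1.0, tol=1e-6):
    """Bisection search for the threshold whose quality score equals `target`."""
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if quality(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def equal_interval_levels(quality, n_levels=9, min_threshold=0.01):
    """Return (target score, threshold) pairs at equal score intervals."""
    q_max = quality(1.0)            # clean sample: no clipping
    q_min = quality(min_threshold)  # most distorted sample available
    step = (q_max - q_min) / n_levels
    targets = [q_max - k * step for k in range(1, n_levels + 1)]
    return [(t, threshold_for(t, quality, min_threshold, 1.0)) for t in targets]


levels = equal_interval_levels(toy_quality)
for score, threshold in levels:
    print(f"target score {score:.2f} -> threshold {100 * threshold:.1f}% of peak")
```

Because the relationship between score and threshold differs from song to song, the same nine target scores would generally map to different thresholds for each sample.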

15 [Figure: Relationship between HASQI values and threshold, for Crazy Train and For What It's Worth. Axes: threshold (% of peak level) and distortion level (1-HASQI).]

16 Table 1. The 14 songs the final test samples were taken from, by cluster number, with example samples at Clean, Medium and High distortion levels. [Song list as on slide 12; example samples not reproduced.]

17 Broadly reproduced the method used by Arehart et al. 30 participants; mean age 23.7 years (SD: 4.7 years); no reported hearing impairments. Sounds presented over headphones (Sennheiser HD 650), stereo, 72 dB (linear). 140 trials: 14 songs x 10 processing conditions, 7-second samples, randomised presentation order. Ratings of overall quality: slider labelled Bad and Excellent at either end (output: 0-100).

18 Results

19 Figure 1. Mean quality ratings of each cluster, as a function of distortion level. (Error bars show 95% CIs.)

21 Differences in quality between timbre clusters? Repeated-measures ANOVA. Independent variables: level of distortion, cluster. Dependent variable: mean quality ratings. Significant main effect for distortion level (F(4.97, ) = , p < .01, ηp² = .94). Significant main effect for cluster (F(2.33, 67.48) = 42.43, p < .01, ηp² = .59). Significant interaction of cluster x distortion level (F(11.91, ) = 6.98, p < .01, ηp² = .19). Each successive level of distortion is associated with a significant decrease in quality ratings, but the rate of degradation is not perceived equally across all timbres.
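The reported effect sizes can be sanity-checked against the F ratios, since partial eta squared relates to F by ηp² = (F x df_effect) / (F x df_effect + df_error). A minimal sketch, using only the cluster main effect because it is the one test on this slide with F and both degrees of freedom present; the function name is illustrative.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared recovered from an F ratio and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# Main effect of cluster: F(2.33, 67.48) = 42.43
eta_cluster = partial_eta_squared(42.43, 2.33, 67.48)
print(round(eta_cluster, 2))
```

The result rounds to .59, in line with the value reported for the cluster effect.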

22 Table 2. Clusters grouped according to (between-group) significantly different quality ratings. [Table not reproduced.]

23 Results HASQI performance

24 Table 3. Correlation coefficients for quality ratings and values predicted by HASQI for each timbre cluster. [Per-cluster values not reproduced.] Mean (SD): .732 (.065). HASQI performance: for speech = .942 (Kates & Arehart, 2010); for music = .838 (range = .770 to .849; Arehart et al., 2011). Rnonlin performance: for music = .95 (1 music sample, 10 participants; Moore et al., 2004).
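The coefficients in Table 3 are correlations between listener ratings and model predictions. As a reminder of what such a figure measures, here is a minimal Pearson correlation sketch; the rating and prediction values are invented for illustration and are not data from the experiment.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


# Hypothetical values: predicted scores for one clean and nine distorted levels,
# and made-up mean quality ratings on the 0-100 slider.
predicted = [1.0, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1]
ratings = [82, 75, 70, 58, 55, 41, 38, 30, 22, 20]
r = pearson_r(predicted, ratings)
print(f"r = {r:.3f}")
```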

25 Conclusions. How robust is the HASQI model over a comprehensive range of musical styles? The performance of HASQI was found to be (a little) less accurate than previous work suggests. Overall correlation of predicted vs actual quality ratings = .73 (compared to the equivalent value of .84 in Arehart et al.). Predictive accuracy of HASQI can be improved by factoring in timbral features of samples.

26 Experiment 2 The effect of Hard Vs Soft clipping on perceptions of quality

27 Hard versus soft clipping Partial replication of Experiment 1. Both hard and soft clipping processing conditions included in test set. Equivalent to distortion levels 1 to 5 from Experiment 1 (as opposed to levels 1 to 9 considered in Experiment 1). Samples (original, clean files), experimental set-up, procedure, and number of participants all as per Experiment 1.
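The difference between the two processing types can be illustrated on a test tone. This is a sketch under stated assumptions: the slides do not specify the soft-clipping curve, so the tanh function below is one common choice and not necessarily the one used, and the 50% threshold is arbitrary.

```python
import math


def hard_clip(samples, threshold):
    """Flatten every sample whose magnitude exceeds `threshold`."""
    return [max(-threshold, min(threshold, s)) for s in samples]


def soft_clip(samples, threshold):
    """Assumed tanh soft clipper: saturates smoothly towards +/- threshold."""
    return [threshold * math.tanh(s / threshold) for s in samples]


# One cycle of a sine wave as a toy signal.
sine = [math.sin(2 * math.pi * k / 64) for k in range(64)]
peak = max(abs(s) for s in sine)
threshold = 0.5 * peak  # threshold expressed as a fraction of peak level

hard = hard_clip(sine, threshold)
soft = soft_clip(sine, threshold)
print(max(hard), max(soft))
```

Hard clipping leaves samples below the threshold untouched and flattens the rest, whereas the soft clipper reshapes the whole waveform and only approaches the threshold.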

28 [Figure: Hard Clipping Thresholds. Axes: threshold (% of peak level) and distortion level (1-HASQI).]

29 [Figure: Soft Clipping Thresholds. Axes: threshold (% of peak level) and distortion level (1-HASQI).]

30 Hard versus soft clipping. Table 4. Comparison examples of hard and soft clipping at equivalent HASQI levels: Ave Maria (Franz Schubert) and Packin' Truck (Leadbelly), each with hard and soft clipping at Clean, Low and Medium distortion levels. [Example samples not reproduced.]

31 Hard versus soft clipping. Figure 4. Mean quality ratings for hard (left) and soft (right) distortion conditions, shown by cluster. Axes: mean quality rating and distortion level (0 is clean, 9 is most distorted). Error bars show 95% CIs.

32 Hard versus soft clipping Across all samples, no significant difference between ratings for hard and soft clipping. HASQI performance is unaffected by type of distortion.

33 Experiment 3 Descriptions of quality attributes in different distortion categories

34 Digital audio sample statistics. Since digital audio is encoded as discrete samples of the audio waveform, much can be said about a recording from the statistical properties of these samples. The Probability Mass Function (PMF) can show the presence of distortion in mastered audio. Consider three categories: 1. The clean distribution, where there is no clipping and a wide dynamic range. 2. Audio with hard clipping, whose PMF has high values at its extremes, where the maximum amplitude has been reached. 3. Softer distortions, where there is no single large value at the extremes but gentler bumps in the nearby regions.
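The PMF idea can be tried on synthetic samples. The sketch below is illustrative: the signals are Gaussian noise rather than music, the 8-bit depth is chosen only to keep the example small, and `extreme_mass` is a hypothetical helper, not a measure taken from the study.

```python
import random
from collections import Counter


def pmf(samples):
    """Probability mass function: fraction of samples at each integer value."""
    counts = Counter(samples)
    n = len(samples)
    return {value: count / n for value, count in counts.items()}


def extreme_mass(samples):
    """Probability mass sitting at the lowest and highest values present."""
    p = pmf(samples)
    return p[min(p)] + p[max(p)]


def quantise(x, bits=8):
    """Map x to a signed integer sample, saturating at full scale."""
    top = 2 ** (bits - 1) - 1
    return max(-top, min(top, round(x * top)))


rng = random.Random(0)
signal = [rng.gauss(0.0, 0.2) for _ in range(5000)]

clean = [quantise(x) for x in signal]          # stays well inside full scale
clipped = [quantise(4.0 * x) for x in signal]  # gain pushes peaks into clipping

print(extreme_mass(clean), extreme_mass(clipped))
```

The hard-clipped version piles probability mass onto the two end values, which is the signature described in category 2 above.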

35 Subjective Test (Wilson & Fazenda, submitted). 63 samples of music, containing a mix of clean samples, hard-clipping distortion and soft distortions. 22 participants gave quality ratings for each sample on a 5-point scale and also provided 2 descriptors. Ratings for clean samples were significantly higher than for the two distorted categories. The two distortion categories did not significantly differ between themselves (F(1, 2) = 5.72, p < 0.001, η² = 0.008).

36 Verbal descriptions of distortion categories. As well as giving a rating out of 5, participants were asked to provide two words describing the attributes on which quality was assessed. For example: "I gave this sample 5 stars because it was clear and full"; "I gave this sample 1 star because it was distorted and dull". [Figure: word clouds of the most common attributes associated with (a) clean samples, (b) hard-clipped samples, (c) soft-distortion samples.]

37 Verbal descriptions of distortion categories. The table shows the five most commonly used descriptor words and their absolute frequencies for each of the clean, hard-clipped and soft-distortion categories. Chi-square analysis shows significant variation in the distribution of words used to describe each of the three categories (χ²(8, N = 547) = 33.28, p < 0.001). Bold frequencies in the table indicate values significantly greater (>) or less (<) than the expected counts under the null hypothesis. [Table not reproduced.]
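A chi-square test of independence on a 3 x 5 table (three categories, five words) has (3 - 1) x (5 - 1) = 8 degrees of freedom, which matches the statistic quoted above. The sketch below shows the calculation; the counts are invented for illustration, since the original table is not available here.

```python
def chi_square(table):
    """Chi-square statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n  # under independence
            stat += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


# Hypothetical counts: rows are clean, hard-clipped and soft-distortion samples;
# columns are five descriptor words.
counts = [
    [10, 40, 30, 25, 20],
    [35, 30, 25, 20, 15],
    [45, 15, 25, 20, 15],
]
stat, dof = chi_square(counts)
print(f"chi-square = {stat:.2f}, dof = {dof}")
```

Cells whose observed count differs most from the expected count contribute most to the statistic, which is what the bold entries in the original table flag.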

38 Verbal descriptions of distortion categories. "Distorted" is used less often than expected by chance to describe samples in the clean category. The opposite is true for both other categories, the hard-clipped and soft-distortion samples. Samples in the soft category are more frequently described as "Distorted" than those in the hard category. This suggests that small amounts of hard clipping can go unnoticed. "Punchy" is used less often when describing the soft distortions than when describing hard clipping. This may be due to the lesser influence of inter-sample peaks in soft distortions compared to hard clipping. "Harsh" was not associated with either of the distortion categories but does appear more often than expected by chance among the words describing the clean samples.

39 Conclusions. Overall, HASQI was found to predict degradation in music quality reasonably well. Performance across hard and soft clipping is very good. Limitation of HASQI for music: it was not developed for stereo, and the model does not account for stereo width and panning.

40 References
K.H. Arehart, J.M. Kates and M.C. Anderson. Effects of noise, nonlinear processing, and linear filtering on perceived music quality. Int. J. Audiol. 50(3): (2011).
J.J. Aucouturier and F. Pachet. Music similarity measures: What's the use? Proc. ISMIR (2002).
J.M. Kates and K.H. Arehart. The Hearing-Aid Speech Quality Index (HASQI). J. Audio Eng. Soc. 58(5): (2010).
A. Kressner, D. Anderson and C. Rozell. Evaluating the generalization of the Hearing Aid Speech Quality Index (HASQI). IEEE Trans. Audio Speech Lang. Process. 21(2): (2013).
B.C.J. Moore, C.-T. Tan, N. Zacharov and V.-V. Mattila. Measuring and predicting the perceived quality of music and speech subjected to combined linear and nonlinear distortion. J. Audio Eng. Soc. 52(12): (2004).
P.J. Rentfrow and S.D. Gosling. The Do Re Mi's of everyday life: The structure and personality correlates of music preferences. J. Pers. Soc. Psychol. 84(6): (2003).
A. Wilson and B.M. Fazenda. Sonic character: Categorisation of distortion profiles in relation to audio quality of music recordings. Submitted to 17th Int. Conference on Digital Audio Effects (DAFx-14).


More information

Topic 4. Single Pitch Detection

Topic 4. Single Pitch Detection Topic 4 Single Pitch Detection What is pitch? A perceptual attribute, so subjective Only defined for (quasi) harmonic sounds Harmonic sounds are periodic, and the period is 1/F0. Can be reliably matched

More information

Effect of task constraints on the perceptual. evaluation of violins

Effect of task constraints on the perceptual. evaluation of violins Manuscript Click here to download Manuscript: SaitisManuscriptRevised.tex Saitis et al.: Perceptual evaluation of violins 1 Effect of task constraints on the perceptual evaluation of violins Charalampos

More information

Noise evaluation based on loudness-perception characteristics of older adults

Noise evaluation based on loudness-perception characteristics of older adults Noise evaluation based on loudness-perception characteristics of older adults Kenji KURAKATA 1 ; Tazu MIZUNAMI 2 National Institute of Advanced Industrial Science and Technology (AIST), Japan ABSTRACT

More information

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS Andrew N. Robertson, Mark D. Plumbley Centre for Digital Music

More information

THE RELATIONSHIP BETWEEN DICHOTOMOUS THINKING AND MUSIC PREFERENCES AMONG JAPANESE UNDERGRADUATES

THE RELATIONSHIP BETWEEN DICHOTOMOUS THINKING AND MUSIC PREFERENCES AMONG JAPANESE UNDERGRADUATES SOCIAL BEHAVIOR AND PERSONALITY, 2012, 40(4), 567-574 Society for Personality Research http://dx.doi.org/10.2224/sbp.2012.40.4.567 THE RELATIONSHIP BETWEEN DICHOTOMOUS THINKING AND MUSIC PREFERENCES AMONG

More information

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS Susanna Spinsante, Ennio Gambi, Franco Chiaraluce Dipartimento di Elettronica, Intelligenza artificiale e

More information

Creating a Feature Vector to Identify Similarity between MIDI Files

Creating a Feature Vector to Identify Similarity between MIDI Files Creating a Feature Vector to Identify Similarity between MIDI Files Joseph Stroud 2017 Honors Thesis Advised by Sergio Alvarez Computer Science Department, Boston College 1 Abstract Today there are many

More information

DYNAMIC AUDITORY CUES FOR EVENT IMPORTANCE LEVEL

DYNAMIC AUDITORY CUES FOR EVENT IMPORTANCE LEVEL DYNAMIC AUDITORY CUES FOR EVENT IMPORTANCE LEVEL Jonna Häkkilä Nokia Mobile Phones Research and Technology Access Elektroniikkatie 3, P.O.Box 50, 90571 Oulu, Finland jonna.hakkila@nokia.com Sami Ronkainen

More information

Convention Paper Presented at the 145 th Convention 2018 October 17 20, New York, NY, USA

Convention Paper Presented at the 145 th Convention 2018 October 17 20, New York, NY, USA Audio Engineering Society Convention Paper 10080 Presented at the 145 th Convention 2018 October 17 20, New York, NY, USA This Convention paper was selected based on a submitted abstract and 750-word precis

More information

ESG Engineering Services Group

ESG Engineering Services Group ESG Engineering Services Group PESQ Limitations for EVRC Family of Narrowband and Wideband Speech Codecs January 2008 80-W1253-1 Rev D 80-W1253-1 Rev D QUALCOMM Incorporated 5775 Morehouse Drive San Diego,

More information

MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES

MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES MUSICAL INSTRUMENT RECOGNITION WITH WAVELET ENVELOPES PACS: 43.60.Lq Hacihabiboglu, Huseyin 1,2 ; Canagarajah C. Nishan 2 1 Sonic Arts Research Centre (SARC) School of Computer Science Queen s University

More information

Study of White Gaussian Noise with Varying Signal to Noise Ratio in Speech Signal using Wavelet

Study of White Gaussian Noise with Varying Signal to Noise Ratio in Speech Signal using Wavelet American International Journal of Research in Science, Technology, Engineering & Mathematics Available online at http://www.iasir.net ISSN (Print): 2328-3491, ISSN (Online): 2328-3580, ISSN (CD-ROM): 2328-3629

More information

A New Method for Calculating Music Similarity

A New Method for Calculating Music Similarity A New Method for Calculating Music Similarity Eric Battenberg and Vijay Ullal December 12, 2006 Abstract We introduce a new technique for calculating the perceived similarity of two songs based on their

More information

Release Year Prediction for Songs

Release Year Prediction for Songs Release Year Prediction for Songs [CSE 258 Assignment 2] Ruyu Tan University of California San Diego PID: A53099216 rut003@ucsd.edu Jiaying Liu University of California San Diego PID: A53107720 jil672@ucsd.edu

More information

A PSYCHOACOUSTICAL INVESTIGATION INTO THE EFFECT OF WALL MATERIAL ON THE SOUND PRODUCED BY LIP-REED INSTRUMENTS

A PSYCHOACOUSTICAL INVESTIGATION INTO THE EFFECT OF WALL MATERIAL ON THE SOUND PRODUCED BY LIP-REED INSTRUMENTS A PSYCHOACOUSTICAL INVESTIGATION INTO THE EFFECT OF WALL MATERIAL ON THE SOUND PRODUCED BY LIP-REED INSTRUMENTS JW Whitehouse D.D.E.M., The Open University, Milton Keynes, MK7 6AA, United Kingdom DB Sharp

More information

An Accurate Timbre Model for Musical Instruments and its Application to Classification

An Accurate Timbre Model for Musical Instruments and its Application to Classification An Accurate Timbre Model for Musical Instruments and its Application to Classification Juan José Burred 1,AxelRöbel 2, and Xavier Rodet 2 1 Communication Systems Group, Technical University of Berlin,

More information

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC G.TZANETAKIS, N.HU, AND R.B. DANNENBERG Computer Science Department, Carnegie Mellon University 5000 Forbes Avenue, Pittsburgh, PA 15213, USA E-mail: gtzan@cs.cmu.edu

More information

Improving Frame Based Automatic Laughter Detection

Improving Frame Based Automatic Laughter Detection Improving Frame Based Automatic Laughter Detection Mary Knox EE225D Class Project knoxm@eecs.berkeley.edu December 13, 2007 Abstract Laughter recognition is an underexplored area of research. My goal for

More information

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound Pitch Perception and Grouping HST.723 Neural Coding and Perception of Sound Pitch Perception. I. Pure Tones The pitch of a pure tone is strongly related to the tone s frequency, although there are small

More information

Reduced complexity MPEG2 video post-processing for HD display

Reduced complexity MPEG2 video post-processing for HD display Downloaded from orbit.dtu.dk on: Dec 17, 2017 Reduced complexity MPEG2 video post-processing for HD display Virk, Kamran; Li, Huiying; Forchhammer, Søren Published in: IEEE International Conference on

More information

Chord Classification of an Audio Signal using Artificial Neural Network

Chord Classification of an Audio Signal using Artificial Neural Network Chord Classification of an Audio Signal using Artificial Neural Network Ronesh Shrestha Student, Department of Electrical and Electronic Engineering, Kathmandu University, Dhulikhel, Nepal ---------------------------------------------------------------------***---------------------------------------------------------------------

More information

HIT SONG SCIENCE IS NOT YET A SCIENCE

HIT SONG SCIENCE IS NOT YET A SCIENCE HIT SONG SCIENCE IS NOT YET A SCIENCE François Pachet Sony CSL pachet@csl.sony.fr Pierre Roy Sony CSL roy@csl.sony.fr ABSTRACT We describe a large-scale experiment aiming at validating the hypothesis that

More information

Automatic Music Similarity Assessment and Recommendation. A Thesis. Submitted to the Faculty. Drexel University. Donald Shaul Williamson

Automatic Music Similarity Assessment and Recommendation. A Thesis. Submitted to the Faculty. Drexel University. Donald Shaul Williamson Automatic Music Similarity Assessment and Recommendation A Thesis Submitted to the Faculty of Drexel University by Donald Shaul Williamson in partial fulfillment of the requirements for the degree of Master

More information

Non-Reducibility with Knowledge wh: Experimental Investigations

Non-Reducibility with Knowledge wh: Experimental Investigations Non-Reducibility with Knowledge wh: Experimental Investigations 1 Knowing wh and Knowing that Obvious starting picture: (1) implies (2). (2) iff (3). (1) John knows that he can buy an Italian newspaper

More information

Multiband Noise Reduction Component for PurePath Studio Portable Audio Devices

Multiband Noise Reduction Component for PurePath Studio Portable Audio Devices Multiband Noise Reduction Component for PurePath Studio Portable Audio Devices Audio Converters ABSTRACT This application note describes the features, operating procedures and control capabilities of a

More information

Keep your broadcast clear.

Keep your broadcast clear. Net- MOZAIC Keep your broadcast clear. Video stream content analyzer The NET-MOZAIC Probe can be used as a stand alone product or an integral part of our NET-xTVMS system. The NET-MOZAIC is normally located

More information

Correlating differences in the playing properties of five student model clarinets with physical differences between them

Correlating differences in the playing properties of five student model clarinets with physical differences between them Correlating differences in the playing properties of five student model clarinets with physical differences between them P. M. Kowal, D. Sharp and S. Taherzadeh Open University, DDEM, MCT Faculty, Open

More information

Perceptual Analysis of Video Impairments that Combine Blocky, Blurry, Noisy, and Ringing Synthetic Artifacts

Perceptual Analysis of Video Impairments that Combine Blocky, Blurry, Noisy, and Ringing Synthetic Artifacts Perceptual Analysis of Video Impairments that Combine Blocky, Blurry, Noisy, and Ringing Synthetic Artifacts Mylène C.Q. Farias, a John M. Foley, b and Sanjit K. Mitra a a Department of Electrical and

More information

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications Matthias Mauch Chris Cannam György Fazekas! 1 Matthias Mauch, Chris Cannam, George Fazekas Problem Intonation in Unaccompanied

More information

ISMIR 2008 Session 2a Music Recommendation and Organization

ISMIR 2008 Session 2a Music Recommendation and Organization A COMPARISON OF SIGNAL-BASED MUSIC RECOMMENDATION TO GENRE LABELS, COLLABORATIVE FILTERING, MUSICOLOGICAL ANALYSIS, HUMAN RECOMMENDATION, AND RANDOM BASELINE Terence Magno Cooper Union magno.nyc@gmail.com

More information

STAT 113: Statistics and Society Ellen Gundlach, Purdue University. (Chapters refer to Moore and Notz, Statistics: Concepts and Controversies, 8e)

STAT 113: Statistics and Society Ellen Gundlach, Purdue University. (Chapters refer to Moore and Notz, Statistics: Concepts and Controversies, 8e) STAT 113: Statistics and Society Ellen Gundlach, Purdue University (Chapters refer to Moore and Notz, Statistics: Concepts and Controversies, 8e) Learning Objectives for Exam 1: Unit 1, Part 1: Population

More information

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

ELEC 691X/498X Broadcast Signal Transmission Fall 2015 ELEC 691X/498X Broadcast Signal Transmission Fall 2015 Instructor: Dr. Reza Soleymani, Office: EV 5.125, Telephone: 848 2424 ext.: 4103. Office Hours: Wednesday, Thursday, 14:00 15:00 Time: Tuesday, 2:45

More information

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM A QUER B EAMPLE MUSIC RETRIEVAL ALGORITHM H. HARB AND L. CHEN Maths-Info department, Ecole Centrale de Lyon. 36, av. Guy de Collongue, 69134, Ecully, France, EUROPE E-mail: {hadi.harb, liming.chen}@ec-lyon.fr

More information

Topic 10. Multi-pitch Analysis

Topic 10. Multi-pitch Analysis Topic 10 Multi-pitch Analysis What is pitch? Common elements of music are pitch, rhythm, dynamics, and the sonic qualities of timbre and texture. An auditory perceptual attribute in terms of which sounds

More information

Speeding up Dirac s Entropy Coder

Speeding up Dirac s Entropy Coder Speeding up Dirac s Entropy Coder HENDRIK EECKHAUT BENJAMIN SCHRAUWEN MARK CHRISTIAENS JAN VAN CAMPENHOUT Parallel Information Systems (PARIS) Electronics and Information Systems (ELIS) Ghent University

More information

Automatic Classification of Instrumental Music & Human Voice Using Formant Analysis

Automatic Classification of Instrumental Music & Human Voice Using Formant Analysis Automatic Classification of Instrumental Music & Human Voice Using Formant Analysis I Diksha Raina, II Sangita Chakraborty, III M.R Velankar I,II Dept. of Information Technology, Cummins College of Engineering,

More information

Perceptual Considerations in Designing and Fitting Hearing Aids for Music Published on Friday, 14 March :01

Perceptual Considerations in Designing and Fitting Hearing Aids for Music Published on Friday, 14 March :01 Perceptual Considerations in Designing and Fitting Hearing Aids for Music Published on Friday, 14 March 2008 11:01 The components of music shed light on important aspects of hearing perception. To make

More information

Jacob A. Maddams, Saoirse Finn, Joshua D. Reiss Centre for Digital Music, Queen Mary University of London London, UK

Jacob A. Maddams, Saoirse Finn, Joshua D. Reiss Centre for Digital Music, Queen Mary University of London London, UK AN AUTONOMOUS METHOD FOR MULTI-TRACK DYNAMIC RANGE COMPRESSION Jacob A. Maddams, Saoirse Finn, Joshua D. Reiss Centre for Digital Music, Queen Mary University of London London, UK jacob.maddams@gmail.com

More information

Objective Video Quality Assessment of Direct Recording and Datavideo HDR-40 Recording System

Objective Video Quality Assessment of Direct Recording and Datavideo HDR-40 Recording System JAICT, Journal of Applied Information and Communication Technologies Vol., No., 206 Objective Video Quality Assessment of Direct Recording and Datavideo HDR-40 Recording System Nofia Andreana, Arif Nursyahid

More information

Unequal Error Protection Codes for Wavelet Image Transmission over W-CDMA, AWGN and Rayleigh Fading Channels

Unequal Error Protection Codes for Wavelet Image Transmission over W-CDMA, AWGN and Rayleigh Fading Channels Unequal Error Protection Codes for Wavelet Image Transmission over W-CDMA, AWGN and Rayleigh Fading Channels MINH H. LE and RANJITH LIYANA-PATHIRANA School of Engineering and Industrial Design College

More information

Temporal Envelope and Periodicity Cues on Musical Pitch Discrimination with Acoustic Simulation of Cochlear Implant

Temporal Envelope and Periodicity Cues on Musical Pitch Discrimination with Acoustic Simulation of Cochlear Implant Temporal Envelope and Periodicity Cues on Musical Pitch Discrimination with Acoustic Simulation of Cochlear Implant Lichuan Ping 1, 2, Meng Yuan 1, Qinglin Meng 1, 2 and Haihong Feng 1 1 Shanghai Acoustics

More information

A Language Modeling Approach for the Classification of Audio Music

A Language Modeling Approach for the Classification of Audio Music A Language Modeling Approach for the Classification of Audio Music Gonçalo Marques and Thibault Langlois DI FCUL TR 09 02 February, 2009 HCIM - LaSIGE Departamento de Informática Faculdade de Ciências

More information

On Human Capability and Acoustic Cues for Discriminating Singing and Speaking Voices

On Human Capability and Acoustic Cues for Discriminating Singing and Speaking Voices On Human Capability and Acoustic Cues for Discriminating Singing and Speaking Voices Yasunori Ohishi 1 Masataka Goto 3 Katunobu Itou 2 Kazuya Takeda 1 1 Graduate School of Information Science, Nagoya University,

More information

The song remains the same: identifying versions of the same piece using tonal descriptors

The song remains the same: identifying versions of the same piece using tonal descriptors The song remains the same: identifying versions of the same piece using tonal descriptors Emilia Gómez Music Technology Group, Universitat Pompeu Fabra Ocata, 83, Barcelona emilia.gomez@iua.upf.edu Abstract

More information