Predicting annoyance judgments from psychoacoustic metrics: Identifiable versus neutralized sounds

Similar documents
Modeling sound quality from psychoacoustic measures

ADVANCED PROCEDURES FOR PSYCHOACOUSTIC NOISE EVALUATION

Colour-influences on loudness judgements

Relation between the overall unpleasantness of a long duration sound and the one of its events : application to a delivery truck

Loudness of pink noise and stationary technical sounds

Table 1 Pairs of sound samples used in this study Group1 Group2 Group1 Group2 Sound 2. Sound 2. Pair

Proceedings of Meetings on Acoustics

Psychoacoustic Evaluation of Fan Noise

Rhona Hellman and the Munich School of Psychoacoustics

Sound design strategy for enhancing subjective preference of EV interior sound

The quality of potato chip sounds and crispness impression

The importance of recording and playback technique for assessment of annoyance

A SEMANTIC DIFFERENTIAL STUDY OF LOW AMPLITUDE SUPERSONIC AIRCRAFT NOISE AND OTHER TRANSIENT SOUNDS

Experiments on tone adjustments

Measurement of overtone frequencies of a toy piano and perception of its pitch

Sound Quality Analysis of Electric Parking Brake

Determination of Sound Quality of Refrigerant Compressors

Loudness and Sharpness Calculation

Interior and Motorbay sound quality evaluation of full electric and hybrid-electric vehicles based on psychoacoustics

Noise evaluation based on loudness-perception characteristics of older adults

Characterization of sound quality of impulsive sounds using loudness based metric

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

JOURNAL OF BUILDING ACOUSTICS. Volume 20 Number

DIFFERENCES IN TRAFFIC NOISE MEASUREMENTS WITH SLM AND BINAURAL RECORDING HEAD

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

Proceedings of Meetings on Acoustics

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

A comparison of the temporal weighting of annoyance and loudness

Temporal summation of loudness as a function of frequency and temporal pattern

The importance of recording and playback technique for assessment of annoyance Celik, Emine; Waye, Kerstin Persson; Møller, Henrik

Consonance perception of complex-tone dyads and chords

Recent developments in sound quality evaluation

Analysing Room Impulse Responses with Psychoacoustical Algorithms: A Preliminary Study

MASTER S THESIS. Sound Quality Evaluation of Floor Impact Noise Generated by Walking. Payman Roonasi

EFFECTS OF REVERBERATION TIME AND SOUND SOURCE CHARACTERISTIC TO AUDITORY LOCALIZATION IN AN INDOOR SOUND FIELD. Chiung Yao Chen

DERIVING A TIMBRE SPACE FOR THREE TYPES OF COMPLEX TONES VARYING IN SPECTRAL ROLL-OFF

Progress in calculating tonality of technical sounds

IP Telephony and Some Factors that Influence Speech Quality

Loudness of transmitted speech signals for SWB and FB applications

Predicting Performance of PESQ in Case of Single Frame Losses

LOUDNESS EFFECT OF THE DIFFERENT TONES ON THE TIMBRE SUBJECTIVE PERCEPTION EXPERIMENT OF ERHU

A PSYCHOACOUSTICAL INVESTIGATION INTO THE EFFECT OF WALL MATERIAL ON THE SOUND PRODUCED BY LIP-REED INSTRUMENTS

Barwa dźwięku i jej podcechy. II rok reżyserii dźwięku AM_3_2017

Soundscape and Psychoacoustics Using the resources for environmental noise protection. Standards in Psychoacoustics

MEASURING LOUDNESS OF LONG AND SHORT TONES USING MAGNITUDE ESTIMATION

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

Do Zwicker Tones Evoke a Musical Pitch?

Implementing sharpness using specific loudness calculated from the Procedure for the Computation of Loudness of Steady Sounds

Sound Quality of Wind Turbines

Modeling memory for melodies

Aalborg Universitet. Published in: The 2005 Congress and Exposition on Noise Control Engineering. Publication date: 2005

A BEM STUDY ON THE EFFECT OF SOURCE-RECEIVER PATH ROUTE AND LENGTH ON ATTENUATION OF DIRECT SOUND AND FLOOR REFLECTION WITHIN A CHAMBER ORCHESTRA

ANALYSING DIFFERENCES BETWEEN THE INPUT IMPEDANCES OF FIVE CLARINETS OF DIFFERENT MAKES

Study on the Sound Quality Objective Evaluation of High Speed Train's. Door Closing Sound

Room acoustics computer modelling: Study of the effect of source directivity on auralizations

Equal Intensity Contours for Whole-Body Vibrations Compared With Vibrations Cross-Modally Matched to Isophones

Binaural Measurement, Analysis and Playback

EXPLORING PSYCHOACOUSTIC INDICATORS TO ASSESS CLOSE PROXIMITY TYRE-ROAD NOISE

Listener Envelopment LEV, Strength G and Reverberation Time RT in Concert Halls

ACTIVE SOUND DESIGN: VACUUM CLEANER

Proceedings of Meetings on Acoustics

Springer Series in Information Sciences 22

TO HONOR STEVENS AND REPEAL HIS LAW (FOR THE AUDITORY STSTEM)

Analysis, Synthesis, and Perception of Musical Sounds

Noise assessment in a high-speed train

STAT 113: Statistics and Society Ellen Gundlach, Purdue University. (Chapters refer to Moore and Notz, Statistics: Concepts and Controversies, 8e)

Concert halls conveyors of musical expressions

The Tone Height of Multiharmonic Sounds. Introduction

INTER-NOISE AUGUST 2007 ISTANBUL, TURKEY

The characterisation of Musical Instruments by means of Intensity of Acoustic Radiation (IAR)

INSTRUCTION SHEET FOR NOISE MEASUREMENT

Analysis of car door closing sound quality

STUDY OF THE PERCEIVED QUALITY OF SAXOPHONE REEDS BY A PANEL OF MUSICIANS

DIGITAL COMMUNICATION

Basic Considerations for Loudness-based Analysis of Room Impulse Responses

Influence of timbre, presence/absence of tonal hierarchy and musical training on the perception of musical tension and relaxation schemas

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

Acoustical Noise Problems in Production Test of Electro Acoustical Units and Electronic Cabinets

Perceptual differences between cellos PERCEPTUAL DIFFERENCES BETWEEN CELLOS: A SUBJECTIVE/OBJECTIVE STUDY

Hybrid active noise barrier with sound masking

Musicians Adjustment of Performance to Room Acoustics, Part III: Understanding the Variations in Musical Expressions

A perceptual study on face design for Moe characters in Cool Japan contents

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

The acoustics of the Concert Hall and the Chinese Theatre in the Beijing National Grand Theatre of China

Good playing practice when drumming: Influence of tempo on timing and preparatory movements for healthy and dystonic players

On the strike note of bells

Soundscape mapping in urban contexts using GIS techniques

Effect of room acoustic conditions on masking efficiency

TitleVocal Shimmer of the Laryngeal Poly. Citation 音声科学研究 = Studia phonologica (1977),

Chapter Two: Long-Term Memory for Timbre

LCD and Plasma display technologies are promising solutions for large-format

Perception of sound quality of product sounds A subjective study using a semantic differential

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

We realize that this is really small, if we consider that the atmospheric pressure 2 is

Perceptual and physical evaluation of differences among a large panel of loudspeakers

USING PULSE REFLECTOMETRY TO COMPARE THE EVOLUTION OF THE CORNET AND THE TRUMPET IN THE 19TH AND 20TH CENTURIES

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF)

Effect of task constraints on the perceptual. evaluation of violins

A comparative case study of indoor soundscape approach on objective analyses and subjective evaluations of libraries

Transcription:

The 33 rd International Congress and Exposition on Noise Control Engineering Predicting annoyance judgments from psychoacoustic metrics: Identifiable versus neutralized sounds W. Ellermeier a, A. Zeitler a and H. Fastl b a Sound Quality Research Unit, Department of Acoustics, Aalborg University, Fredrik Bajers Vej 7 B-5, DK-9220 Aalborg Ø, Denmark b AG Technische Akustik, Mensch-Maschine-Kommunikation, TU München, Germany a [we;az]@acoustics.dk; b fastl@mmk.ei.tum.de Abstract [267] It is common practice to predict perceived noise annoyance by means of regression models using instrumental psychoacoustic metrics as predictors. The validity of this approach has been criticized for not taking into account non-sensory variables such as the meaning of the sound. The present study investigates to which extent judgments of annoyance reflect sensory attributes in terms of psychoacoustic metrics as opposed to cognitive and emotional variables related to the sound source. A new signal-processing method which substantially reduces the identifiability of sound sources was applied to a set of 40 environmental and product sounds. In the listening experiment, two independent groups of participants (n = 25 each) provided annoyance judgments of either the original or neutralized version of the sounds using the method of category-subdivision scaling. In the second part of the experiment, the participants rated the affective meaning of the sounds on a concept-specific semantic differential. Instrumental analyses of the sounds included the calculation of psychoacoustic metrics of loudness, sharpness, roughness, fluctuation strength, and tonal prominence which were entered into regression models to predict the outcome of the listening tests. It turned out that while instrumental metrics fared well in predicting overall annoyance, they did not account for the discrepancies in judgments of original versus neutralized sounds, suggesting that these actually reflect non-sensory effects mediated by the meaning of the sound. 1 INTRODUCTION Psychoacoustic metrics (indices of loudness, sharpness, roughness, and the like) when properly computed, and combined, usually go a long way in predicting the annoyance reactions produced by environmental sounds. They cannot, however, account for non-sensory influences entering into the judgment of a sound. These might include the attitude towards the source, effects of familiarity, preferences, and user expectations about prototypical sounds. Such non-auditory influences on psychoacoustical judgments are sometimes summarized as effects of the meaning of the sound [1, 2]. Due to Fastl s recent proposal of a signal-processing algorithm [3] that modifies the acoustic properties of a given sound so that it is very likely to become unrecognizable (and thus meaningless in the sense discussed), we now have a handle on quantifying the relative contribution of acoustic and non-acoustic factors to the judgments made in a listening test. The advantage of Fastl s method over other alternatives (such as filling the temporal envelope of 1/6

the original sound with broadband noise) is that it takes both temporal and spectral properties into account, and is designed to preserve the temporal loudness pattern of the original. That is accomplished by first subjecting the sound to a Fourier time transform (FTT), then applying some spectral broadening to the elements of the FTT pattern, and subsequently re-synthesizing the sound by an inverse FTT [3]. In an earlier report [4], we focused on the effects of this neutralization procedure on loudness judgments, showing that while identifiability was greatly reduced (from 93 to 13%), the effects on loudness were relatively small, reaching significance only for three in a sample of 40 sounds. In the present report, we will analyze the effects of the neutralization procedure on ratings of annoyance. Clearly, while the procedure guarantees loudness (as measured by appropriate metrics) to be unchanged, when investigating annoyance ratings of original and neutralized sounds, both acoustical differences (changes in roughness, for example due to the spectral broadening, see [5]), and differences in meaning will be confounded. To disentangle these effects, in a first step, annoyance ratings of both kinds of sounds will be obtained from two independent groups of subjects. Subsequently, the results will be related to (a) psychoacoustic metrics capturing the sensory effects of the sounds, and to (b) semantic-differential ratings capturing their connotative meaning. 2 METHOD 2.1 Participants A total of 50 students at Aalborg University between 19 and 31 years of age participated in the experiment. They were audiometrically screened with the requirement that their pure-tone thresholds did not exceed the normal curve by more than 20 db in the frequency range from 0.25 to 8 khz. Subsequently, half of the participants were randomly assigned to judge the annoyance of the original, half to judge the annoyance of the neutralized sounds. 2.2 Apparatus and Stimuli The original sounds were recorded using a Brüel & Kjær (Portable PULSE 3560 C) frontend connected to a (mono) microphone (Brüel & Kjær type 4165 or 4179) placed at appropriate distances from 0.3 to 7 m from the source. The files were converted to 16-bit, 44.1 khz format to be played from a regular (RME Digi96 Pro) sound card the output of which was amplified (Behringer HA 4400) before being presented diotically to the subjects listening in a doublewalled sound-attenuating chamber via headphones (Beyerdynamic DT 990). Fourty sounds were selected for the experiment to be highly identifiable in the original condition: Most of them were non-stationary everyday noises (e.g. toilet flush, door closing, scissors, car passing), about a third of the sounds consisted of product sounds of electrical devices (e.g. hairdryer, kitchen mixer, razor) recorded in their typical use. The sounds varied in duration from 0.7 to 5 s, and had overall sound-pressure levels between 30 and 80 db SPL. The 40 recorded sounds were processed using the algorithm proposed by Fastl [3] in order to obtain 40 neutralized sounds having identical loudness-time functions. 2.3 Procedure All participants performed three tasks in the following order: (1) a scaling experiment (loudness or annoyance), (2) an identification task, and (3) a semantic-differential rating of all sounds. For the annoyance scaling task reported in this paper, a category subdivision procedure (CS, see [6]) was used: Subjects were asked to judge each sound on a combined verbal-numerical category 2/6

scale that consisted of five verbal categories which were further subdivided into ten steps and labelled with the Danish equivalents of very slightly annoying (1-10), slightly annoying (11-20), medium (21-30), strongly annoying (31-40) and very strongly annoying (41-50). The endpoints of the resulting 50-point scale were verbally anchored to denote not at all annoying (0) and unbearably annoying (beyond 50). After a short practice run, each subject judged the sounds once in a random order. In the subsequent identification experiment, the 40 recorded (resp. neutralized) sounds were played again in a random sequence, and the subject was asked to identify the source by providing both a noun and a verb (e.g. motor - idling ). During a second session, subjects judged the same sounds using a semantic differential consisting of 12 bipolar adjective scales. 3 RESULTS In the present report we will focus on the annoyance scaling data, and their relation to instrumental psychoacoustic metrics. Preliminary results on loudness scaling, and on the outcome of the semantic differential measures have been reported elsewhere [4, 7]. 50 3.1 Annoyance scaling ANNOYANCE JUDGMENTS [CS scale] 40 30 20 10 0. bell tap glass coin toilet door flush coffee maker 5 10 15 20 25 30 35 40 SOUNDS [ordered by scale values in the neutralized condition] Figure 1: Annoyance scale values (plus/minus standard errors of the means) of the 40 test sounds as judged by 25 participants in their neutralized (filled circles) and by a different sample of 25 participants in their original version (open diamonds). Statistically significant discrepancies are marked by vertical arrows. The annoyance scaling data were averaged across the 25 subjects in each group, and are displayed in Figure 1 with the sound samples (along the abscissa) being arranged in ascending 3/6

order according to the mean annoyance produced by the neutralized sounds (filled circles), so that the judgments of the original, identifiable sounds (open diamonds) appear as deviations from the neutral curve. It is evident that these deviations are substantial, which is confirmed by a two-factor (sound by processing), mixed analysis of variance: In addition to the highly significant (but trivial) main effect of the 40 sounds, there is a significant main effect of processing (original vs. neutral), indicating that annoyance ratings of the neutralized sounds were higher on the average than those of the originals: F (1, 48) = 4.99; p < 0.03. Furthermore, there is a highly significant (sound by processing) interaction, showing that the effects of the neutralization differ significantly between sounds: F (16.25, 780.01) = 7.92; p < 0.001. In order to determine, which of the discrepancies between judgments of the original and neutralized sounds were producing these effects, post-hoc tests were performed using a Bonferroni correction to adjust for chance outcomes due to multiple testing. At the test-wise α-level of 0.00128 thus obtained, the seven sounds marked by arrows in Figure 1 were identified as producing significant differences in annoyance in the two experimental conditions investigated: coffee maker (M neutr M orig = 9.36 scale units), door locking (13.08 scale units), bouncing coin (11.52), toilet flushing (13.6), water running from a tap (8.88), bicycle bell (12.04), champaign glass (13.6). Note that for all of these sounds annoyance judgments of the originals were lower than than those of the neutralized version. 3.2 Instrumental analyses In the following, instrumental analyses of the 40 sounds and their neutralized counterparts were performed in order to explore whether changes in psychoacoustic metrics can account for the outcome of the annoyance scaling experiment. All analyses were performed using a commercially available psychoacoustic analysis program (Brüel & Kjaer Sound Quality type 7698, version 3.4.0). Sound-quality changes due to the neutralization Since the parallel loudness scaling study of original and neutralized sounds [4] had shown no overall loudness differences due to processing (M orig = 28.62; M neutr = 28.38), we investigated whether other sound-quality metrics might be affected. As may be seen in Table 1, on the average the sounds slightly increased in fluctuation strength, and roughness, and decreased somewhat in sharpness. These observations are in close agreement with Fastl s [5] earlier analysis of a long-duration traffic noise recording. The most striking effect, however, is that - due to the spectral broadening - the sounds loose almost all of their tonal content (as indicated by the prominence ratio statistics in the last line of Table 1). In fact, for most of the neutralized sounds, no tonal components could be detected. Analysis of the differences between original, and neutralized sounds In a second step, psychoacoustic metrics were explored with respect to their potential in accounting for the differences between neutralized and original sounds (evident in Figure 1). To that effect, five sound quality metrics (the fifth percentile of non-stationary loudness N 5, and the four parameters listed in Table 1) were entered into a multiple-regression equation with the difference in annoyance ratings between neutralized and original sounds being the criterion (the value to be predicted). As a result, all sound-quality metrics, except for the prominence ratio dropped out as non-significant, and the latter accounted for less than 12 percent of the variance (R 2 adj = 0.115) in the differences to be predicted. 4/6

Table 1: Changes in psychoacoustic metrics due to neutralization. Stimuli original neutralized Roughness [asper] 0.715 0.795 Fluctuation Str. [vacil] 2.070 2.315 Sharpness [acum] 3.090 2.875 Prominence ratio [db] 1.985-5.120 Note. Standard measures of roughness, fluctution strength, and the prominence ratio (substituting a threshold value of -6 db when no tonal component was detected) were computed for each sound, as well as the 50th percentile of Aures sharpness. The table entries are median values of these metrics across the 40 sounds analysed. Modelling overall annoyance If psychoacoustic metrics do not do well in accounting for the differences in annoyance ratings due to processing, they might nevertheless be well suited in predicting overall annoyance in both kinds of sounds (originals, and neutralized ones). To explore this, the measured soundquality indices of all 40 neutralized sounds were entered into a multiple linear regression. It turned out, that a combination of loudness (N 5 ), sharpness (S 50 ), and roughness (R) predicted the overall annoyance ratings (category scaling of annoyance, CSA) fairly well (R 2 adj = 0.856), accounting for nearly 86 percent of the variance: CSA = 8.07 + 0.563 N 5 + 3.022 S 50 + 2.175 R (1) Using the same three metrics (while allowing for different coefficients) to predict the annoyance of the original sounds, reduced the variance accounted for by 15%: R 2 adj = 0.704. But even allowing for different metrics in the regression equation for the original sounds did not improve the situation: The best model (obtained when substituting the prominence ratio for sharpness) did only slightly better, R 2 adj = 0.731. 4 CONCLUSIONS Direct scaling of the annoyance of well-recognizable sounds, and of their neutralized [3, 5] counterparts yields substantial differences. To explore, whether these differences are due to the acoustical changes inherent in the neutralization procedure, a number of analyses using instrumental sound quality metrics were performed, yielding three major conclusions: (1) The changes in psychoacoustical metrics due to the procedure are minor, except for a reduction in tonalness for those sounds having tonal components. (2) The differences in the directly scaled annoyance between original and neutralized sounds cannot be accounted for by any of the instrumental metrics explored. (3) Annoyance ratings of both the neutralized, and the original sounds, can be accounted for by fairly similar combination metrics, but the model for the original sounds accounts for substantially less of the variance. The fact that the original sounds are less well predicted by psychoacoustic metrics suggests that the additional variance is caused by non-acoustic factors related to their identifiability. These results suggest that the differences observed in the scaling of original vs. neutralized sounds are indeed due to differences in meaning, i.e. to non-acoustical factors. This conclusion is also supported by preliminary analyses of the Semantic Differential ratings [7], supposedly 5/6

measuring the connotative associations elicited by the sounds. These ratings exhibit much higher correlations with the difference scores than do the instrumental metrics. Further analyses comparing both sets of predictors are under way. ACKNOWLEDGEMENTS This investigation was carried out while the first two authors were with the Sound Quality Research Unit (SQRU) at the Department of Acoustics, Aalborg University. This unit receives financial support from Bang & Olufsen, Brüel & Kjær, and Delta Acoustics & Vibration, as well as from the Danish National Agency for Industry and Trade (EFS), and the Danish Technical Research Council (STVF). The authors would like to thank Karin Zimmer for performing the regression analyses, and Signe Kristensen and Pelle Coltau for their conscientious assistance in helping with collecting and coding the data. Alfred Zeitler is now with the BMW sound engineering department: alfred.zeitler@bmw.de. REFERENCES [1] E. Zwicker, H. Fastl, Psychoacoustics. Facts and models, 2nd edition, Springer, Berlin 1999. [2] U. Jekosch, Meaning in the context of sound quality assessment, Acustica - acta acustica 85(5), pp. 681 684, (1999). [3] H. Fastl, Neutralizing the meaning of sound for sound quality evaluations, in Proceedings 17th International Congress of Acoustics (ICA 2001), Rome, Italy 2001, (CD-ROM). [4] W. Ellermeier, A. Zeitler and H. Fastl, Impact of source identifiability on perceived loudness, in Proceedings of the 18th International Congress on Acoustics (ICA 2004), Kyoto, Japan, 2004, Vol. II, pp. 1491-1494. [5] H. Fastl, Features of neutralized sounds for long term evaluation, in Proceedings Forum Acusticum 2002, Sevilla, Spain, 2002, (CD-ROM). [6] J. Hellbrück, Category subdivision scaling - A powerful tool in audiometry and in noise assessment, In H. Fastl et al. (Eds.), Recent trends in hearing research. Festschrift for Seiichiro Namba, BIS, Oldenburg 1996, pp. 317-336. [7] A. Zeitler, W. Ellermeier and H. Fastl, Significance of meaning in sound quality evaluation, in Proceedings SFA/DAGA 2004, Strassbourg, France, 2004. 6/6