Psychoacoustics lecturer: stephan.werner@tu-ilmenau.de

Block Diagram of a Perceptual Audio Encoder
- loudness
- critical bands
- masking: frequency domain, time domain
- binaural cues (overview)
Source: Brandenburg, lecture: Dig. Audiosignalverarbeitung

Structure of the Human Ear

Structure of the Human Ear
[Figure labels: outer ear (pinna, ear canal, ear drum), middle ear (ossicles, Eustachian tube), inner ear (semicircular canals, cochlea with organ of Corti)]
Source: Ars Auditus; http://www.dasp.uni-wuppertal.de/index.php?id=57, 2010

Structure of the Human Ear - Cochlea
Left picture:
- cochlea of a 5-month-old fetus
- 2 ½ coils, 35 mm long
- dimensions are relatively constant, 0.5 mm
Source: Cochlee, http://www.cochlee.org, 2010
- blue arrow: oval window
- yellow arrow: round window
Source: Ars Auditus; http://www.dasp.uni-wuppertal.de/index.php?id=57, 2010

Structure of the Human Ear - Organ of Corti
- organ of Corti of a guinea pig
- white bar = 20 µm
- figure labels: outer hair cells (OHC), pumping OHC, inner hair cells (IHC)
Source: Cochlee, http://www.cochlee.org, 2010
- ~3500 IHC and ~12000 OHC in humans
Source: David C. Mountain, Boston University, 146th ASA Meeting

Preprocessing of Sound in the Peripheral System
- frequency selectivity of the basilar membrane
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Preprocessing of Sound in the Peripheral System
- frequency selectivity of the basilar membrane (simulation)
[Figure: basilar membrane amplitude over frequency channels (Bark scale), 14 Hz to 21 kHz]
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Preprocessing of Sound in the Peripheral System
- frequency selectivity of the basilar membrane (simulation)
[Figure: amplitude in m over frequency channels (Bark scale)]
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Information Processing in the Auditory System
- basilar membrane as a filter bank
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Information Processing in the Auditory System
[Figure panels: amplitude-time representation, basilar membrane oscillation, neurotransmitter concentration]

Sound Perception

Frequency and Level Range of Human Hearing
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Threshold in Quiet or the Absolute Threshold
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Loudness - Loudness Level
- Loudness N: psychological concept describing the magnitude of an auditory sensation, the loudness of a sound (measured in sone)
- The loudness level L of a sound is measured in phon
- L of a sound is the sound pressure level (in dB) of a 1 kHz tone that is as loud as the sound
Fig: Fletcher, Speech and Hearing in Communication, 1953.

Loudness - Equal-Loudness Level Contours
Equal loudness contours of pure tones in a free sound field. The parameter is expressed in loudness level, L_N, and loudness, N (curves labelled N = 64, 16, 4, 1, and 0.15 sone).
Links to measure the sensitivity at different frequencies:
http://www.phys.unsw.edu.au/jw/hearing.html
http://www.phys.unsw.edu.au/music/db/loudness.html, 2010
Fig: Suzuki et al., Precise and Full-range Determination of Two-dimensional Equal Loudness Contours, 2003.

Loudness - Loudness Scale
- aim: doubling the number of units on this scale means that the magnitude of the sensation is doubled
- wanted: the relation G(L) between the loudness level L and the loudness N on the new scale
- one potential experiment: listen to a sound with loudness level L_1, then adjust the same sound until it is perceived as twice as loud (N_2 = 2 x N_1) and note the resulting level L_2

Loudness - Loudness Scale
- Example 1: L = 40 phon, N = 1 sone; 2 x loudness: N = 2 sone, L = 40 + 10 = 50 phon
- Example 2: L = 40 phon, N = 1 sone; ½ of loudness: N = 0.5 sone, L = 40 - 7.8 = 32.2 phon
Fig: Fletcher, Speech and Hearing in Communication, 1953.

Loudness - Loudness Scale
- for values of L above 40 phon: N = 2^((L - 40) / 10)
Fig: Fletcher, Speech and Hearing in Communication, 1953.
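
The phon-to-sone relation can be tried out numerically. The following is a minimal sketch; the function names are hypothetical, and the formula is applied only from 40 phon upward, because the slide states it only for that range (Example 2 shows that below 40 phon the curve is steeper: 0.5 sone corresponds to 32.2 phon, not 30 phon).

```python
import math


def phon_to_sone(level_phon: float) -> float:
    """Loudness N in sone for a loudness level L in phon, N = 2^((L-40)/10)."""
    if level_phon < 40.0:
        # The slide gives the formula only for L of 40 phon and above.
        raise ValueError("formula is only stated for L >= 40 phon")
    return 2.0 ** ((level_phon - 40.0) / 10.0)


def sone_to_phon(loudness_sone: float) -> float:
    """Inverse relation, L = 40 + 10 * log2(N), valid for N >= 1 sone."""
    if loudness_sone < 1.0:
        raise ValueError("inverse is only valid for N >= 1 sone")
    return 40.0 + 10.0 * math.log2(loudness_sone)


for level in (40, 50, 60, 80):
    print(f"{level} phon -> {phon_to_sone(level):g} sone")
```

Every additional 10 phon doubles the loudness in sone, which matches Example 1 (40 phon to 50 phon doubles N from 1 to 2 sone).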

Loudness contours for loudspeaker and binaural headphone presentation.
[Figure: loudness, curves for headphone and loudspeaker]
Fig: Master Thesis, F. Jürgens, TU Ilmenau, 2012.

Loudness - Loudness Scale
Fig: en.wikipedia.org/wiki/sone, 2010.

Sound Example
- Example 1: equal-amplitude tones at 40 Hz, 100 Hz, 4000 Hz, and 12000 Hz
- Example 2: equal-amplitude sweep from 0 to 16000 Hz

Critical Bands

Frequency Grouping in Human Hearing
Different interpretations that produce the same segmentation:
- constant distance in the cochlea
- tones below the threshold in quiet: their intensities add up within a critical band, so that together they become audible
- tones in a critical band above the threshold in quiet: their energy adds up
Formula for the width of the critical bands:
- for frequencies < 500 Hz: constant width of 100 Hz
- for frequencies > 500 Hz: 0.2 * frequency
Source: Brandenburg, lecture: Dig. Audiosignalverarbeitung
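
The two-part rule for the critical bandwidth can be written as a small function. This is a minimal sketch with a hypothetical function name; the slide does not say which branch applies at exactly 500 Hz, but both give 100 Hz there, so the choice does not matter.

```python
def critical_bandwidth_hz(frequency_hz: float) -> float:
    """Approximate critical bandwidth in Hz using the two-part rule."""
    if frequency_hz < 0.0:
        raise ValueError("frequency must be non-negative")
    if frequency_hz < 500.0:
        return 100.0               # constant width below 500 Hz
    return 0.2 * frequency_hz      # 20 % of the frequency from 500 Hz upward


for f in (100, 500, 1000, 4000, 10000):
    print(f"{f} Hz -> critical bandwidth of about {critical_bandwidth_hz(f):g} Hz")
```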

Frequency Grouping - The Critical Bands
[Figure: critical bandwidth as a function of frequency. Approximations for low and high frequency ranges are indicated by broken lines.]
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Excursus - Critical Bands and Loudness
Spectral effects - influence of frequency separation:
- measure the loudness level (or the level of the equally loud 1 kHz tone) of two tones while varying their frequency separation
Fig: Zwicker, Fastl Psychoacoustics - Facts and Models, 2nd Edition, 1999.

Excursus - Critical Bands and Loudness
Spectral effects - influence of bandwidth:
- the bandwidth of the signal plays an important role
- the sound level also influences the loudness level
- the total sound intensity (SPL) has to be kept constant to measure loudness as a function of bandwidth
[Figure label: critical bandwidth]
Fig: Zwicker, Fastl Psychoacoustics - Facts and Models, 2nd Edition, 1999.

Critical Bands: Bark Scale
- the critical-band concept is used in many models and hypotheses
- a unit was defined, leading to the so-called critical-band rate scale
- the scale ranges from 0 to 24, unit: Bark
- the relation between the critical-band rate z and the frequency f is important for understanding many characteristics of the human ear
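
The slide does not give the relation between z and f explicitly. As an illustration only, the sketch below uses a widely quoted analytic approximation attributed to Zwicker and Terhardt, z = 13 arctan(0.00076 f) + 3.5 arctan((f / 7500)^2); this formula and the function name are assumptions brought in from outside the slides.

```python
import math


def hz_to_bark(frequency_hz: float) -> float:
    """Critical-band rate z in Bark for a frequency f in Hz.

    Assumed approximation (not from the slides):
    z = 13 * atan(0.00076 * f) + 3.5 * atan((f / 7500)^2)
    """
    f = float(frequency_hz)
    return 13.0 * math.atan(0.00076 * f) + 3.5 * math.atan((f / 7500.0) ** 2)


for f in (100, 500, 1000, 4000, 15500):
    print(f"{f} Hz -> {hz_to_bark(f):.2f} Bark")
```

With this approximation, 24 Bark is reached at roughly 15.5 kHz, which is consistent with the 0 to 24 range of the scale.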

Critical Band Rate and Threshold in Quiet
Source: U. Zölzer, Digitale Audiosignalverarbeitung

Signal Power in Critical Bands
Source: U. Zölzer, Digitale Audiosignalverarbeitung

Masking

Masking
- data compression: exploiting perception in critical bands with reference to the threshold in quiet alone is not enough
- the basis for further compression is the masking effects described by Zwicker, Fletcher, Fastl, Feldtkeller and others

Masking of Pure Tones by Noise - Broad-Band Noise
- broad-band noise: white noise from 20 Hz to 20 kHz
- figure: masking threshold for pure tones masked by broad-band noise of different levels
- uniform masking noise (UMN): obtained by equalizing the 10 dB per decade slope
Fig: Zwicker, Fastl Psychoacoustics - Facts and Models, 2nd Edition, 1999.

Masking of Pure Tones by Noise - Narrow-Band Noise
- narrow-band noise: noise with a bandwidth equal to or smaller than the critical bandwidth
- figure: threshold of pure tones masked by narrow-band noise for different centre frequencies
- difference between the maximum of the masked threshold and the test tone level
Fig: Zwicker, Fastl Psychoacoustics - Facts and Models, 2nd Edition, 1999.

Masking of Pure Tones by Noise - Narrow-Band Noise
- narrow-band noise: noise with a bandwidth equal to or smaller than the critical bandwidth
- figure: dependence of the masked threshold on the level of the narrow-band noise
- dips at higher levels: nonlinear effects (difference noise caused by interactions between test tone and noise)
Fig: Zwicker, Fastl Psychoacoustics - Facts and Models, 2nd Edition, 1999.

Test: Narrow-Band Noise Masking Tone
- Example 3: narrow-band noise at 1000 Hz, width 160 Hz; sine tones at 600, 800, 1000, 1200, 1400, 1600 Hz at varying levels (-80 to -20 dB)

Sound Examples: Masking with White Noise
- Example 4: 500 Hz sinusoidal tone at varying amplitude, ALONE. Levels: -40, -35, -30, -25, -20, -15, -10 dB
- Example 5: 500 Hz tone at varying amplitude with white noise. Levels: -40, -35, -30, -25, -20, -15, -10 dB. Noise level: -50 dB
- Example 6: 5000 Hz tone at varying amplitude with white noise. Levels: same as Example 5
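
To relate the dB levels of these examples to signal amplitudes, the following sketch converts each level to a linear amplitude factor and lists the tone-to-noise level difference for Example 5. It assumes that the levels are given relative to digital full scale (amplitude 1.0), which the slide does not state; the names are hypothetical.

```python
def db_to_amplitude(level_db: float) -> float:
    """Linear amplitude factor for a level in dB (assumed reference: 1.0)."""
    return 10.0 ** (level_db / 20.0)


tone_levels_db = [-40, -35, -30, -25, -20, -15, -10]  # tone levels, Examples 4 to 6
noise_level_db = -50                                  # white noise level, Example 5

# Level difference between tone and broad-band noise for each step.
tone_to_noise_db = [level - noise_level_db for level in tone_levels_db]

for level, diff in zip(tone_levels_db, tone_to_noise_db):
    print(f"tone {level} dB -> amplitude {db_to_amplitude(level):.4f}, "
          f"{diff} dB above the noise level")
```

The difference computed here refers to the total noise level, not to the noise power that falls into the critical band around the tone, so it is not by itself a prediction of audibility.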

Masking of Pure Tones by Low-Pass or High-Pass Noise
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Masking of Pure Tones by Pure Tone
- pure tone: single frequency
- figure: 1 kHz masking tone with a level of 80 dB; threshold for detection of anything
- difficulties: beats (hatching), masker and difference tone (stippling)
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Masking of Pure Tone by Complex Tones
- complex tone: fundamental tone with its harmonics
- figure: threshold of pure tones masked by a complex tone with 200 Hz fundamental frequency and nine harmonics
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Tonality (1)
Tonality index:
- noisy signal: index = 0
- tonal signal: index = 1
System theory:
- sharp spectral lines = signal is periodic = signal is predictable
Approximation:
- if the signal is predictable, then it should be periodic
- therefore we can use prediction to estimate whether a signal is tonal (by periodicity)

Tonality (2)
Source: Brandenburg, lecture: Dig. Audiosignalverarbeitung

Tone Masking
Source: U. Zölzer, Digitale Audiosignalverarbeitung

Calculating the Masking Threshold
Different masking with different maskers:
- tone as a masker: (14.5 + i) dB, where i is the frequency in Bark
- noise as a masker: 5.5 dB

Calculating the Masking Threshold
- SFM (spectral flatness measure) = 0 dB: tonality index = 0
- SFM = -60 dB: tonality index = 1
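
The two slides above can be combined into a small numerical sketch. The slides fix only the end points (SFM of 0 dB and -60 dB, offsets of 14.5 + i dB and 5.5 dB); the linear interpolation of the tonality index between those end points and the weighted mix of the two offsets are assumptions here, in the style of Johnston's perceptual model, and all names are hypothetical.

```python
import math

SFM_DB_TONAL = -60.0  # SFM value that maps to a tonality index of 1


def sfm_db(power_spectrum):
    """Spectral flatness in dB: geometric mean over arithmetic mean."""
    if not power_spectrum or any(p <= 0.0 for p in power_spectrum):
        raise ValueError("power spectrum must contain only positive values")
    n = len(power_spectrum)
    geometric_mean = math.exp(sum(math.log(p) for p in power_spectrum) / n)
    arithmetic_mean = sum(power_spectrum) / n
    return 10.0 * math.log10(geometric_mean / arithmetic_mean)


def tonality_index(power_spectrum):
    """0 for a noisy (flat) spectrum, 1 for a tonal one (assumed linear in dB)."""
    alpha = sfm_db(power_spectrum) / SFM_DB_TONAL
    return min(max(alpha, 0.0), 1.0)


def masking_offset_db(alpha, band_bark):
    """Assumed mix of the tone offset (14.5 + i) dB and the noise offset 5.5 dB."""
    return alpha * (14.5 + band_bark) + (1.0 - alpha) * 5.5


flat = [1.0] * 64                 # noise-like: all bins equal
peaky = [1e-9] * 99 + [1.0]       # tone-like: one dominant bin
print(tonality_index(flat), tonality_index(peaky))
print(masking_offset_db(tonality_index(peaky), band_bark=10.0))
```

For a purely tonal masker in band i = 10 the offset is 24.5 dB, and for a purely noisy masker it is 5.5 dB, matching the two values on the slide.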

In-Band Masking
Source: U. Zölzer, Digitale Audiosignalverarbeitung

Masking Neighboring Bands
Source: U. Zölzer, Digitale Audiosignalverarbeitung

Sound Examples
- Example 7: dynamic range, Bach organ music with 16 bits per sample
- Example 8: dynamic range, Bach organ music with 11 bits per sample
- Example 9: dynamic range, Bach organ music with 6 bits per sample
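
As a rough guide to what these three examples demonstrate, the sketch below estimates the dynamic range for each word length. It assumes uniform (linear PCM) quantization and takes the dynamic range as the ratio between full scale and one quantization step, about 6.02 dB per bit; the slide does not state how the examples were produced, and the function name is hypothetical.

```python
import math


def dynamic_range_db(bits: int) -> float:
    """Ratio of full scale to one quantization step, in dB (uniform quantizer)."""
    if bits < 1:
        raise ValueError("bits must be at least 1")
    return 20.0 * math.log10(2 ** bits)


for bits in (16, 11, 6):
    print(f"{bits} bits per sample -> about {dynamic_range_db(bits):.1f} dB")
```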

Temporal Masking Effects (1)
Source: Zwicker & Fastl, Psychoacoustics - Facts and Models

Temporal Masking Effects (2)
- Post-masking: corresponds to the expected decay in the effect of the masker
- Pre-masking: appears during the time before the masker is switched on
  - quick build-up time for loud maskers
  - slower build-up time for faint test sounds
- Frequency resolution implies blurring in time; frequency resolution in the ear implies masking in time
- Because the ear processes transitions from quiet to loud signals quickly, we get pre-echoes
- Pre-masking: 1-5 ms
- Post-masking: ~100 ms

Pre-Echo: Example without Pre-Echo

Pre-Echo: Example

Sound Examples
- Example 10: castanets, original
- Example 11: castanets, coded with a block size of 2048 samples
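
A quick calculation suggests why a block size of 2048 samples makes pre-echoes audible on castanets. The sketch assumes a sampling rate of 44.1 kHz, which the slide does not state, and compares the block duration with the 1-5 ms pre-masking window given earlier; the names are hypothetical.

```python
PRE_MASKING_MAX_MS = 5.0   # upper end of the 1-5 ms pre-masking range
SAMPLE_RATE_HZ = 44100     # assumed sampling rate, not given on the slide


def block_duration_ms(block_size: int, sample_rate_hz: float) -> float:
    """Duration of one coding block in milliseconds."""
    return 1000.0 * block_size / sample_rate_hz


duration = block_duration_ms(2048, SAMPLE_RATE_HZ)
print(f"block of 2048 samples: {duration:.1f} ms")
print(f"longer than the pre-masking window: {duration > PRE_MASKING_MAX_MS}")
```

Under this assumption one block lasts about 46 ms, so coding noise that a block-based coder spreads over the whole block can precede the attack by far more than pre-masking can cover.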

Next lecture: Quantization and Coding