Loudness of transmitted speech signals for SWB and FB applications

Similar documents
Loudness and Sharpness Calculation

Next Generation Software Solution for Sound Engineering

ETSI TR V1.1.1 ( )

Binaural Measurement, Analysis and Playback

ETSI TR V1.1.1 ( )

Psychoacoustics. lecturer:

IP Telephony and Some Factors that Influence Speech Quality

Soundscape and Psychoacoustics Using the resources for environmental noise protection. Standards in Psychoacoustics

Final draft ETSI EG V1.1.1 ( )

Loudness of pink noise and stationary technical sounds

Psychoacoustic Evaluation of Fan Noise

Rhona Hellman and the Munich School of Psychoacoustics

Table 1 Pairs of sound samples used in this study Group1 Group2 Group1 Group2 Sound 2. Sound 2. Pair

3GPP TS V9.2.0 ( )

Digital Signal Processing Detailed Course Outline

Determination of Sound Quality of Refrigerant Compressors

Using the BHM binaural head microphone

Modeling sound quality from psychoacoustic measures

Calibration of auralisation presentations through loudspeakers

OPERA APPLICATION NOTES (1)

ETSI TS V9.1.0 ( ) Technical Specification

Experiments on tone adjustments

3GPP TS V4.3.0 ( )

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

A few white papers on various. Digital Signal Processing algorithms. used in the DAC501 / DAC502 units

Noise evaluation based on loudness-perception characteristics of older adults

Proceedings of Meetings on Acoustics

TO HONOR STEVENS AND REPEAL HIS LAW (FOR THE AUDITORY STSTEM)

Acoustic Echo Canceling: Echo Equality Index

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

Effect of room acoustic conditions on masking efficiency

Progress in calculating tonality of technical sounds

ETSI TS V4.0.0 ( )

1 Introduction to PSQM

Analysing Room Impulse Responses with Psychoacoustical Algorithms: A Preliminary Study

ESG Engineering Services Group

The quality of potato chip sounds and crispness impression

Measuring Radio Network Performance

Vibration Measurement and Analysis

MUSI-6201 Computational Music Analysis

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam

DIFFERENCES IN TRAFFIC NOISE MEASUREMENTS WITH SLM AND BINAURAL RECORDING HEAD

Proceedings of Meetings on Acoustics

Study on the Sound Quality Objective Evaluation of High Speed Train's. Door Closing Sound

Calculation of Unsteady Loudness in the Presence of Gaps Through Application of the Multiple Look Theory

A Performance Ranking of. DBK Associates and Labs Bloomington, IN (AES Paper Given Nov. 2010)

A SEMANTIC DIFFERENTIAL STUDY OF LOW AMPLITUDE SUPERSONIC AIRCRAFT NOISE AND OTHER TRANSIENT SOUNDS

Basic Considerations for Loudness-based Analysis of Room Impulse Responses

Lecture 2 Video Formation and Representation

INTERNATIONAL TELECOMMUNICATION UNION GENERAL ASPECTS OF DIGITAL TRANSMISSION SYSTEMS PULSE CODE MODULATION (PCM) OF VOICE FREQUENCIES

Overview of ITU-R BS.1534 (The MUSHRA Method)

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

Testing Speech Quality of Mobile Phones in a Live Network Application Note

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2

Proposed pads and levels are optimised for the long-term "all-digital" situation;

Audio Feature Extraction for Corpus Analysis

Interior and Motorbay sound quality evaluation of full electric and hybrid-electric vehicles based on psychoacoustics

Speech Quality Testing Solution (MOS) Whitepaper

Predicting Performance of PESQ in Case of Single Frame Losses

Getting Started with the LabVIEW Sound and Vibration Toolkit

Absolute Perceived Loudness of Speech

TH Premium IF 19. Technical Data. 60 db / 125 db SPL (ear simulator) 50 db / 113 db SPL (2 ccm coupler)

Temporal summation of loudness as a function of frequency and temporal pattern

Methods to measure stage acoustic parameters: overview and future research

Orbital Ka-ISO. Ext Ref Ka LNB with integrated isolator. Orbital Research Ltd Marine Drive, White Rock, BC. Canada V4B 1A9

Put your sound where it belongs: Numerical optimization of sound systems. Stefan Feistel, Bruce C. Olson, Ana M. Jaramillo AFMG Technologies GmbH

Sound design strategy for enhancing subjective preference of EV interior sound

PEVQ ADVANCED PERCEPTUAL EVALUATION OF VIDEO QUALITY. OPTICOM GmbH Naegelsbachstrasse Erlangen GERMANY

We realize that this is really small, if we consider that the atmospheric pressure 2 is

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Operation Manual OPERATION MANUAL ISL. Precision True Peak Limiter NUGEN Audio. Contents

Quarterly Progress and Status Report. An attempt to predict the masking effect of vowel spectra

ADVANCED PROCEDURES FOR PSYCHOACOUSTIC NOISE EVALUATION

Characterization of sound quality of impulsive sounds using loudness based metric

Collection of Setups for Measurements with the R&S UPV and R&S UPP Audio Analyzers. Application Note. Products:

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1

Predicting annoyance judgments from psychoacoustic metrics: Identifiable versus neutralized sounds

Sound Quality Analysis of Electric Parking Brake

Hearing Aids for Tinnitus Patients: It s Not Just About Speech. Steve Benton, Au.D. VA Medical Center Decatur, GA

TR 038 SUBJECTIVE EVALUATION OF HYBRID LOG GAMMA (HLG) FOR HDR AND SDR DISTRIBUTION

Acoustical Testing 1

Orbital 694XA Series. Ka BAND EXTERNAL REFERENCE LNB with rear anchor posts. Wide range of Frequencies and Bandwidths LNB 1855R 1000 XA-WN60

Test Automation Tool for POLQA and PESQ Speech Quality Tests Application Note

Final Report. Executive Summary

Using Extra Loudspeakers and Sound Reinforcement

2017 Product Portfolio

HELM: High Efficiency Loudness Model for Broadcast Content

2. Measurements of the sound levels of CMs as well as those of the programs

Training. Center Training Courses

What is the minimum sound pressure level iphone or ipad can measure? What is the maximum sound pressure level iphone or ipad can measure?

Analog Code MicroPlug Manual. Attacker

TEST REPORT T&M Research Products, Inc. Model W11K20-3.5s Ω by M. E. Gruchalla PE, December 6, 2008

Lesson 2.2: Digitizing and Packetizing Voice. Optimizing Converged Cisco Networks (ONT) Module 2: Cisco VoIP Implementations

Precedence-based speech segregation in a virtual auditory environment

The Cocktail Party Effect. Binaural Masking. The Precedence Effect. Music 175: Time and Space

from ocean to cloud ADAPTING THE C&A PROCESS FOR COHERENT TECHNOLOGY

Diamond Cut Productions / Application Notes AN-2

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

EarStudio: Analog volume control. The importance of the analog volume control

If you want to get an official version of this User Network Interface Specification, please order it by sending your request to:

Transcription:

Loudness of transmitted speech signals for SWB and FB applications Challenges, auditory evaluation and proposals for handset and hands-free scenarios Jan Reimes HEAD acoustics GmbH Sophia Antipolis, 2017-05-10

Introduction (1/2) Loudness of received speech signal - most simple but important quality parameter of a communication device! Too loud: annoying, may cause hearing damage! Too quiet: impact on intelligibility (and other aspects of conversational quality ) Several measurement standards provide requirements for comfortable listening level Loudness == Level? Psycho-acoustics! 2

L/dB L/dB Introduction (2/2) For NB and WB terminals, so-called loudness ratings (LR) are used to evaluate transmission characteristics (e.g. ITU-T P.79) Basic concept of LR: calculate attenuation (in db) to achieve same perceived loudness compared to intermediate reference system (IRS) 300 500 f/hz 2000 4000 Reference System R f 15 13 11 9 7 5 15 13 D(f) = H f R f 11 LR ~ w f D f 300 500 f/hz 2000 4000 Device under test H f f 9 7 5 Weighted sum of transfer function provides attenuation versus IRS Technical measure; no information about absolute loudness Addresses mainly linear distortions Method not (yet?) defined for SWB/FB applications 3

Recent work on loudness Standardization: ITU-T SG12 / Q5 launched new work item P.Loudness Goal: evaluate and/or modify existing loudness models originated from psycho-acoustic domain Several standardized models already exists: Zwicker approach (DIN 45631/A1, ISO 532-1) Moore/Glasberg approach (ANSI S3.4-2007, ISO 532-2) Current release candidate model for P.Loudness available Based on very basic auditory experiments, no real terminals Loudness model is based on stationary loudness (ANSI S3.4) Modifications are fitted to auditory results Two modes for handset/hands-free are required Not applicable on artificial head recordings (handset) No binaural aspects considered 4

Auditory evaluation (1/3) Large test corpus based on binaural recordings of terminals (3G, 4G, VoIP) and realistic simulations (compression, codecs, loudspeaker distortions) Stimulus Binaural recording 8 German test sentences (ITU-T P.501) as source material Bandwidth from NB (up to 3.4 khz) to FB (up to 20 khz) Level range between 40 and 90 db SPL 52 conditions per mode (handset and hands-free mode) 4 sentences each 208 test stimuli per mode 5

Auditory evaluation (2/3) Absolute / categorial loudness assessment on 25-point scale 7 anchor definitions for better orientation Already used in previous studies 20 normal-hearing test subjects per mode Hearing-adequate playback of binaural recordings in listening lab 6

Auditory evaluation (3/3) Prior to evaluation: determination of individual loudness functions per test subject with a reference sound Principle of reference sound: should cause similar loudness excitation as speech, but independent of language, content, talker, Three different reference sounds were evaluated: 1 khz Sine tone (refers to definition of sone/phon) 1 Bark noise at 1 khz (used in initial P.Loudness experiments) 3 Bark noise at 1 khz (less tonal, smooth ) 7

Results of loudness models (1/5) Several state-of-the-art loudness models are evaluated: Zwicker: ISO 532-1 Moore/Glasberg: ANSI S3.4 (stationary), version 2002 & 2016, LT/ST smoothing P.Loudness candidate (stationary) Non-stationary models provide loudness vs. time curve, several single value calculations are possible: Average N5 percentile (peak-oriented) LL(p) (used in recent work) Auditory results of test stimuli provide values on point-scale Comparison to loudness models? 8

Results of loudness models (2/5) Proposed procedure for comparison between loudness models (results in phon/sone) and auditory test results (in points) Select reference signal (Sine, 1 Bark noise, 3 Bark noise, ) Calculate inverse of loudness functions with mapping function Transform auditory results in points to level in db ERL Example: 15.0 point in listening test refers to 75 db ERL (same loudness as 3 Bark noise reference signal at 75 db SPL ) 9

Results of loudness models (2/5) Proposed procedure for comparison between loudness models (results in phon/sone) and auditory test results (in points) Select loudness model and single value aggregat Calculate loudness (in sone or phon) for selected reference sound for a certain level range (e.g. from 40 to 90 db SPL ) Calculate mapping function between sone/phon and level Run loudness model on signal-under-test Transform output from sone/phon to level in db ERL with previously determined mapping function 10

Results of loudness models (3/5) Large amount of combinations possible (models, single values, reference signal) Evaluation of prediction performance by RMSE* Considering uncertainty of auditory data Baseline performance: auditory results vs. active speech level (ASL) acc. to ITU-T P.56 models should perform better! ASL/Sinus (HS) ASL/Sinus (HF) 11

Results of loudness models (4/5) Selected results per loudness model handset mode ISO 532-1/Avg./Sinus TVL2016-LT/Avg./3 Bark P.Loudness/3 Bark TVL2002-ST/Avg./1 Bark 12

Results of loudness models (5/5) Selected results per loudness model hands-free mode ISO 532-1/Avg./Sinus TVL2016-LT/LL(p)/3 Bark P.Loudness/3 Bark TVL2002-LT/N5/3 Bark 13

Summary & Conclusions Loudness assessment is a challenging task! SWB/FB terminals are commercially available but currently no instrumental loudness assessment test methods available Large auditory database and listening tests were conducted Considering state-of-the-art terminals and realistic simulations Evaluation of loudness models no clear winner : ISO 532-1 very accurate for HS & HF single model for both TVL2016-LT slightly worse, but considers binaural inhibition P.Loudness candidate also performs adequately, but New loudness model not necessarily needed? Finalize P.Loudness work item in standardization Specify application of loudness models in measurement standards 14

Jan Reimes Research & Standardization HEAD acoustics info@head-acoustics.de www.head-acoustics.de Copyright HEAD acoustics GmbH