Quarterly Progress and Status Report. Formant frequency tuning in singing

Similar documents
Making music with voice. Distinguished lecture, CIRMMT Jan 2009, Copyright Johan Sundberg

Kent Academic Repository

Some Phonatory and Resonatory Characteristics of the Rock, Pop, Soul, and Swedish Dance Band Styles of Singing

Quarterly Progress and Status Report. An attempt to predict the masking effect of vowel spectra

A comparison of the acoustic vowel spaces of speech and song*20

DEVELOPING THE MALE HEAD VOICE. A Paper by. Shawn T. Eaton, D.M.A.

Quarterly Progress and Status Report. X-ray study of articulation and formant frequencies in two female singers

Pitch. There is perhaps no aspect of music more important than pitch. It is notoriously

Quarterly Progress and Status Report. Replicability and accuracy of pitch patterns in professional singers

Pitch-Synchronous Spectrogram: Principles and Applications

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

EVTA SESSION HELSINKI JUNE 06 10, 2012

Welcome to Vibrationdata

The Perception of Formant Tuning in Soprano Voices

Quarterly Progress and Status Report. Acoustic analysis of three male voices of different quality

The Interplay between Glottis and Vocal Tract during the Male Passaggio

Quarterly Progress and Status Report. Musicians and nonmusicians sensitivity to differences in music performance

The role of vocal tract resonances in singing and in playing wind instruments

increase by 6 db each if the distance between them is halved. Likewise, vowels with a high first formant, such as /a/, or a high second formant, such

Quarterly Progress and Status Report. Violin timbre and the picket fence

Quarterly Progress and Status Report

Real-time magnetic resonance imaging investigation of resonance tuning in soprano singing

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 FORMANT FREQUENCY ADJUSTMENT IN BARBERSHOP QUARTET SINGING

Quarterly Progress and Status Report. Voice source characteristics in different registers in classically trained female musical theatre singers

Loudness and Pitch of Kunqu Opera 1 Li Dong, Johan Sundberg and Jiangping Kong Abstract Equivalent sound level (Leq), sound pressure level (SPL) and f

International Journal of Computer Architecture and Mobility (ISSN ) Volume 1-Issue 7, May 2013

Available online at International Journal of Current Research Vol. 9, Issue, 08, pp , August, 2017

CHAPTER 20.2 SPEECH AND MUSICAL SOUNDS

(Adapted from Chicago NATS Chapter PVA Book Discussion by Chadley Ballantyne. Answers by Ken Bozeman)

The Tone Height of Multiharmonic Sounds. Introduction

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

Analysis of the effects of signal distance on spectrograms

Quarterly Progress and Status Report. Intonation preferences for major thirds with non-beating ensemble sounds

HARMONIC ANALYSIS OF ACOUSTIC WAVES

UNIVERSITY OF DUBLIN TRINITY COLLEGE

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

Advanced Signal Processing 2

1. Introduction NCMMSC2009

VOCAL CHARACTERISTICS OF THE FOUR VOICE PARTS

Vocal-tract Influence in Trombone Performance

Singing voice synthesis in Spanish by concatenation of syllables based on the TD-PSOLA algorithm

In 2015 Ian Howell of the New England Conservatory introduced

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1

Physiological and Acoustic Characteristics of the Female Music Theatre Voice in belt and legit qualities

MUSIC ACOUSTICS. TMH/KTH Annual Report 2001

AUDITION PROCEDURES:

Vowel-pitch matching in Wagner s operas: Implications for intelligibility and ease of singing

Spectral correlates of carrying power in speech and western lyrical singing according to acoustic and phonetic factors

Concert halls conveyors of musical expressions

MELODY EXTRACTION FROM POLYPHONIC AUDIO OF WESTERN OPERA: A METHOD BASED ON DETECTION OF THE SINGER S FORMANT

Klages Philosophy of Trumpet Pedagogy

The Scale of Musical Instruments

A prototype system for rule-based expressive modifications of audio recordings

Music Source Separation

Quarterly Progress and Status Report. Perception of just noticeable time displacement of a tone presented in a metrical sequence at different tempos

Proposal for Presentation of Doctoral Essay. A Description and Application of Robert Aitken s Concept. of the Physical Flute

EE513 Audio Signals and Systems. Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

9.35 Sensation And Perception Spring 2009

WHAT IS BARBERSHOP. Life Changing Music By Denise Fly and Jane Schlinke

How We Sing: The Science Behind Our Musical Voice. Music has been an important part of culture throughout our history, and vocal

Augmentation Matrix: A Music System Derived from the Proportions of the Harmonic Series

Florida Performing Fine Arts Assessment Item Specifications for Benchmarks in Course: Chorus 5 Honors

Oskaloosa Community School District. Music. Grade Level Benchmarks

2. AN INTROSPECTION OF THE MORPHING PROCESS

Week 6 - Consonants Mark Huckvale

LOUDNESS EFFECT OF THE DIFFERENT TONES ON THE TIMBRE SUBJECTIVE PERCEPTION EXPERIMENT OF ERHU

Music Representations

Measurement of overtone frequencies of a toy piano and perception of its pitch

A METHOD OF MORPHING SPECTRAL ENVELOPES OF THE SINGING VOICE FOR USE WITH BACKING VOCALS

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2

Quarterly Progress and Status Report. Analysis and simulation of small variations in the fundamental frequency of sustained vowels

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam

Does Saxophone Mouthpiece Material Matter? Introduction

Simple Harmonic Motion: What is a Sound Spectrum?

Author Index. Absolu, Brandt 165. Montecchio, Nicola 187 Mukherjee, Bhaswati 285 Müllensiefen, Daniel 365. Bay, Mert 93

Preferences for Strong or Weak Singer's Formant Resonance in Choral Tone Quality

the mathematics of the voice. As musicians, we d both been frustrated with groups inability to

We realize that this is really small, if we consider that the atmospheric pressure 2 is

Processing Linguistic and Musical Pitch by English-Speaking Musicians and Non-Musicians

Vocal tract adjustments in the high soprano range

THE DIGITAL DELAY ADVANTAGE A guide to using Digital Delays. Synchronize loudspeakers Eliminate comb filter distortion Align acoustic image.

2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics

Jaw Harp: An Acoustic Study. Acoustical Physics of Music Spring 2015 Simon Li

Perceptual thresholds for detecting modifications applied to the acoustical properties of a violin

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Music 209 Advanced Topics in Computer Music Lecture 1 Introduction

BBN ANG 141 Foundations of phonology Phonetics 3: Acoustic phonetics 1

HST 725 Music Perception & Cognition Assignment #1 =================================================================

by Staff Sergeant Samuel Woodhead

Quarterly Progress and Status Report. Statistical computer measurements of the tone-scale in played music

Proceedings of the 7th WSEAS International Conference on Acoustics & Music: Theory & Applications, Cavtat, Croatia, June 13-15, 2006 (pp54-59)

Timbre blending of wind instruments: acoustics and perception

Florida Performing Fine Arts Assessment Item Specifications for Benchmarks in Course: Chorus 2

Quarterly Progress and Status Report. A singer s expression of emotions in sung performance

Virtual Vibration Analyzer

Creative Computing II

EE-217 Final Project The Hunt for Noise (and All Things Audible)

MHSIB.5 Composing and arranging music within specified guidelines a. Creates music incorporating expressive elements.

Transcription:

Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Formant frequency tuning in singing Carlsson-Berndtsson, G. and Sundberg, J. journal: STL-QPSR volume: 32 number: 1 year: 1991 pages: 029-035 http://www.speech.kth.se/qpsr

STL-QPSR 1 / 199 1 FORMANT FREQUENCY TUNING IN SINGING Gllrlilla Carlssori & Johan Surldberg ABSTRACT It is sometimes claimed that some singers tune their two lowest formant frequencies to harmonic partials in order to increase the audibility of the voice. Voice acoustics predicts that such tuning of formants should cause vowel changes. Using a newly constructed digital singing machine, the perceptual consequences of such tuning have been explored. Four different cases were represented, constant formant frequencies, and formant frequencies adapted to the fundamental frequency according to either of three different strategies. The resulting voice timbres were judged by an expert panel of singing teachers in a listening test consisting of descending chromatic scales. Constant formant frequencies were clearly preferred, presumably because formant tuning entails formant frequency shifts between adjacent tones so substantial that salient vowel quzlity shifts occur. INTRODUCTION A singer is required to produce vowels that meet rather special demands. The tone must in some sense be beautiful. At the same time, it must often be audible even over a loud orchestra. These demands are partly met by articulatory means, i.e., by articulatory adjustments involving the tongue, the lips, and the jaw. Thereby, the vocal-tract resonances, i.e., the formants, are changed. A formant can be regarded as a peak in the frequency curve of the vocal-tract filter. This implies that all voice source partials are enhanced that are close to a formant. Typically, the overall radiated sound level of a vowel will increase if the frequency of the first formant approaches the strongest spectrum partial. The last mentioned effect on the overall sound level has inspired the hypothesis that singers might systematically adjust formants such that they coincide with partials, thus increasing the overall sound level without raising vocal effort. Adopting the terminology of Miller & Schutte (1990), we will henceforth refer to this as formarlt tutling. A case of systematical use of formant tuning has been examined by co-author JS (Sundberg, 1987). In super-pitch singing, the normal frequency value of the first formant is often lower than the fundamental frequency (FO). In such cases, sopranos were found to raise their first formant to a frequency just above the fundamental. This strategy has the advantage of increasing the sound level considerably as, otherwise, the first formant would be basically void of partials. In Fig. 1, the pitchdependent formant frequencies for different vowels, as produced by a professional soprano singer, are shown. Another and more exotic case of formant tuning is found in certain types of chanting (Smith, Stevens, & Tomlinson, 1967; Sundberg, 1988). The effect of this technique which seems to involve a clustering of the second and third formants is so strong that the partial, coinciding with the formant, stands out as a salient pitch appearing together with the pitch corresponding to the fundamental. At lower pitches, the problem never arises that the first formant is lower than the fundamental and hence void of partials. However, even in such cases, formant tuning may of course be used for purely timbered reasons. Also in such cases, effects on the overall sound level amounting to several db may occur. The singing teacher Berton Coffin developed a system of singing exercises where formant tuning was used systematically (Coffin, 1980). Thus, formant tuning seems to be commonly applied in some singing practise. Still, only limited efforts have been made to examine timbered consequences of this practice scientifically.

STL-QPSR 111 99 1 The situation that the frequency of a partial equals that of a formant may, of course, occur also by pure coincidence. An interesting question then is whether or not formant tuning is used deliberately by professional singers. SOPRANO 1 e I - I SPOKEN Fig. I. FUNDAMENTAL FREQUENCY (Hz) Formant requencies estimated from a professional soprano's singing of the vowels indicate d at various pitches. The leftmost vowel symbols refer to the subject's speech. The lines between the vowel symbols represent an idealized approximation o the data. In addition, the frequencies of the three first partials are shown. After undberg (1987). s' The probability of the formants occurring near partials obviously depends on pitch, as the spectrum partials appear densely along the frequency axis at low fundamental frequencies. The probability that the formant is half a bandwidth (BWl2) or less from a partial can be expressed as BW/FO. The lowest formants are of primary interest, as they contribute most to the overall sound level. Miller & Schutte (1990) examined sub-, supra-, and transglottal pressures together with the audio and EGG signals from a professional baritone singer performing different singing tasks. They reported no evidence of formant tuning in a descending scale. However, in a three-note triad arpeggio, they found that the singer used formant tuning on the two most important notes, i.e., the lowest

STL-QPSR 1/1991 and the highest notes. Miller & Schutte also reported that, because of this formant, tuning the vowel quality was substantially modified from the intended lid quality toward an /&/. However, it does not seem possible to exclude the explanation that this singer, in producing a strange pronunciation of the vowel, happened to have formants match partials by pure coincidence. Indeed, an attempt to synthesize their example revealed that spectra, similar to those published in their article, were obtained if the second formant remained close to 1.5 khz throughout the arpeggio. Raphael & Scherer (1987) studied voice modifications of stage actors comparing phonatory characteristics of the so-called Lessac "call" technique (Lessac, 1967) and a more neutral type of phonation, more like conversational speech. The "call" technique is used to enable actors to be heard over a loud background noise and to make the voice more flexible. In such a call, the qualities of vowel articulation are secondary to maintaining the tonal and expressive qualities of a voice. Two tendencies were found in call: (a) the spectrum peak corresponding to F1 was sharper and moved slightly towards the first partial; (b) the spectrum energy between 2150 and 2350 Hz was higher so that a peak similar to that of a singer's formant was created. The result is a characteristic brilliance of the voice quality. The effect is reported to arise from a series of manoeuvres: a "half-yawn sensation", two fingertips' worth of space between the upper and lower teeth, and forward stretch of the cheek muscles that feed into the upper lip. When applied, internal vibrational sensations can be felt in the skull and mouth during production of many vowels. The concept of formant tuning raises an interesting question. The first and second formants are decisive for the vowel quality while the higher formants are more influential on voice quality. Therefore, a general application of the formant-tuning principle will lead to discontinuities with regard to vowel quality. Formant tuning, on the other hand, also adds to the overall sound level. How important is vocal loudness as compared with vowel quality? The aim of the present investigation was to investigate the acceptability of different formant-tuning strategies for the task of singing a scale. METHOD Singing synthesis seems an almost ideal tool for investigating the timbered implications of formant tuning. In our department, the analog singing synthesizer MUSSE has been successfully used in perceptual research on singing (Carlsson, 1988; Sundberg, 1989). Recently, a new, digital version of this synthesizer, MUSSE DIG, has been developed (Carlsson & Neovius, 1990). A basic concern is the realism of the synthesis. To investigate this, an attempt was made to match a real vowel sound as produced by a male voice with the synthesizer. The result can be seen in Fig. 2. To investigate the acceptability of the principle of formant tuning, a set of different stimuli was synthesized using MUSSE DIG. The set consisted of descending, octave-wide chromatic scales approximating the vowel /a/, starting from C4 (261 Hz). Three different strategies for formant tuning were synthesized, as shown in Fig. 3. In strategy A, the first formant was invariably tuned to the partial lying closest to 550 Hz. In strategy B, the second formant was tuned to the partial lying closest to 1000 Hz. In strategy C, either the first or the second formant was tuned to the spectrum partial closest to 550 or 1000 Hz, depending on which alternative gave the smallest formant frequency departure from these values. In addition, a version D was synthesized in which the formants remained at 550 and 1000 Hz in all tones in the scale. These scales were arranged on a listening tape for a paired comparison test, using the constant formant frequency scale as the standard.

STL-QPSR 1/1991 Fig. 2. Power spectra of a real (a) and a matching synthesized vowel (b). The sung vowel is an lal. Nineteen members of the singing teachers' class at the Stockholm State Conservatory of Music listened to the syntheses. They listened over a loudspeaker to the four types of stimuli arranged in pairs for comparison. Each of stimuli A, B, and C, containing changing formant frequencies, was presented with stimulus D, with constant formant frequencies, as the standard. The subjects received an instruction sheet with the text: "Listen to the voice production in these descending chromatic scales which are both repeated once. Which voice production do you find most correct?" The subjects marked on an answer sheet which synthesis in each pair they preferred. RESULTS The results are presented in Fig. 4. One subject failed to give an answer to one pair. The synthesis with unchanged formant frequencies was preferred by all subjects except for one single subject who preferred stimulus C to stimulus D (with constant formant frequencies). This indicates that formant tuning can not be used as a general principle. In other words, it is impossible to apply to all tones in a scale. The reason would be the effects on vowel quality. DISCUSSION AND CONCLUSIONS The main issue considered in the present investigation is the acceptability of different formant tuning strategies. The main result was that formant tuning is not applicable as a general principle in singing. This seems to be in accordance with the suggestion of Schutte & Miller (1990) that formant tuning may be difficult to apply in scale passages without intervening consonants. Under which conditions can formant tuning be used, then? Apart from the chanting mentioned in the introduction, the only certain indication of formant tuning in singing technique is in super-pitch singing, where otherwise the first formant would be lower than the fundamental frequency (Sundberg, 1987). In a lower pitch range, formant tuning is not likely to occur as long as the same vowel is sung in legato; sudden changes of vowel quality would disturb the timbered continuity needed for the legato. Also, formant tuning may raise unreasonably high demands on articulation, particularly in rapid tempos and florid singing. A situation where formant tuning may be more likely to occur is a strong accentuation of a particular tone in a

STL-QPSR 11199 1 REFERENCES Carlsson, G. & Neovius, L. (1990): "Implementations of synthesis models for speech and singing," STL- QPSR NO. 2-3, pp. 63-67. Carlsson, G. (1988): The KTH Program for Synthesis of Singing, Thesis work at the Dept. of Speech Communication and Music Acoustics, KTH. Coffin, B. (1980): Overtones of Be1 Canto. The Phonetic Basis of Artistic Singing, Scarecrow Press, Metuchen, NJ. Lessac, A. (1967): The Use and Training of the Human Voice, Drama Book, New York, 2nd ed. Miller, D.G. & Schutte, H.K. (1990): "Formant tuning in a professional baritone," J. Voice 4:3, pp. 231-237 Raphael, B.N. & Scherer, R.C. (1987): "Voice modifications of stage actors: acoustic analyses," J. Voice 1:1, pp. 83-87. Smith, H., Stevens, K.N., & Tomlinson, R.S. (1967): "On an unusual mode of chanting by certain tibetan lamas," J.Acoust.SocAm. 415, pp. 1262-1264. Sundberg, J. (1987): The Science of the Singing Voice, Northern Illinois University Press, Dekalb, IL. Sundberg, J. (1988): "Vocal tract resonance in singing," The Nat.Ass. of Teachers of Singing J., MarcWApril. Sundberg, J. (1989): "Synthesis of singing by rule," pp. 45-55 and 401-403 in (M. Mathews & J. Pierce, eds.) Current Directions in Computer Music Research, MIT Press System Development Foundation, Benchmark Series, Cambridge, MA.