Welcome to Vibrationdata

Similar documents
Analysis of the effects of signal distance on spectrograms

How We Sing: The Science Behind Our Musical Voice. Music has been an important part of culture throughout our history, and vocal

Pitch-Synchronous Spectrogram: Principles and Applications

Making music with voice. Distinguished lecture, CIRMMT Jan 2009, Copyright Johan Sundberg

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB

3 Voiced sounds production by the phonatory system

International Journal of Computer Architecture and Mobility (ISSN ) Volume 1-Issue 7, May 2013

Welcome to Vibrationdata

(Adapted from Chicago NATS Chapter PVA Book Discussion by Chadley Ballantyne. Answers by Ken Bozeman)

Week 6 - Consonants Mark Huckvale

Jaw Harp: An Acoustic Study. Acoustical Physics of Music Spring 2015 Simon Li

Music Representations

Music 170: Wind Instruments

Music Representations

Simple Harmonic Motion: What is a Sound Spectrum?

Creative Computing II

EVTA SESSION HELSINKI JUNE 06 10, 2012

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam

Digital music synthesis using DSP

Pitch. There is perhaps no aspect of music more important than pitch. It is notoriously

Quarterly Progress and Status Report. Formant frequency tuning in singing

A comparison of the acoustic vowel spaces of speech and song*20

2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics

Available online at International Journal of Current Research Vol. 9, Issue, 08, pp , August, 2017

How do clarinet players adjust the resonances of their vocal tracts for different playing effects?

Welcome to Vibrationdata

The role of vocal tract resonances in singing and in playing wind instruments

UNIVERSITY OF DUBLIN TRINITY COLLEGE

A PSYCHOACOUSTICAL INVESTIGATION INTO THE EFFECT OF WALL MATERIAL ON THE SOUND PRODUCED BY LIP-REED INSTRUMENTS

AN INTRODUCTION TO MUSIC THEORY Revision A. By Tom Irvine July 4, 2002

CHAPTER 20.2 SPEECH AND MUSICAL SOUNDS

Tempo and Beat Analysis

AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY (F0) MARKERS IN SPEECH

WHAT IS BARBERSHOP. Life Changing Music By Denise Fly and Jane Schlinke

Proposal for Presentation of Doctoral Essay. A Description and Application of Robert Aitken s Concept. of the Physical Flute

Music for the Hearing Care Professional Published on Sunday, 14 March :24

The Tone Height of Multiharmonic Sounds. Introduction

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

FPFV-285/585 PRODUCTION SOUND Fall 2018 CRITICAL LISTENING Assignment

Lecture 17 Microwave Tubes: Part I

EE513 Audio Signals and Systems. Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky

Vocal-tract Influence in Trombone Performance

Linear Time Invariant (LTI) Systems

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

The Interactions Between Wind Instruments and their Players

Speaking loud, speaking high: non-linearities in voice strength and vocal register variations. Christophe d Alessandro LIMSI-CNRS Orsay, France

2. AN INTROSPECTION OF THE MORPHING PROCESS

Lab P-6: Synthesis of Sinusoidal Signals A Music Illusion. A k cos.! k t C k / (1)

Getting Started with the LabVIEW Sound and Vibration Toolkit

Choir Workshop Fall 2016 Vocal Production and Choral Techniques

ANALYSING DIFFERENCES BETWEEN THE INPUT IMPEDANCES OF FIVE CLARINETS OF DIFFERENT MAKES

Saxophonists tune vocal tract resonances in advanced performance techniques

THE DIGITAL DELAY ADVANTAGE A guide to using Digital Delays. Synchronize loudspeakers Eliminate comb filter distortion Align acoustic image.

Glossary of Singing Voice Terminology

Complete Vocal Technique in four pages

The Complete Conductor: Breath, Body and Spirit

The Complete Vocal Workout for Guys

Does Saxophone Mouthpiece Material Matter? Introduction

increase by 6 db each if the distance between them is halved. Likewise, vowels with a high first formant, such as /a/, or a high second formant, such

Author Index. Absolu, Brandt 165. Montecchio, Nicola 187 Mukherjee, Bhaswati 285 Müllensiefen, Daniel 365. Bay, Mert 93

APP USE USER MANUAL 2017 VERSION BASED ON WAVE TRACKING TECHNIQUE

DOC s DO s, DON T s and DEFINITIONS

Special Studies for the Tuba by Arnold Jacobs

Acoustical correlates of flute performance technique

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Interactions between the player's windway and the air column of a musical instrument 1

IBEGIN MY FIRST ARTICLE AS Associate Editor of Journal of Singing for

Quarterly Progress and Status Report. X-ray study of articulation and formant frequencies in two female singers

CSC475 Music Information Retrieval

EE-217 Final Project The Hunt for Noise (and All Things Audible)

Real-time magnetic resonance imaging investigation of resonance tuning in soprano singing

Measurement of overtone frequencies of a toy piano and perception of its pitch

Marion BANDS STUDENT RESOURCE BOOK

MIE 402: WORKSHOP ON DATA ACQUISITION AND SIGNAL PROCESSING Spring 2003

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high.

Vocal tract resonances in speech, singing, and playing musical instruments

the mathematics of the voice. As musicians, we d both been frustrated with groups inability to

Comparison Parameters and Speaker Similarity Coincidence Criteria:

The Mathematics of Music and the Statistical Implications of Exposure to Music on High. Achieving Teens. Kelsey Mongeau

Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals. By: Ed Doering

Physics Homework 3 Fall 2015 Exam Name

Anatomy Of The Voice An Illustrated Guide For Singers Vocal Coaches And Speech Therapists

The Washington Professional Educator Standards Board. Washington Educator Skills Tests. Sample Test Questions. Music: Instrumental WA-SG-FLD036-01

How do clarinet players adjust the resonances of their vocal tracts for different playing effects?

Physiological and Acoustic Characteristics of the Female Music Theatre Voice in belt and legit qualities

ADSR AMP. ENVELOPE. Moog Music s Guide To Analog Synthesized Percussion. The First Step COMMON VOLUME ENVELOPES

HST 725 Music Perception & Cognition Assignment #1 =================================================================

Robert Alexandru Dobre, Cristian Negrescu

by Staff Sergeant Samuel Woodhead

OCTAVE C 3 D 3 E 3 F 3 G 3 A 3 B 3 C 4 D 4 E 4 F 4 G 4 A 4 B 4 C 5 D 5 E 5 F 5 G 5 A 5 B 5. Middle-C A-440

LEAD SECTIONAL. Expression Accurate sense of plan basic to complex Ability to craft a simple and successful plan, leave the interp for coaches

Sounds of Music. Definitions 1 Hz = 1 hertz = 1 cycle/second wave speed c (or v) = f f = (k/m) 1/2 / 2

Music Theory: A Very Brief Introduction

Advanced Signal Processing 2

Combining Instrument and Performance Models for High-Quality Music Synthesis

ANATOMY OF THE VOICE The physical working and structure of the vocal tract

5U Oakley Modular Series

Music Segmentation Using Markov Chain Methods

Version 5: August Requires performance/aural assessment. S1C1-102 Adjusting and matching pitches. Requires performance/aural assessment

Transcription:

Welcome to Vibrationdata Acoustics Shock Vibration Signal Processing February 2004 Newsletter Greetings Feature Articles Speech is perhaps the most important characteristic that distinguishes humans from animals. Speech can take many forms and can serve myriad purposes. Martin Luther King s I Have a Dream speech is among the most ennobling in American history. King s declaration resonated with moral clarity, enhanced by the resonation of his voice. The purpose of this month s newsletter is to present an acoustical analysis of King s speech, which is given in the second article. A proper evaluation requires a framework of acoustical principles, which are given in the first article. Sincerely, An Introduction to Human Speech page 2 Acoustic Analysis of Martin Luther King s I Have a Dream Speech page 5 Tom Irvine Email: tomirvine@aol.com 1

An Introduction to Human Speech by Tom Irvine o hear more harmonies with his melody. Figure 1. Vocal Folds and Oral Cavity Image courtesy of HyperPhysics, Georgia State University Introduction Speech generation is a rather complex process. This article considers four phases in speech production: respiration, phonation, resonation, and articulation. Respiration The lungs provide the airflow through the glottis, which is the opening between the vocal folds. The vocal folds are inside the larynx. The glottis is open during normal breathing. The vocal folds are spread far apart during this phase. Phonation Phonation is the process whereby the vocal folds convert the airflow energy into audible sound. Vocal folds are also referred to as either cords or chords. The folds are muscles. Additional muscles and cartilages inside the larynx cause the vocal folds to move inward during the onset of speech, closing the glottis. This closing is aided by an aerodynamic effect called the Bernoulli principle. That is, as the speed of a moving fluid or gas increases, the pressure within the fluid decreases. A suction force thus brings the vocal folds together as the airflow moves upward from the lungs to the mouth. The air particles that have passed through the now closed glottis continue traveling toward the mouth. The remaining pressure in the lungs is thus greater than the pressure on the closed side of the glottis. At a certain pressure 2

differential, the vocal folds are blown outward, thus opening the glottis and releasing a single 'puff' of air. The elastic restoring force in the vocal folds contributes to this opening process. The cycle is repeated, producing a periodic train of air pulses, illustrated in Figure 1. The vocal folds control the rate at which this oscillation occurs. Specifically, the vocal folds have a fundamental frequency that is a function of the folds mass and tension. The resulting pressure time history would have the shape of a sawtooth wave. This wave is composed of the fundamental frequency and its integer harmonics. It represents the hundreds of air puffs per second that make up speech. The phonation process thus described is a simplification. Researchers have determined that the folds alternately take on a convergent and divergent shape during the cycle, as shown in Figure 2. The average air pressure within the glottis tends to be larger in the convergent configuration than in the divergent shape, resulting in the asymmetry of air pressures that helps sustain the oscillation. Vocal Fold Fundamental Frequency The most obvious difference between the male and female voice is fundamental frequency, or pitch. Due to the increase in mass of a male's vocal folds, which occurs during puberty, the average speaking fundamental frequency for males varies between 100-132 Hz while the average for females varies between 142-256 Hz, per Reference 1. Nearly all information in speech is in the range 200 Hz to 8 khz. Some telephone systems carry sound from only 300 Hz to 3 khz, but the speech is still reasonably intelligible. The pitch is determined by the spacing of harmonics perhaps more than by the fundamental frequency, per Reference 2. Thus a man's voice on the phone is readily identifiable even though the fundamental of that signal is not present. Resonation Resonation refers to the quality of the voice as regulated by the vocal tract including the soft palate. The vocal tract is like a closed-open pipe. The natural frequencies of a closed-open pipe of 17 centimeters occur around 500, 1500, and 2500 Hz. The fundamental frequency increases to 600 Hz for if the length decreases to 14 centimeters. The frequency of speed is determined primarily by the vocal cords. The vocal tract frequency response further shapes the speech production. It acts as a filter that amplifies certain frequencies while attenuating others. Amplification occurs when the natural frequency of the vocal folds is at or near the natural frequency of the vocal tract. This condition is resonance. Articulation Articulators transform the sound into intelligible speech. Articulation is controlled by the positions of the tongue, lips, and jaw. The teeth also play a role. These articulators retune the natural frequency of the vocal tract system, which is important for producing vowels. 3

The position of the lips and tongue determine the geometry of the opening, thus controlling the natural frequency of the system. In this sense, the vocal tract system behaves as Helmholtz or cavity resonator, in addition to behaving as a closed-open pipe. References 1. Mikos and Pausewang, The Relative Contribution of Speaking Fundamental Frequency and Formant Frequencies to Gender Identification, Presented at the 2001 Convention of the American Speech-Language-Hearing Association November 15-18, 2001, New Orleans, LA. 2. Joe Wolfe, University of New South Wales, 1999. Figure 2. Image Courtesy of Phil Hoole 4

Acoustic Analysis of Martin Luther King s I Have a Dream Speech by Tom Irvine o Introduction Martin Luther King, Jr. delivered his I have a Dream speech on the steps of the Lincoln Memorial in Washington D.C. on August 28, 1963. King began the speech: I am happy to join with you today in what will go down in history as the greatest demonstration for freedom in the history of our nation. Five score years ago, a great American, in whose symbolic shadow we stand signed the Emancipation Proclamation. King delivered the following memorable lines in the middle of the speech: where they will not be judged by the color of their skin but by the content of their character. I have a dream today! King concluded: When we let freedom ring, when we let it ring from every village and every hamlet, from every state and every city, we will be able to speed up that day when all of God's children, black men and white men, Jews and Gentiles, Protestants and Catholics, will be able to join hands and sing in the words of the old Negro spiritual, "Free at last! Free at last! Thank God Almighty, we are free at last!" These words deliver a powerful message even in written form. King s pitch modulation and vocal tract resonation transformed his dream into an electrifying elocution which awakened the conscience of a nation. I have a dream that my four little children will one day live in a nation 5

Vocal Fold Fundamental Frequency The pressure time history of the I have a dream quote is given in Figure 1. There is a 2 second pause near the middle of the sample. This marks the gap between children and will. King exercised tremendous pitch modulation during his speech. This is one of several vocal characteristics that enhanced his message. Identifying a precise fundamental frequency, however, is challenging as a result of the modulation. A spectral magnitude function of the time history is given in Figure 2. The magnitude has a linear scale, although the pressure unit is not specified. The spectral function has sharp peaks at approximately 240 Hz and 360 Hz. The difference between these peaks is 120 Hz. King s vocal fold fundamental frequency thus appears to be 120 Hz. Recall from the previous article that the average speaking fundamental frequency for males varies between 100-132 Hz. The spectral function has a small peak at 120 Hz, which may have been attenuated by the highpass filtering characteristics of the recording equipment. The frequency response characteristics of the recording equipment are unknown, however. Furthermore, there are numerous spectral peaks across the entire domain in Figure 2. Some of the peaks are harmonics of the vocal fold fundamental frequency. The array of pitches gives a very rich, melodious sound. Vocal Tract Fundamental Frequency The highest levels in the spectral function occur among a cluster of peaks from 540 Hz to 650 Hz. The fifth natural frequency of King s voice was approximately 600 Hz, which is in the midst of this cluster. That the fifth natural frequency would project more energy than any of the preceding four would be highly unusual for any system, however. Vocal tract resonation is the explanation for the cluster response. Recall that the vocal tract behaves as a closed-open pipe. The fundamental frequency of a 14.2 cm long closed-open pipe is 600 Hz. This appears to have been King s vocal tract fundamental frequency, approximately. Thus King s fifth natural frequency exited his vocal tract mode, resulting in significant amplification of his voice. Pitch Modulation A spectrogram waterfall plot of the I have a dream quote is given in Figure 3. This format reveals the pitch modulation as the spectral peaks shift higher or lower in frequency. Note the rapid pitch increase that occurs from 540 Hz to 600 Hz from 6 to 8 seconds. Again, there is a 2 second pause near the middle of the sample, which marks the gap between children and will. The frequency increases to nearly 650 Hz as King resumes, pronouncing will. Thereafter, the pitch experiences a very gradual decrease, returning to 540 Hz. 6

The difference between 540 Hz and 650 Hz is approximately one-quarter of an octave, which is a wide spectrum. This domain covers the musical notes C#, D, D#, and E. Frequency Analysis of a Brief Segment A time history with a 50 millisecond segment is given in Figure 4. This occurs as the pitch is rising before the gap. The top signal is the measured data. The bottom signal was synthesized from discrete sinusoids using the method in Reference 1. The goal was to match the characteristics of the measured data. This method indirectly yields the frequencies of the measured data. The dominant frequency is 560 Hz, which is the modulated vocal tract fundamental frequency. The signal also contains integer harmonics of this frequency at 1120, 2240, and 3360 Hz. In addition, there is an 840 Hz component. The mechanism of this sinusoid is not immediately clear, however. Conclusion Martin Luther King, Jr. illuminated his call for freedom and justice with his lyrical voice, accentuating his words with pitch modulation and magnifying his message through vocal tract resonation. Reference 1. Irvine, A Time Domain, Curve-Fitting Method for Accelerometer Data Analysis, AIAA Paper 7667, 2003. 7

TIME HISTORY - MLK SPEECH EXCERPT AMPLITUDE 0 5 10 15 Figure 1. TIME (SEC) SPECTRAL MAGNITUDE - MLK SPEECH EXCERPT MAGNITUDE 0 500 1000 1500 2000 Figure 2. FREQUENCY (Hz) 8

Spectrogram Waterfall Gradual Pitch Decrease Sharp Pitch Increase Figure 3. 9

EXCERPT FROM SPEECH TOP - MEASURED DATA BOTTOM - SYNTHESIZED SIGNAL AMPLITUDE 6.60 6.61 6.62 6.63 6.64 6.65 TIME (SEC) Figure 4. 10