Estimating the Time to Reach a Target Frequency in Singing

Similar documents
Quantifying Tone Deafness in the General Population

Effects of Musical Training on Key and Harmony Perception

Repetition Priming in Music

OVER THE YEARS, PARTICULARLY IN THE PAST

How do we perceive vocal pitch accuracy during singing? Pauline Larrouy-Maestri & Peter Q Pfordresher

The Beat Alignment Test (BAT): Surveying beat processing abilities in the general population

Finger motion in piano performance: Touch and tempo

A sensitive period for musical training: contributions of age of onset and cognitive abilities

How do scoops influence the perception of singing accuracy?

Perception of melodic accuracy in occasional singers: role of pitch fluctuations? Pauline Larrouy-Maestri & Peter Q Pfordresher

Abnormal Electrical Brain Responses to Pitch in Congenital Amusia Isabelle Peretz, PhD, 1 Elvira Brattico, MA, 2 and Mari Tervaniemi, PhD 2

Activation of learned action sequences by auditory feedback

The effect of focused instruction on young children s singing accuracy

SHORT TERM PITCH MEMORY IN WESTERN vs. OTHER EQUAL TEMPERAMENT TUNING SYSTEMS

Singing accuracy, listeners tolerance, and pitch analysis

Pitch-Matching Accuracy in Trained Singers and Untrained Individuals: The Impact of Musical Interference and Noise

Proceedings of Meetings on Acoustics

Temporal Coordination and Adaptation to Rate Change in Music Performance

WORKING MEMORY AND MUSIC PERCEPTION AND PRODUCTION IN AN ADULT SAMPLE. Keara Gillis. Department of Psychology. Submitted in Partial Fulfilment

Stability and accuracy of long-term memory for musical pitch

Temporal coordination in joint music performance: effects of endogenous rhythms and auditory feedback

Object selectivity of local field potentials and spikes in the macaque inferior temporal cortex

Expressive performance in music: Mapping acoustic cues onto facial expressions

Music Radar: A Web-based Query by Humming System

Reference Manual. Using this Reference Manual...2. Edit Mode...2. Changing detailed operator settings...3

MELODIC AND RHYTHMIC CONTRASTS IN EMOTIONAL SPEECH AND MUSIC

On time: the influence of tempo, structure and style on the timing of grace notes in skilled musical performance

THE INTERACTION BETWEEN MELODIC PITCH CONTENT AND RHYTHMIC PERCEPTION. Gideon Broshy, Leah Latterner and Kevin Sherwin

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Impaired learning of event frequencies in tone deafness

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

Influence of tonal context and timbral variation on perception of pitch

MEASURING LOUDNESS OF LONG AND SHORT TONES USING MAGNITUDE ESTIMATION

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Good playing practice when drumming: Influence of tempo on timing and preparatory movements for healthy and dystonic players

Opera Minora. brief notes on selected musical topics

The Power of Listening

Characterization of de cits in pitch perception underlying `tone deafness'

Brain-Computer Interface (BCI)

Why are natural sounds detected faster than pips?

Affective Priming. Music 451A Final Project

Music 175: Pitch II. Tamara Smyth, Department of Music, University of California, San Diego (UCSD) June 2, 2015

Music Information Retrieval Using Audio Input

MOST FORMS OF ENSEMBLE PERFORMANCE SYNCHRONIZATION OF TIMING AND MOTION AMONG PERFORMING MUSICIANS

Smooth Rhythms as Probes of Entrainment. Music Perception 10 (1993): ABSTRACT

AUTOMATIC ACCOMPANIMENT OF VOCAL MELODIES IN THE CONTEXT OF POPULAR MUSIC

Measurement of overtone frequencies of a toy piano and perception of its pitch

Developmental changes in the perception of pitch contour: Distinguishing up from down

Florida Performing Fine Arts Assessment Item Specifications for Benchmarks in Course: M/J Chorus 3

Acoustic and musical foundations of the speech/song illusion

Quarterly Progress and Status Report. Replicability and accuracy of pitch patterns in professional singers

The Relationship Between Auditory Imagery and Musical Synchronization Abilities in Musicians

Semi-automated extraction of expressive performance information from acoustic recordings of piano music. Andrew Earis

Dial A440 for absolute pitch: Absolute pitch memory by non-absolute pitch possessors

Analysis of local and global timing and pitch change in ordinary

Florida Performing Fine Arts Assessment Item Specifications for Benchmarks in Course: Chorus 5 Honors

ANALYSIS OF VOCAL IMITATIONS OF PITCH TRAJECTORIES

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

Pitch perception and production in congenital amusia: evidence from Cantonese speakers

AUD 6306 Speech Science

DOI: / ORIGINAL ARTICLE. Evaluation protocol for amusia - portuguese sample

Elementary Music Curriculum Objectives

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1

AUDITION PROCEDURES:

Timbre blending of wind instruments: acoustics and perception

K-12 Performing Arts - Music Standards Lincoln Community School Sources: ArtsEdge - National Standards for Arts Education

ANALYSIS OF INTONATION TRAJECTORIES IN SOLO SINGING

Sample Analysis Design. Element2 - Basic Software Concepts (cont d)

Florida Performing Fine Arts Assessment Item Specifications for Benchmarks in Course: Chorus 2

Quarterly Progress and Status Report. Perception of just noticeable time displacement of a tone presented in a metrical sequence at different tempos

Noise evaluation based on loudness-perception characteristics of older adults

Musicians Adjustment of Performance to Room Acoustics, Part III: Understanding the Variations in Musical Expressions

However, in studies of expressive timing, the aim is to investigate production rather than perception of timing, that is, independently of the listene

The Tone Height of Multiharmonic Sounds. Introduction

Robert Alexandru Dobre, Cristian Negrescu

VISUAL CONTENT BASED SEGMENTATION OF TALK & GAME SHOWS. O. Javed, S. Khan, Z. Rasheed, M.Shah. {ojaved, khan, zrasheed,

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

We realize that this is really small, if we consider that the atmospheric pressure 2 is

Module PS4083 Psychology of Music

Expressive timing and dynamics in infantdirected and non-infant-directed singing

Take a Break, Bach! Let Machine Learning Harmonize That Chorale For You. Chris Lewis Stanford University

The power of music in children s development

Music Curriculum Kindergarten

STUDENT LEARNING OBJECTIVE (SLO) PROCESS TEMPLATE

How We Sing: The Science Behind Our Musical Voice. Music has been an important part of culture throughout our history, and vocal

Auditory Feedback in Music Performance: The Role of Melodic Structure and Musical Skill

What is Statistics? 13.1 What is Statistics? Statistics

The Lecture Contains: Frequency Response of the Human Visual System: Temporal Vision: Consequences of persistence of vision: Objectives_template

CS 591 S1 Computational Audio

Kent Academic Repository

NEW QUERY-BY-HUMMING MUSIC RETRIEVAL SYSTEM CONCEPTION AND EVALUATION BASED ON A QUERY NATURE STUDY

Pitch and Timing Abilities in Inherited Speech and Language Impairment

CHOIR Grade 6. Benchmark 4: Students sing music written in two and three parts.

NEUROLOGICALLY INTACT INDIVIDUALS APPEAR

2 Autocorrelation verses Strobed Temporal Integration

Intonation in Unaccompanied Singing: Accuracy, Drift and a Model of Reference Pitch Memory

GENERAL ARTICLE. The Brain on Music. Nandini Chatterjee Singh and Hymavathy Balasubramanian

Supervised Learning in Genre Classification

User-Specific Learning for Recognizing a Singer s Intended Pitch

Transcription:

THE NEUROSCIENCES AND MUSIC III: DISORDERS AND PLASTICITY Estimating the Time to Reach a Target Frequency in Singing Sean Hutchins a and David Campbell b a Department of Psychology, McGill University, Montreal, Quebec, Canada b Department of Statistics and Actuarial Science, Simon Fraser University, Surrey, British Columbia, Canada The ability to match pitches quickly and accurately is essential for proficient singing. We describe a new technique for estimating the time to reach a target frequency that uses adaptive optimal-kernel (AOK) time-frequency representations, designed to optimize the time-frequency tradeoff at each time point. We show in two experiments that this measure is more sensitive to tonal priming effects than an onset latency measurement. This analysis is applied to the vocal productions of untrained singers to reveal effects of tonality and pitch height. Key words: singing; time-frequency representation; priming Introduction The ability to match pitch is essential for proficient singing. Pitch-matching ability is one of the most important determinants of singing ability. Whereas many people claim to be tone deaf, 1 recent studies show that many people may underestimate their musical proficiency, at least when it comes to pitch matching. 2,3 These studies have found a wide range of individual difference in proficiency among occasional singers. Natural singing abilities span the range from professional singers, who are consistently more accurate at pitch-matching tasks than untrained singers, 4 to congenital amusics. 5 This wide range of abilities contrasts with the other major form of vocal communication, speech, in which all neurologically normal adults without major hearing deficiencies develop highfunctioning proficiency. Hutchins and Palmer 6 used pitch matching to show priming advantage for the tonic tone (the most important and most frequent tone Address for correspondence: Sean Hutchins, Department of Psychology, McGill University, 1205 Dr. Penfield Drive, Montreal, Quebec, H3A 1B1, Canada. Sean.Michael.Hutchins@umontreal.ca in a melody), as well as a repetition priming advantage for individual pitches. This study used a singing task: musically untrained participants heard a short melody, and sang back the pitch of the final melodic tone as soon as possible after hearing it on the syllable /ba/. All participants were female, to control for vocal range. The onset latency (OL) of the sung tones was measured, defined as the time from the onset of the final melodic stimulus tone to the onset of the first pitched part of the singer s response. Hutchins and Palmer 6 found a significant effect of tonal priming, such that the onset of responses to melodies were on average 22 ms faster for those that ended on the highly expected tonic tone than those that ended on a less-expected nontonic tone. We present an alternative metric to the OL measurement, the time to reach the target frequency (TRTF). This measure considers not onlytheonsetofthetonebutalsotheonset of the target frequency, which may occur at or after the OL. The TRTF incorporates both latency and accuracy in measurements of tones whose pitch can change across time. We hypothesize that there should be a tonal priming effect in the TRTF as well. The Neurosciences and Music III: Disorders and Plasticity: Ann. N.Y. Acad. Sci. 1169: 116 120 (2009). doi: 10.1111/j.1749-6632.2009.04856.x c 2009 New York Academy of Sciences. 116

Hutchins & Campbell: Time to Reach Target Frequency in Singing 117 Determining the TRTF requires both time and frequency resolution, which presents a challenge. To the degree that a frequency is precisely specified, its resolution in time becomes blurred, and likewise as units of time are shortened, one loses accuracy in determining the frequency. The rate at which sung tones move through pitch space affects the ideal ratio between frequency and temporal resolution. To alleviate this problem, we use an adaptive windowing time-frequency estimation method, adaptive optimal-kernel (AOK) timefrequency representations (T-FR), 7 which adjusts the tradeoff between time and frequency to account for how quickly the signal is changing at any given time. In addition, this method allows us to examine the way singers correct an inaccurate initial pitch, and their trajectory through pitch space. Onset and TRTF The AOK T-FR was performed using MATLAB and software available from the site http://www.mathworks.com/matlabcentral/ fileexchange/loadfile.do?objectid=11551, using κ = 2400 (down-sampling the recording to 4900 Hz), δ = 128, and α = 2. The ridge was defined as the largest amplitude frequency feature from the AOK T-FR at each point in time after restricting to frequencies less than 500 Hz. The OL measurement was defined as the earliest time in a vocal response where the following criteria were met: The ridge lies within the female register of approximately 175 480 Hz. The ridge persists for at least 50 ms within the female register. The pitch exceeds a minimum amplitude equal to 10% of the maximum amplitude of the response. The TRTF measurement was defined as the earliest time in a vocal response where: Figure 1. An example of the ridge obtained from the AOK T-FR of a sample vocal production from Hutchins and Palmer. 6 The vertical line indicates the measured time to target frequency, the dotted-dashed line indicates the signal end, where the signal amplitude drops below 10% of the maximum signal amplitude, and the dashed horizontal lines indicate the pitch tolerance boundaries (one semitone above and below the target frequency). The ridge is within one semitone of the target frequency. At least 80% of the remaining sung response is within one semitone of the target frequency. The OL criteria are met. Figure 1 shows an example of the ridge obtained from a sample vocal production from Hutchins and Palmer. 6 The solid vertical line indicates the measured time to target frequency, the dotted-dashed vertical line indicates the signal end, where the signal amplitude drops below 10% of the maximum signal amplitude, and the dashed horizontal lines indicate the pitch tolerance boundaries (one semitone above and below the target frequency).

118 Annals of the New York Academy of Sciences Results Comparing OL with TRTF A 2 2 repeated-measures ANOVA comparing the factors of measurement type and tonic ending showed a main effect of measurement type, F(1, 23) = 50.85, P < 0.001. The TRTF measurement yielded significantly longer latencies than the OL measurement (OL TRTF by definition). There was also a main effect of tonic ending, F(1, 23) = 19.97, P < 0.001, as well as an interaction between measurement type and tonic ending, F(1, 23) = 57.78, P < 0.01. Participants responded faster to tonic endings than nontonic endings across both measurement types, and this effect was larger for the TRTF measurement (33 ms) than for the OL measurement (22 ms). To ensure that this pattern of results was not specific to this particular experiment, the same analyses were also applied to Hutchins and Palmer s 6 Experiment 3, which used an identical task and design (only the timbre of the final target tone was changed) with a different groupofsingers. The2 2 repeated-measures ANOVA showed the same pattern of results as Experiment 2, with main effects of measurement type, F(1, 23) = 57.78, P < 0.001, tonic ending, F(1, 23) = 13.43, P < 0.01, and an interaction between measurement type and tonic ending, F(1, 23) = 4.98, P = 0.04. Comparing the Paths Taken to Reach the Target Frequencies The ridges obtained from the AOK T-FR give a description of the trajectories of the pitches (fundamental frequency) of the sung responses. Figure 2 shows the range of trajectories in the 400 ms after the OL for the data from Experiment 2 for each target pitch (C4- G4) across all responses. The dark lines indicate the median ridge trajectory, and the lines outside of those indicate the 80%, 90%, and 95% trajectories. The median trajectories for all targets began near the middle of the vocal range, and fell to reach low targets, but ascended to reach higher target pitches. A repeatedmeasures ANOVA comparing the factors of measurement type and target pitch revealed a main effect of target pitch, F(7,17) = 8.68, P < 0.001, and an interaction between target pitch and measurement type, F(7,17) = 4.43, P < 0.01. Both response latencies and initial inaccuracy times (differences between the TRTF and OL measurements) were longer on average for high and low target pitches compared with mid-range target pitches. Discussion The AOK T-FR measurement of the time for singers to reach target frequency showed a larger effect of tonal priming than could be obtained with traditional OL measurements. 6 The TRTF measure yielded tonal priming effects (measured as the difference in response latencies between tonic and non-tonic conditions) that were approximately 50% larger than the OL measures. Singers were less accurate in initial frequency and took longer to adjust to the correct pitch when the pitch they were attempting to match was a nontonic tone than when it is a tonic tone. Thus, the TRTF measure shows that tonic priming is not limited solely to planning of the vocal production, but continues to have an effect even after the tone has begun to be produced. The ridges provided by the AOK T-FR also provide a useful estimate of the average vocal trajectory that singers use to reach a target pitch. Singers tended to start in the mid-range of frequencies regardless of the pitch of the target tone, and adjusted their frequency upwards to reach high-frequency target pitches, and downward to reach low-frequency target pitches. This provides further evidence that, in speeded pitch-matching paradigms, such as those of Hutchins and Palmer, 6 participants continue to tune their vocal responses even after beginning to sing. One interesting

Hutchins & Campbell: Time to Reach Target Frequency in Singing 119 Figure 2. The range of ridge trajectories in the 400 ms after the OL for the data from Experiment 2 for each target pitch (C4-G4) across all responses. The dark lines indicate the median ridge trajectory, and the lines outside of those indicate the 80%, 90%, and 95% quantile interval boundaries. The horizontal lines with arrows indicate the target pitch (referred to by musical pitch below each graph). feature of these ridges is that they begin to move toward the target frequency immediately, indicating that participants are not waiting to receive their own external feedback. This may indicate that the onset of singing occurs while the vocal folds are still in the process of adjusting. In sum, the AOK T-FR provides a useful way of analyzing sung vocal production. It can yield stronger measurements of psychological processes, such as tonal priming, than can be found solely with OLs and gives a more complete picture of how singers change pitch over the course of time. Acknowledgments We would like to thank Caroline Palmer, Jim Ramsay, Janeen Loehr, and Werner Goebl. This research was supported in part by a Tomlinson Fellowship and a Le Fonds Québecois de la Recherché sur la Nature et les Technologies Fellowship to S.H. D.C. was supported by

120 Annals of the New York Academy of Sciences MITACS Grant 208683 and NSERC Grant 224607. Conflicts of Interest The authors declare no conflicts of interest. References 1. Sloboda, J.A., J.A. Wise & I. Peretz. 2005. Quantifying tone deafness in the general population. Ann. N. Y. Acad. Sci. 1060: 255 261. 2. Dalla Bella, S., J.-F. Giguère & I. Peretz. 2007. Singing proficiency in the general population. J. Acoust. Soc. Am. 121: 1182 1189. 3. Pfordresher, P.Q. & S. Brown. 2007. Poor-pitch singing in the absence of tone deafness. Music. Percept. 25: 95 115. 4. Murry, T. 1990. Pitch-matching accuracy in singers and nonsingers. J. Voice 4: 317 321. 5. Ayotte, J., I. Peretz & K. Hyde. 2002. Congenital amusia: a group study of adults afflicted with a musicspecific disorder. Brain 125: 238 251. 6. Hutchins, S. & C. Palmer. 2008. Repetition priming in music. J. Exp. Psychol. Hum. Percept. Perform. 34: 693 707. 7. Jones, D. & R. Baraniuk. 1995. An adaptive optimalkernel time-frequency representation. IEEE Trans. Signal Process. 43: 2361 2371.