Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Similar documents
DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high.

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

The Tone Height of Multiharmonic Sounds. Introduction

Auditory scene analysis

Measurement of overtone frequencies of a toy piano and perception of its pitch

Pitch perception for mixtures of spectrally overlapping harmonic complex tones

Do Zwicker Tones Evoke a Musical Pitch?

HST 725 Music Perception & Cognition Assignment #1 =================================================================

I. INTRODUCTION. 1 place Stravinsky, Paris, France; electronic mail:

Springer Handbook of Auditory Research. Series Editors: Richard R. Fay and Arthur N. Popper

Consonance perception of complex-tone dyads and chords

UNIVERSITY OF DUBLIN TRINITY COLLEGE

Quarterly Progress and Status Report. Violin timbre and the picket fence

CSC475 Music Information Retrieval

2 Autocorrelation verses Strobed Temporal Integration

Pitch is one of the most common terms used to describe sound.

AUD 6306 Speech Science

Creative Computing II

Brian C. J. Moore Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam

A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY

Analysis, Synthesis, and Perception of Musical Sounds

Music Representations

9.35 Sensation And Perception Spring 2009

CTP431- Music and Audio Computing Musical Acoustics. Graduate School of Culture Technology KAIST Juhan Nam

We realize that this is really small, if we consider that the atmospheric pressure 2 is

Simple Harmonic Motion: What is a Sound Spectrum?

Pitch: The Perceptual Ends of the Periodicity; but Of What Periodicity?

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

Physics and Neurophysiology of Hearing

Author Index. Absolu, Brandt 165. Montecchio, Nicola 187 Mukherjee, Bhaswati 285 Müllensiefen, Daniel 365. Bay, Mert 93

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Vibration Measurement and Analysis

Transcription An Historical Overview

2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics

Tempo and Beat Analysis

Voice segregation by difference in fundamental frequency: Effect of masker type

Topic 10. Multi-pitch Analysis

Proceedings of Meetings on Acoustics

Lecture 7: Music

Automatic music transcription

Acoustic and musical foundations of the speech/song illusion

Consonance, 2: Psychoacoustic factors: Grove Music Online Article for print

Note on Posted Slides. Noise and Music. Noise and Music. Pitch. PHY205H1S Physics of Everyday Life Class 15: Musical Sounds

Temporal Envelope and Periodicity Cues on Musical Pitch Discrimination with Acoustic Simulation of Cochlear Implant

Pitch Perception. Roger Shepard

MEASURING SENSORY CONSONANCE BY AUDITORY MODELLING. Dept. of Computer Science, University of Aarhus

The Semitone Paradox

Music 175: Pitch II. Tamara Smyth, Department of Music, University of California, San Diego (UCSD) June 2, 2015

PHYSICS OF MUSIC. 1.) Charles Taylor, Exploring Music (Music Library ML3805 T )

Timbre perception

The Lecture Contains: Frequency Response of the Human Visual System: Temporal Vision: Consequences of persistence of vision: Objectives_template

EE513 Audio Signals and Systems. Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky

Topic 4. Single Pitch Detection

Neural Correlates of Auditory Streaming of Harmonic Complex Sounds With Different Phase Relations in the Songbird Forebrain

DERIVING A TIMBRE SPACE FOR THREE TYPES OF COMPLEX TONES VARYING IN SPECTRAL ROLL-OFF

Pitch strength decreases as F0 and harmonic resolution increase in complex tones composed exclusively of high harmonics a)

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam

Digital audio and computer music. COS 116, Spring 2012 Guest lecture: Rebecca Fiebrink

Robert Alexandru Dobre, Cristian Negrescu

聲音有高度嗎? 音高之聽覺生理基礎. Do Sounds Have a Height? Physiological Basis for the Pitch Percept

Perceptual Considerations in Designing and Fitting Hearing Aids for Music Published on Friday, 14 March :01

BBN ANG 141 Foundations of phonology Phonetics 3: Acoustic phonetics 1

AD-A ýqj m7n IN , 111:

Identification of Harmonic Musical Intervals: The Effect of Pitch Register and Tone Duration

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB

Behavioral and neural identification of birdsong under several masking conditions

LOUDNESS EFFECT OF THE DIFFERENT TONES ON THE TIMBRE SUBJECTIVE PERCEPTION EXPERIMENT OF ERHU

Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals. By: Ed Doering

Tempo and Beat Tracking

Smooth Rhythms as Probes of Entrainment. Music Perception 10 (1993): ABSTRACT

Temporal summation of loudness as a function of frequency and temporal pattern

EFFECT OF REPETITION OF STANDARD AND COMPARISON TONES ON RECOGNITION MEMORY FOR PITCH '

Music Representations

The presence of multiple sound sources is a routine occurrence

Abstract. Introduction. Peter Cariani

The perception of concurrent sound objects through the use of harmonic enhancement: a study of auditory attention

A Psychoacoustically Motivated Technique for the Automatic Transcription of Chords from Musical Audio

Music Perception & Cognition

Psychoacoustics. lecturer:

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

Topics in Computer Music Instrument Identification. Ioanna Karydi

The Cocktail Party Effect. Binaural Masking. The Precedence Effect. Music 175: Time and Space

Auditory Illusions. Diana Deutsch. The sounds we perceive do not always correspond to those that are

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2

Consonance and Pitch

Topic 1. Auditory Scene Analysis

Audio Feature Extraction for Corpus Analysis

Lecture 2 What we hear: Basic dimensions of auditory experience

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Effect of harmonic rank on sequential sound segregation

1aAA14. The audibility of direct sound as a key to measuring the clarity of speech and music

THE importance of music content analysis for musical

Augmentation Matrix: A Music System Derived from the Proportions of the Harmonic Series

The Physics Of Sound. Why do we hear what we hear? (Turn on your speakers)

I. INTRODUCTION. Electronic mail:

Musical Illusions Diana Deutsch Department of Psychology University of California, San Diego La Jolla, CA 92093

Transcription:

Pitch Perception and Grouping HST.723 Neural Coding and Perception of Sound

Pitch Perception. I. Pure Tones The pitch of a pure tone is strongly related to the tone s frequency, although there are small effects of level and masking. <1000 Hz: increased level: decreased pitch 1000-2000 Hz: little or no change >2000 Hz: increased level: increased pitch Difference Limens for Frequency (DLF) The auditory system is exquisitely sensitive to changes in frequency (e.g. 2-3 Hz at 1000 Hz = 0.01 db).

How is frequency coded - Place or timing? Zwicker s proposal for FM detection. (See Moore, 1997) Place Pros: Could in principle be used at all frequencies. Cons: Peak of BM traveling wave shifts basally with level by ½ octave no similar pitch shift is seen; fails to account for poorer performance in DLFs at very high frequencies (> 4 khz), although does a reasonable job of predicting frequency-modulation difference limens (FMDLs).

Temporal cues Timing Pros: Pitch estimate is basically level-invariant; may explain the absence of musical pitch above ca. 4-5 khz. Cons: Thought to break down totally above about 4 khz (although some optimal detector models predict residual performance up to 8 or 10 khz); harder to explain diplacusis (differences in pitch perception between the ears). See Rose et al. (1971)

Musical pitch Musical pitch is probably at least 2- dimensional: Tone height: monotonically related to frequency Tone chroma: related to pitch class (note name) Circularity in pitch judgments: changes in chroma but no change in height. In circular pitch is a half-octave interval perceived as going up or down? (Deutsch, 1986) Musical pitch of pure tones breaks down above about 5 khz: octave matches become erratic and melodies are no longer recognized. Differences in frequency are still detected only tone chroma is absent. Further evidence for the influence of temporal coding?

Pitch Perception in Complex Tones Many complex sounds are harmonic, i.e. consist of components at integer multiples of the fundamental frequency (f 0 ). They generally have a pitch corresponding to the f 0. Place Theory (Ohm, 1843; Helmholtz, 1863) Lowest component of a complex determines its pitch. Other components important only for timbre. BUT: Pitch does not change when the fundamental is not physically present or is masked (Seebeck, 1841; Schouten, 1940; Licklider, 1956). What pitches do these complexes elicit? 2f 0 4f 0 f 0 3f 0 5f 0 6f 0 10f 0 3f 0 5f 0 2f 0 4f 0 (pitch of this depends on f0; Flanagan and Guttman) Demonstration: ASA CD Tracks 40-42

Pitch ambiguities A complex comprising 800, 1000, and 1200 Hz has a pitch of 200 Hz. What about 850, 1050, and 1250 Hz? (ASA CD track 38) Pitch salience becomes weaker when only higher harmonics are present. (ASA CD track 43-45). Why is this?

Resolved vs. unresolved harmonics ( See Moore, 1997 and Plomp, 1964)

Mechanisms of Complex Pitch Perception Temporal Theory (Schouten, 1940): Pitch is extracted from the summed waveform of adjacent components. This requires that some components interact. Pattern Recognition Theory (e.g. Goldstein, 1973): The frequencies of individual components are determined and the best-fitting f0 is selected. This requires that some components remain resolved and that some form of harmonic template exists. Evidence against a pure temporal model Pitch sensation is strongest for low-order (resolved) harmonics (Plomp, 1964; Ritsma, 1964). Pitch can be elicited by only two components, one in each ear (Houtsma and Goldstein, 1972). Pitch can be elicited by consecutively presented harmonics (Hall and Peters, 1981). Evidence again a pure pattern recognition theory Very high, unresolved harmonics can still produce a (weaker) pitch sensation Aperiodic, sinusoidally amplitude-modulated (SAM) white noise can produce a pitch sensation (Burns and Viemeister, 1976; 1981).

Autocorrelation model of pitch perception Based on an original proposal by Licklider (1951). The stimulus within each frequency channel is correlated (delayed, multiplied and averaged) with itself (through delay lines). This produces peaks at time intervals corresponding to multiples of the stimulus period. Pooling interval histograms across frequency produces an overall estimate of the dominant interval, which generally corresponds to the fundamental frequency. (See Meddis and Hewitt, 1991)

Autocorrelation model Pros: Model can deal with both resolved and unresolved harmonics Predicts no effect of phase for resolved harmonics, but strong phase effects for unresolved harmonics, in line with data (Meddis & Hewitt, 1991). Predicts a dominance region of pitch, roughly in line with early psychophysical data, due to reduction in phase locking with frequency. Cons: Deals too well with unresolved harmonics predicts no difference based on resolvability, in contrast to psychophysical data (Carlyon and Shackleton, 1994). Dominance region based on absolute, not relative, frequency, in contrast to data. [N.B. The template model of Shamma and Klein (2000) involves place and timing coding, but not in the traditional sense.]

Regular Interval Noise Delay (d) Gain (g) Noise (X(t)) +- Rippled noise g d Noise (X(t)) +- Comb-filtered noise g d g d +- +- Noise (X(t)) Iterated rippled noise

Musical intervals: Consonance and Dissonance In the West, the equal-tempered scale has been adopted, with the octave split into twelve equal steps on a log scale, i.e., 4 semitones higher is 2 4/12 times higher in frequency. This is a compromise: the intervals in the harmonic series only approximate the notes of the scale. Octave Fifth Fourth Maj. 3rd f0 2f0 3f0 4f0 5f0 6f0 8f0 log(f) 7f0 Perceived dissonance is in part due to beating effects between neighboring harmonics. Remaining effect of perceived consonance and dissonance may be simply cultural.

Auditory Grouping and Pitch Simultaneous, harmonically related tones tend to form a single auditory object, which makes ecological sense. What happens if one component is slightly out of tune? Harmonicity can be a strong cue in binding components together, but it can be overridden by competing cues or expectations (Darwin et al., 1994). A mistuned harmonic can be heard out more easily, but can still contribute to the overall pitch of the complex. This is an example of duplex perception.

Demonstrations of Simultaneous Grouping Principles Demo 1: Mistuning of a single component. (Track 18) Demo 2: Two groups of harmonics: Effect of common fate or inharmonicity? (Track 19) Demo 3: Simultaneous vs. sequential streaming and effects of onset disparity. (Track 22).

Timbre is: Recognizing Sounds That attribute of auditory sensation in terms of which a listener can judge that two sounds similarly presented and having the same loudness and pitch are dissimilar. (ANSI, 1960) Effect of Spectrum on Timbre Example 1: Example 2: Hemony clarion bell Guitar tone

Traditional view: Spectrum dominant in determining timbre; auditory system relatively insensitive to phase. However, most sounds can be easily recognized despite very large spectral distortions, due to room acoustics or poorquality electroacoustic transmission. Effect of Temporal Envelope on Timbre Piano tone Reversed piano tone Time Damped and ramped sinusoids also have very different sound qualities (Patterson, 1994). Neural correlates are currently a subject of research (e.g., Lu et al., 2001). Analysis and resynthesis has revealed the importance of dynamic spectrotemporal profile, as well as the role of inharmonic tones in musical instruments. Spectral and temporal properties of most instruments change with F0 and level, making realistic synthesis very difficult.

Sequential Stream Segregation Pure tones Demo 1: Stream segregation depends on frequency and repetition rate. (Track 1) Demo 2: Segregation allows the recognition of a melody within one stream. (Track 5) Demo 3: Segregation implies loss of rhythmic information across streams. (Track 3)

Sequential Segregation of Complex Tones Peripheral channeling (tonotopic), or higher level object recognition? Van Noorden (1975): segregation of complex tones with the same F0 but different harmonics: Evidence for role of peripheral filtering? Hartmann and Johnson (1991): Interleaved melodies. Varied timbre and lateralization: Results suggest dominance of peripheral channeling. However, higher level processes are clearly involved (repetition rate, build-up effects, etc.)

Periodicity vs. Spectrum Vliegen and Oxenham (1999) measured streaming using unresolved harmonic complexes with different F0s. High F0 Frequency Low F0 Bandpass filter No difference in streaming ability found for resolved or unresolved harmonics: Differences in spectrum are sufficient but not necessary for streaming. Results not compatible with peripheral channeling hypothesis.