AUD 6306 Speech Science

Dr. Peter Assmann
Spring semester 2[…]

Role of Pitch Information
Pitch contour is the primary cue for tone recognition.
Tonal languages rely on pitch level and pitch differences to convey lexical meaning within syllables.
Pitch helps to segregate auditory components that come from different sound sources.

Pitch and consonant voicing
Voice pitch is higher following a voiceless consonant than following a voiced consonant.
Listeners perceive these small changes: voicing judgments are influenced by the F0 of the following vowel.
Example: "a d a" (lower F0) versus "a t a" (higher F0).

Size variation in natural speech
Adults' voices and children's voices differ in fundamental frequency and in formant frequencies.
[Figure: F0 as a function of age and sex. Axes: age (years), fundamental frequency (Hz).]
[Figure: Formant frequencies (FFs) as a function of age and sex. Axes: age (years), geometric mean formant frequency (Hz).]

Pitch and vowel identification
There is a systematic relationship between F0 and formant frequencies across voices: low-pitched voices tend to have lower formants than high-pitched voices, and vice versa.
[Figure: Relationship between F0 and FFs. Log mean FF (Hz) plotted against log mean F0 (Hz), with R², p, and N reported.]

Pitch and vowel identification
Frequency-shifted speech is more intelligible, and is perceived as more natural, when the normal co-variation of F0 and formant frequencies is preserved, even when the frequency shifts approach or exceed the range found in human speech.
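The figures in this part of the lecture plot a geometric mean formant frequency against F0 on log axes. As a rough illustration of why that summary is convenient, the sketch below computes the geometric mean of three formants and shows that a uniform scaling of all formants becomes a constant offset in the log domain. The formant values and the scale factor of 1.4 are hypothetical numbers chosen for illustration; they are not data from the lecture.

```python
import math


def geometric_mean(values):
    """Geometric mean, computed as exp of the mean of the natural logs."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


# Hypothetical F1, F2, F3 values in Hz (illustration only).
adult_formants = [500.0, 1500.0, 2500.0]

# A smaller vocal tract is modeled here as a uniform upward scaling (assumption).
scale = 1.4
child_formants = [scale * f for f in adult_formants]

gm_adult = geometric_mean(adult_formants)
gm_child = geometric_mean(child_formants)

# Uniform scaling of every formant scales the geometric mean by the same factor,
# which is a constant shift of log(scale) on a log-frequency axis.
log_shift = math.log(gm_child) - math.log(gm_adult)
print(f"geometric mean (adult): {gm_adult:.1f} Hz")
print(f"geometric mean (child): {gm_child:.1f} Hz")
print(f"log shift: {log_shift:.4f}, log(scale): {math.log(scale):.4f}")
```

Working with logs is one plausible reason the regression in the figure is drawn on log axes: proportional changes in F0 and formant frequencies then appear as straight-line relationships.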

[Figure: Comparison of vowel identification accuracy (red circles) and HINT sentence recognition (blue circles). Axes: geometric mean of F1-F2-F3 (Hz), identification accuracy (%).]

[Figure: Geometric mean formant frequency (Hz) plotted against geometric mean F0 (Hz), medians of vowels per talker, with a linear fit of FF on F0. Panels: original (unscaled) voices; age-transformed (gender preserved); age-transformed (gender swapped).]

Cocktail Party Effect
Colin Cherry (1953) coined the term "cocktail party effect" to describe the ability of listeners to attend to a single talker in a mixture of conversations and other background noises.

Cocktail Party Phenomenon
Cherry's experiments involved listening to two different messages presented to one or both ears, on the same pitch or on different pitches, spoken by two talkers of the same gender or of different genders.

Cocktail Party Phenomenon
Cherry concluded that listeners rely on several cues to follow a conversation in the presence of competing voices, including:
Spatial separation
Pitch differences
Gender differences

Auditory scene analysis
The sound that reaches the eardrum of the listener is often a mixture of different sources.
Acoustic signals originating from different sound sources combine additively.
Unlike in vision, the concept of occlusion is hard to define in audition: sounds overlap but also combine in complex ways.
Auditory scene analysis (ASA) is the process by which the auditory system organizes complex mixtures of sound.
ASA involves grouping processes, in which sound components that are likely to have come from the same environmental source are linked to form a single perceptual unit.

Computational auditory scene analysis
Reviewed by Cooke and Ellis (2001).
Human listeners are good at separating mixtures of sounds, as reflected in speech communication and listening to music in complex listening environments (cocktail parties).
Attempts to reproduce this separation process using computational models have had limited success (a hard problem!).

Auditory scene analysis
Demonstration: when a sequence of tones made up of alternating low and high notes is played to listeners at a slow rate, they hear low and high tones alternating. At high rates they hear two separate streams, one high and one low.
Bregman used the term "stream segregation" to describe the auditory processes that group together sounds that share common features and segregate them from sounds that differ.
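The alternating-tone demonstration can be approximated with a few lines of code. The sketch below builds a slow and a fast version of a low/high tone sequence as lists of samples. The tone frequencies (400 and 800 Hz), the tone durations (400 ms and 80 ms), the sample rate, and the function names are all assumptions made for illustration; the lecture does not specify the parameters of its demonstration.

```python
import math

SAMPLE_RATE = 16000  # samples per second (assumed)


def tone(freq_hz, dur_s, sample_rate=SAMPLE_RATE):
    """Sine tone with 5 ms raised-cosine onset and offset ramps to avoid clicks."""
    n = int(round(dur_s * sample_rate))
    ramp = max(1, int(0.005 * sample_rate))
    samples = []
    for i in range(n):
        gain = 1.0
        if i < ramp:
            gain = 0.5 * (1 - math.cos(math.pi * i / ramp))
        elif i >= n - ramp:
            gain = 0.5 * (1 - math.cos(math.pi * (n - 1 - i) / ramp))
        samples.append(gain * math.sin(2 * math.pi * freq_hz * i / sample_rate))
    return samples


def alternating_sequence(low_hz, high_hz, tone_dur_s, n_pairs):
    """Concatenate low/high tone pairs: low, high, low, high, ..."""
    seq = []
    for _ in range(n_pairs):
        seq.extend(tone(low_hz, tone_dur_s))
        seq.extend(tone(high_hz, tone_dur_s))
    return seq


# Slow presentation: listeners typically report one alternating sequence.
slow = alternating_sequence(400.0, 800.0, 0.400, n_pairs=4)
# Fast presentation: the same tones tend to split into a low and a high stream.
fast = alternating_sequence(400.0, 800.0, 0.080, n_pairs=4)

print(len(slow) / SAMPLE_RATE, "s slow;", len(fast) / SAMPLE_RATE, "s fast")
```

To listen to the result, the sample lists could be scaled to 16-bit integers and written out with Python's standard `wave` module. Whether a given rate and frequency separation produces one stream or two varies across listeners.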

Stream segregation
When the sequence splits into two separate streams, it is difficult to attend to the low and high streams at the same time. It is much easier to listen for a subset of 3 tones from the sequence when they belong to the same stream.

Stream segregation
At slow rates we hear alternating low and high notes. At faster rates we hear two separate streams of low and high notes.

Stream segregation
[Figure: Sets A and B. Standard = repeating 3-tone cycle. Left panels: within-stream. Right panels: across-stream.]
Which set (A or B) preserves the standard more effectively?

Auditory grouping principles
Figure-ground phenomenon
Proximity
Good continuation
Closure
Common fate
Old-plus-new heuristic
Grouping and segregation are complementary processes.
Sequential grouping
Simultaneous grouping

Grouping by timbre
Tones that deviate from a rising/falling pattern pop out of the sequence.

Grouping by onset
Harmonics of speech sounds or music that start and stop at the same time are grouped together (Gestalt law of common fate).

Grouping by onset
Rasch (1978) showed that it is easier to distinguish two tones from one another when the onset of the first precedes the onset of the second by a short time interval.
[Figure: Tone 1 and Tone 2 plotted against time.]

Grouping by onset
Darwin (1981) showed that a harmonic which starts or stops before the remaining harmonics of a vowel is (partially) excluded from the vowel percept.

Grouping by onset
Darwin (1981): the onset of harmonic #[…] is earlier than that of any of the remaining harmonics. Vowel quality shifts in the direction of lower F1, as if the harmonic had been removed.
[Figure: Synchronous harmonics versus asynchronous harmonics.]
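The onset-asynchrony manipulation can be sketched in code as a harmonic complex in which one component starts before the rest. This is only a schematic illustration: it uses equal-amplitude harmonics rather than a vowel spectrum, and the F0 (125 Hz), the choice of harmonic 4, the 30 ms lead, and the function name are hypothetical values, not the parameters of the experiments cited above.

```python
import math


def harmonic_complex(f0_hz, n_harmonics, dur_s,
                     lead_harmonic=None, lead_s=0.0, sample_rate=16000):
    """Sum equal-amplitude harmonics of f0_hz.

    If lead_harmonic is given, that harmonic starts lead_s seconds before
    the others; all harmonics stop at the same time.
    """
    lead_n = int(round(lead_s * sample_rate)) if lead_harmonic else 0
    body_n = int(round(dur_s * sample_rate))
    out = [0.0] * (lead_n + body_n)
    for h in range(1, n_harmonics + 1):
        start = 0 if h == lead_harmonic else lead_n
        for i in range(start, lead_n + body_n):
            phase = 2 * math.pi * h * f0_hz * (i - start) / sample_rate
            out[i] += math.sin(phase)
    return out


# All harmonics start together.
synchronous = harmonic_complex(125.0, 10, 0.1)
# Harmonic 4 (500 Hz) leads the others by 30 ms (hypothetical values).
asynchronous = harmonic_complex(125.0, 10, 0.1, lead_harmonic=4, lead_s=0.03)

print(len(synchronous), len(asynchronous))
```

During the lead interval only the early harmonic is present, which is the acoustic cue that, according to the findings summarized above, leads listeners to treat it as partly separate from the vowel.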

Schema-based grouping
When one harmonic is gradually ramped up in level, we hear a slight, gradual change in vowel quality.

Schema-based grouping
When the harmonic is augmented by more than […] dB relative to its level in the original vowel, we hear a slight change in vowel quality and we hear an extra superimposed tone.
[Figure: Uniform-amplitude harmonics with one ramped harmonic; normal-amplitude harmonics with one harmonic augmented in level (dB).]

Principle of good continuation
When sounds are interrupted by silence (e.g., signal dropouts or faulty communication lines) or by interfering sounds, the sound is heard as continuous: the auditory system fills in the missing pieces.

Periodicity and noise
Hypothesis: periodicity of speech contributes to robustness.
Harmonicity in the frequency domain
Across-frequency grouping of spectral features
Unvoiced sounds (e.g., whispered speech) are more susceptible to masking and interference by competing sounds.

F0 and voice separation
When two people speak at the same time, it is often easier to understand what they say if the pitches of their voices differ, for example if one voice is male and the other is female.

F0 and voice separation
Hypothesis 1: voice separation becomes easier and intelligibility improves as the pitch (F0) difference between the voices is increased.

F0 and voice separation
Voice pitch is rarely constant in natural speech; it changes over time (the melody of speech, or prosody).
Time variation in voice pitch may help listeners to track a target voice in a mixture of voices.

F0 and voice separation
Hypothesis 2: voice separation is easier when the natural variation in pitch (F0) is present, and becomes more difficult when the pitch is held constant (monotone).

F0 and voice separation
A high-quality speech vocoder was used to construct pairs of sentences on different F0s.
The F0 difference between the two voices was manipulated (0, 1, 2, 4, or […] semitones).
F0 was either constant or variable (natural pitch contour).
[Example stimuli and results figures; legend entries: 1st F0, 2nd F0.]

Results
Significant improvement with increasing F0 difference.
No effect of F0 modulation.
No interaction of F0 difference x F0 modulation.
Marginally higher scores for intoned sentences at 0 and 1 semitones may stem from momentary differences in F0 between the sentences.
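Because the F0 differences in this experiment are expressed in semitones, it may help to see how a semitone step maps onto frequency in Hz. The sketch below uses the standard equal-tempered definition (12 semitones per octave, so one semitone is a frequency ratio of 2^(1/12)). The 100 Hz base F0 and the function names are assumptions for illustration, not values taken from the study.

```python
import math


def shift_semitones(f0_hz, semitones):
    """Return f0_hz shifted by a number of semitones (12 semitones = 1 octave)."""
    return f0_hz * 2.0 ** (semitones / 12.0)


def semitone_difference(f0_a_hz, f0_b_hz):
    """Signed distance from f0_a_hz to f0_b_hz in semitones."""
    return 12.0 * math.log2(f0_b_hz / f0_a_hz)


base_f0 = 100.0  # hypothetical F0 of the lower voice, in Hz

# Steps named in the text, plus 12 semitones (one octave) for reference.
for st in (0, 1, 2, 4, 12):
    other_f0 = shift_semitones(base_f0, st)
    print(f"{st:2d} semitones above {base_f0:.0f} Hz -> {other_f0:.1f} Hz")
```

One consequence of the logarithmic scale is that a fixed semitone difference corresponds to a larger difference in Hz for a higher-pitched voice than for a lower-pitched one.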

Brungart et al. (2001)
[Figures: 2-talker correct responses (%) plotted against target-to-masker ratio (dB). Masker conditions in the legend: same talker; different talker, same sex; different talker, different sex; modulated noise.]

Midterm take-home exam
In a recent review of the literature, Smiljanić and Bradlow (2009) describe a speaking style called clear speech and its effect on intelligibility in different populations of listeners. Your assignment is to review the literature on clear speech, using both primary and secondary sources, to summarize what is known about clear speech and what these findings imply for speech production and perception.

Midterm take-home exam
Identify a set of organizing principles or themes that emerge from current research and use these as section headers for your paper. There is no set length for the paper, but […]-[…] (double-spaced) pages is a reasonable target.

Midterm take-home exam
Two review papers to get started:
1. Smiljanić, R., Bradlow, A.R. (2009). Speaking and Hearing Clearly: Talker and Listener Factors in Speaking Style Changes. Lang Linguist Compass. 3(1).
2. Uchanski, R.M. Clear speech. In: Pisoni, D.B.; Remez, R., editors. The handbook of speech perception. Malden, MA/Oxford, UK: Blackwell; 2005. p. 207-35.

Voice segregation by difference in fundamental frequency: Effect of masker type

Voice segregation by difference in fundamental frequency: Effect of masker type Voice segregation by difference in fundamental frequency: Effect of masker type Mickael L. D. Deroche a) Department of Otolaryngology, Johns Hopkins University School of Medicine, 818 Ross Research Building,

More information

Topic 1. Auditory Scene Analysis

Topic 1. Auditory Scene Analysis Topic 1 Auditory Scene Analysis What is Scene Analysis? (from Bregman s ASA book, Figure 1.2) ECE 477 - Computer Audition, Zhiyao Duan 2018 2 Auditory Scene Analysis The cocktail party problem (From http://www.justellus.com/)

More information

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound Pitch Perception and Grouping HST.723 Neural Coding and Perception of Sound Pitch Perception. I. Pure Tones The pitch of a pure tone is strongly related to the tone s frequency, although there are small

More information

Music Perception with Combined Stimulation

Music Perception with Combined Stimulation Music Perception with Combined Stimulation Kate Gfeller 1,2,4, Virginia Driscoll, 4 Jacob Oleson, 3 Christopher Turner, 2,4 Stephanie Kliethermes, 3 Bruce Gantz 4 School of Music, 1 Department of Communication

More information

Auditory Illusions. Diana Deutsch. The sounds we perceive do not always correspond to those that are

Auditory Illusions. Diana Deutsch. The sounds we perceive do not always correspond to those that are In: E. Bruce Goldstein (Ed) Encyclopedia of Perception, Volume 1, Sage, 2009, pp 160-164. Auditory Illusions Diana Deutsch The sounds we perceive do not always correspond to those that are presented. When

More information

Precedence-based speech segregation in a virtual auditory environment

Precedence-based speech segregation in a virtual auditory environment Precedence-based speech segregation in a virtual auditory environment Douglas S. Brungart a and Brian D. Simpson Air Force Research Laboratory, Wright-Patterson AFB, Ohio 45433 Richard L. Freyman University

More information

Auditory scene analysis

Auditory scene analysis Harvard-MIT Division of Health Sciences and Technology HST.723: Neural Coding and Perception of Sound Instructor: Christophe Micheyl Auditory scene analysis Christophe Micheyl We are often surrounded by

More information

Behavioral and neural identification of birdsong under several masking conditions

Behavioral and neural identification of birdsong under several masking conditions Behavioral and neural identification of birdsong under several masking conditions Barbara G. Shinn-Cunningham 1, Virginia Best 1, Micheal L. Dent 2, Frederick J. Gallun 1, Elizabeth M. McClaine 2, Rajiv

More information

Informational masking of speech produced by speech-like sounds without linguistic content

Informational masking of speech produced by speech-like sounds without linguistic content Informational masking of speech produced by speech-like sounds without linguistic content Jing Chen, Huahui Li, Liang Li, and Xihong Wu a) Department of Machine Intelligence, Speech and Hearing Research

More information

Topic 10. Multi-pitch Analysis

Topic 10. Multi-pitch Analysis Topic 10 Multi-pitch Analysis What is pitch? Common elements of music are pitch, rhythm, dynamics, and the sonic qualities of timbre and texture. An auditory perceptual attribute in terms of which sounds

More information

Demonstrations. to accompany Bregman s. Auditory Scene Analysis. The perceptual organization of sound MIT Press, 1990

Demonstrations. to accompany Bregman s. Auditory Scene Analysis. The perceptual organization of sound MIT Press, 1990 Demonstrations to accompany Bregman s Auditory Scene Analysis The perceptual organization of sound MIT Press, 1990 Albert S. Bregman Pierre A. Ahad Department of Psychology Auditory Research Laboratory

More information

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring 2009 Week 6 Class Notes Pitch Perception Introduction Pitch may be described as that attribute of auditory sensation in terms

More information

Processing Linguistic and Musical Pitch by English-Speaking Musicians and Non-Musicians

Processing Linguistic and Musical Pitch by English-Speaking Musicians and Non-Musicians Proceedings of the 20th North American Conference on Chinese Linguistics (NACCL-20). 2008. Volume 1. Edited by Marjorie K.M. Chan and Hana Kang. Columbus, Ohio: The Ohio State University. Pages 139-145.

More information

Quarterly Progress and Status Report. Violin timbre and the picket fence

Quarterly Progress and Status Report. Violin timbre and the picket fence Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Violin timbre and the picket fence Jansson, E. V. journal: STL-QPSR volume: 31 number: 2-3 year: 1990 pages: 089-095 http://www.speech.kth.se/qpsr

More information

Speaking in Minor and Major Keys

Speaking in Minor and Major Keys Chapter 5 Speaking in Minor and Major Keys 5.1. Introduction 28 The prosodic phenomena discussed in the foregoing chapters were all instances of linguistic prosody. Prosody, however, also involves extra-linguistic

More information

MEMORY & TIMBRE MEMT 463

MEMORY & TIMBRE MEMT 463 MEMORY & TIMBRE MEMT 463 TIMBRE, LOUDNESS, AND MELODY SEGREGATION Purpose: Effect of three parameters on segregating 4-note melody among distraction notes. Target melody and distractor melody utilized.

More information

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics) 1 Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics) Pitch Pitch is a subjective characteristic of sound Some listeners even assign pitch differently depending upon whether the sound was

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Psychological and Physiological Acoustics Session 1pPPb: Psychoacoustics

More information

Acoustic and musical foundations of the speech/song illusion

Acoustic and musical foundations of the speech/song illusion Acoustic and musical foundations of the speech/song illusion Adam Tierney, *1 Aniruddh Patel #2, Mara Breen^3 * Department of Psychological Sciences, Birkbeck, University of London, United Kingdom # Department

More information

Rhythm and Melody Aspects of Language and Music

Rhythm and Melody Aspects of Language and Music Rhythm and Melody Aspects of Language and Music Dafydd Gibbon Guangzhou, 25 October 2016 Orientation Orientation - 1 Language: focus on speech, conversational spoken language focus on complex behavioural

More information

Polyrhythms Lawrence Ward Cogs 401

Polyrhythms Lawrence Ward Cogs 401 Polyrhythms Lawrence Ward Cogs 401 What, why, how! Perception and experience of polyrhythms; Poudrier work! Oldest form of music except voice; some of the most satisfying music; rhythm is important in

More information

Acoustic Prosodic Features In Sarcastic Utterances

Acoustic Prosodic Features In Sarcastic Utterances Acoustic Prosodic Features In Sarcastic Utterances Introduction: The main goal of this study is to determine if sarcasm can be detected through the analysis of prosodic cues or acoustic features automatically.

More information

Music Representations

Music Representations Lecture Music Processing Music Representations Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Book: Fundamentals of Music Processing Meinard Müller Fundamentals

More information

SHORT TERM PITCH MEMORY IN WESTERN vs. OTHER EQUAL TEMPERAMENT TUNING SYSTEMS

SHORT TERM PITCH MEMORY IN WESTERN vs. OTHER EQUAL TEMPERAMENT TUNING SYSTEMS SHORT TERM PITCH MEMORY IN WESTERN vs. OTHER EQUAL TEMPERAMENT TUNING SYSTEMS Areti Andreopoulou Music and Audio Research Laboratory New York University, New York, USA aa1510@nyu.edu Morwaread Farbood

More information

HST 725 Music Perception & Cognition Assignment #1 =================================================================

HST 725 Music Perception & Cognition Assignment #1 ================================================================= HST.725 Music Perception and Cognition, Spring 2009 Harvard-MIT Division of Health Sciences and Technology Course Director: Dr. Peter Cariani HST 725 Music Perception & Cognition Assignment #1 =================================================================

More information

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high.

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high. Pitch The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high. 1 The bottom line Pitch perception involves the integration of spectral (place)

More information

Automatic Laughter Detection

Automatic Laughter Detection Automatic Laughter Detection Mary Knox Final Project (EECS 94) knoxm@eecs.berkeley.edu December 1, 006 1 Introduction Laughter is a powerful cue in communication. It communicates to listeners the emotional

More information

1 Introduction to PSQM

1 Introduction to PSQM A Technical White Paper on Sage s PSQM Test Renshou Dai August 7, 2000 1 Introduction to PSQM 1.1 What is PSQM test? PSQM stands for Perceptual Speech Quality Measure. It is an ITU-T P.861 [1] recommended

More information

UNIVERSITY OF DUBLIN TRINITY COLLEGE

UNIVERSITY OF DUBLIN TRINITY COLLEGE UNIVERSITY OF DUBLIN TRINITY COLLEGE FACULTY OF ENGINEERING & SYSTEMS SCIENCES School of Engineering and SCHOOL OF MUSIC Postgraduate Diploma in Music and Media Technologies Hilary Term 31 st January 2005

More information

Improving Frame Based Automatic Laughter Detection

Improving Frame Based Automatic Laughter Detection Improving Frame Based Automatic Laughter Detection Mary Knox EE225D Class Project knoxm@eecs.berkeley.edu December 13, 2007 Abstract Laughter recognition is an underexplored area of research. My goal for

More information

EMS : Electroacoustic Music Studies Network De Montfort/Leicester 2007

EMS : Electroacoustic Music Studies Network De Montfort/Leicester 2007 AUDITORY SCENE ANALYSIS AND SOUND SOURCE COHERENCE AS A FRAME FOR THE PERCEPTUAL STUDY OF ELECTROACOUSTIC MUSIC LANGUAGE Blas Payri, José Luis Miralles Bono Universidad Politécnica de Valencia, Campus

More information

Auditory Stream Segregation (Sequential Integration)

Auditory Stream Segregation (Sequential Integration) Auditory Stream Segregation (Sequential Integration) David Meredith Department of Computing, City University, London. dave@titanmusic.com www.titanmusic.com MSc/Postgraduate Diploma in Music Information

More information

German Center for Music Therapy Research

German Center for Music Therapy Research Effects of music therapy for adult CI users on the perception of music, prosody in speech, subjective self-concept and psychophysiological arousal Research Network: E. Hutter, M. Grapp, H. Argstatter,

More information

Timbre perception

Timbre perception Harvard-MIT Division of Health Sciences and Technology HST.725: Music Perception and Cognition Prof. Peter Cariani Timbre perception www.cariani.com Timbre perception Timbre: tonal quality ( pitch, loudness,

More information

Pitch is one of the most common terms used to describe sound.

Pitch is one of the most common terms used to describe sound. ARTICLES https://doi.org/1.138/s41562-17-261-8 Diversity in pitch perception revealed by task dependence Malinda J. McPherson 1,2 * and Josh H. McDermott 1,2 Pitch conveys critical information in speech,

More information

Experiments on tone adjustments

Experiments on tone adjustments Experiments on tone adjustments Jesko L. VERHEY 1 ; Jan HOTS 2 1 University of Magdeburg, Germany ABSTRACT Many technical sounds contain tonal components originating from rotating parts, such as electric

More information

Topic 4. Single Pitch Detection

Topic 4. Single Pitch Detection Topic 4 Single Pitch Detection What is pitch? A perceptual attribute, so subjective Only defined for (quasi) harmonic sounds Harmonic sounds are periodic, and the period is 1/F0. Can be reliably matched

More information

Release from speech-on-speech masking in a front-and-back geometry

Release from speech-on-speech masking in a front-and-back geometry Release from speech-on-speech masking in a front-and-back geometry Neil L. Aaronson Department of Physics and Astronomy, Michigan State University, Biomedical and Physical Sciences Building, East Lansing,

More information

Informational Masking and Trained Listening. Undergraduate Honors Thesis

Informational Masking and Trained Listening. Undergraduate Honors Thesis Informational Masking and Trained Listening Undergraduate Honors Thesis Presented in partial fulfillment of requirements for the Degree of Bachelor of the Arts by Erica Laughlin The Ohio State University

More information

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods Kazuyoshi Yoshii, Masataka Goto and Hiroshi G. Okuno Department of Intelligence Science and Technology National

More information

Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals. By: Ed Doering

Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals. By: Ed Doering Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals By: Ed Doering Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals By: Ed Doering Online:

More information

The presence of multiple sound sources is a routine occurrence

The presence of multiple sound sources is a routine occurrence Spectral completion of partially masked sounds Josh H. McDermott* and Andrew J. Oxenham Department of Psychology, University of Minnesota, N640 Elliott Hall, 75 East River Road, Minneapolis, MN 55455-0344

More information

Acoustic Scene Classification

Acoustic Scene Classification Acoustic Scene Classification Marc-Christoph Gerasch Seminar Topics in Computer Music - Acoustic Scene Classification 6/24/2015 1 Outline Acoustic Scene Classification - definition History and state of

More information

Analysis, Synthesis, and Perception of Musical Sounds

Analysis, Synthesis, and Perception of Musical Sounds Analysis, Synthesis, and Perception of Musical Sounds The Sound of Music James W. Beauchamp Editor University of Illinois at Urbana, USA 4y Springer Contents Preface Acknowledgments vii xv 1. Analysis

More information

Temporal Envelope and Periodicity Cues on Musical Pitch Discrimination with Acoustic Simulation of Cochlear Implant

Temporal Envelope and Periodicity Cues on Musical Pitch Discrimination with Acoustic Simulation of Cochlear Implant Temporal Envelope and Periodicity Cues on Musical Pitch Discrimination with Acoustic Simulation of Cochlear Implant Lichuan Ping 1, 2, Meng Yuan 1, Qinglin Meng 1, 2 and Haihong Feng 1 1 Shanghai Acoustics

More information

9.35 Sensation And Perception Spring 2009

9.35 Sensation And Perception Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 9.35 Sensation And Perception Spring 29 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. Hearing Kimo Johnson April

More information

PHY 103 Auditory Illusions. Segev BenZvi Department of Physics and Astronomy University of Rochester

PHY 103 Auditory Illusions. Segev BenZvi Department of Physics and Astronomy University of Rochester PHY 103 Auditory Illusions Segev BenZvi Department of Physics and Astronomy University of Rochester Reading Reading for this week: Music, Cognition, and Computerized Sound: An Introduction to Psychoacoustics

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Musical Acoustics Session 3pMU: Perception and Orchestration Practice

More information

Chapter Two: Long-Term Memory for Timbre

Chapter Two: Long-Term Memory for Timbre 25 Chapter Two: Long-Term Memory for Timbre Task In a test of long-term memory, listeners are asked to label timbres and indicate whether or not each timbre was heard in a previous phase of the experiment

More information

Expressive Singing Synthesis based on Unit Selection for the Singing Synthesis Challenge 2016

Expressive Singing Synthesis based on Unit Selection for the Singing Synthesis Challenge 2016 Expressive Singing Synthesis based on Unit Selection for the Singing Synthesis Challenge 2016 Jordi Bonada, Martí Umbert, Merlijn Blaauw Music Technology Group, Universitat Pompeu Fabra, Spain jordi.bonada@upf.edu,

More information

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES Vishweshwara Rao and Preeti Rao Digital Audio Processing Lab, Electrical Engineering Department, IIT-Bombay, Powai,

More information

Music Representations

Music Representations Advanced Course Computer Science Music Processing Summer Term 00 Music Representations Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Music Representations Music Representations

More information

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications Matthias Mauch Chris Cannam György Fazekas! 1 Matthias Mauch, Chris Cannam, George Fazekas Problem Intonation in Unaccompanied

More information

EMERGENT SOUNDSCAPE COMPOSITION: REFLECTIONS ON VIRTUALITY

EMERGENT SOUNDSCAPE COMPOSITION: REFLECTIONS ON VIRTUALITY EMERGENT SOUNDSCAPE COMPOSITION: REFLECTIONS ON VIRTUALITY by Mark Christopher Brady Bachelor of Science (Honours), University of Cape Town, 1994 THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS

More information

We realize that this is really small, if we consider that the atmospheric pressure 2 is

We realize that this is really small, if we consider that the atmospheric pressure 2 is PART 2 Sound Pressure Sound Pressure Levels (SPLs) Sound consists of pressure waves. Thus, a way to quantify sound is to state the amount of pressure 1 it exertsrelatively to a pressure level of reference.

More information

Commentary on David Huron s On the Role of Embellishment Tones in the Perceptual Segregation of Concurrent Musical Parts

Commentary on David Huron s On the Role of Embellishment Tones in the Perceptual Segregation of Concurrent Musical Parts Commentary on David Huron s On the Role of Embellishment Tones in the Perceptual Segregation of Concurrent Musical Parts JUDY EDWORTHY University of Plymouth, UK ALICJA KNAST University of Plymouth, UK

More information

UNIT 1: QUALITIES OF SOUND. DURATION (RHYTHM)

UNIT 1: QUALITIES OF SOUND. DURATION (RHYTHM) UNIT 1: QUALITIES OF SOUND. DURATION (RHYTHM) 1. SOUND, NOISE AND SILENCE Essentially, music is sound. SOUND is produced when an object vibrates and it is what can be perceived by a living organism through

More information

AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY (F0) MARKERS IN SPEECH

AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY (F0) MARKERS IN SPEECH AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY (F0) MARKERS IN SPEECH by Princy Dikshit B.E (C.S) July 2000, Mangalore University, India A Thesis Submitted to the Faculty of Old Dominion University in

More information

1aAA14. The audibility of direct sound as a key to measuring the clarity of speech and music

1aAA14. The audibility of direct sound as a key to measuring the clarity of speech and music 1aAA14. The audibility of direct sound as a key to measuring the clarity of speech and music Session: Monday Morning, Oct 31 Time: 11:30 Author: David H. Griesinger Location: David Griesinger Acoustics,

More information

Automatic Laughter Detection

Automatic Laughter Detection Automatic Laughter Detection Mary Knox 1803707 knoxm@eecs.berkeley.edu December 1, 006 Abstract We built a system to automatically detect laughter from acoustic features of audio. To implement the system,

More information

ONLINE ACTIVITIES FOR MUSIC INFORMATION AND ACOUSTICS EDUCATION AND PSYCHOACOUSTIC DATA COLLECTION

ONLINE ACTIVITIES FOR MUSIC INFORMATION AND ACOUSTICS EDUCATION AND PSYCHOACOUSTIC DATA COLLECTION ONLINE ACTIVITIES FOR MUSIC INFORMATION AND ACOUSTICS EDUCATION AND PSYCHOACOUSTIC DATA COLLECTION Travis M. Doll Ray V. Migneco Youngmoo E. Kim Drexel University, Electrical & Computer Engineering {tmd47,rm443,ykim}@drexel.edu

More information

Brian C. J. Moore Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England

Brian C. J. Moore Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England Asymmetry of masking between complex tones and noise: Partial loudness Hedwig Gockel a) CNBH, Department of Physiology, University of Cambridge, Downing Street, Cambridge CB2 3EG, England Brian C. J. Moore

More information

Speech To Song Classification

Speech To Song Classification Speech To Song Classification Emily Graber Center for Computer Research in Music and Acoustics, Department of Music, Stanford University Abstract The speech to song illusion is a perceptual phenomenon

More information

Singing voice synthesis in Spanish by concatenation of syllables based on the TD-PSOLA algorithm

ALEJANDRO RAMOS-AMÉZQUITA Computer Science Department Tecnológico de Monterrey (Campus Ciudad de México)

Assessment may include recording to be evaluated by students, teachers, and/or administrators in addition to live performance evaluation.

Title of Unit: Choral Concert Performance Preparation Repertoire: Simple Gifts (Shaker Song). Adapted by Aaron Copland, Transcribed for Chorus by Irving Fine. Boosey & Hawkes, 1952. Level: NYSSMA Level

MUSI-6201 Computational Music Analysis

Part 9.1: Genre Classification alexander lerch November 4, 2015 temporal analysis overview text book Chapter 8: Musical Genre, Similarity, and Mood (pp. 151-155)

Voice & Music Pattern Extraction: A Review

1 Pooja Gautam 1 and B S Kaushik 2 Electronics & Telecommunication Department RCET, Bhilai, Bhilai (C.G.) India pooja0309pari@gmail.com 2 Electrical & Instrumentation

MELODIC AND RHYTHMIC CONTRASTS IN EMOTIONAL SPEECH AND MUSIC

Lena Quinto, William Forde Thompson, Felicity Louise Keating Psychology, Macquarie University, Australia lena.quinto@mq.edu.au Abstract Many

Making music with voice. Distinguished lecture, CIRMMT Jan 2009, Copyright Johan Sundberg

MENU: A: The instrument B: Getting heard C: Expressivity The instrument Summary RADIATED SPECTRUM Level Frequency Velum VOCAL TRACT Frequency curve Formants Level Level Frequency

LESSON 1 PITCH NOTATION AND INTERVALS

Fundamentals I, Unit I. Sounds that we perceive as being musical have four basic elements: pitch, loudness, timbre, and duration. Pitch is the relative

K-12 Performing Arts - Music Standards Lincoln Community School Sources: ArtsEdge - National Standards for Arts Education

Grades K-4 Students sing independently, on pitch and in rhythm, with appropriate

Psychoacoustics. lecturer:

lecturer: stephan.werner@tu-ilmenau.de Block Diagram of a Perceptual Audio Encoder loudness critical bands masking: frequency domain time domain binaural cues (overview) Source: Brandenburg,

The Tone Height of Multiharmonic Sounds. Introduction

Music Perception, Winter 1990, Vol. 8, No. 2, 203-214. © 1990 by the Regents of the University of California. ROY D. PATTERSON MRC Applied Psychology Unit, Cambridge,

Musical Illusions Diana Deutsch Department of Psychology University of California, San Diego La Jolla, CA 92093

ddeutsch@ucsd.edu In Squire, L. (Ed.) New Encyclopedia of Neuroscience, (Oxford, Elsevier,

Music Radar: A Web-based Query by Humming System

Lianjie Cao, Peng Hao, Chunmeng Zhou Computer Science Department, Purdue University, 305 N. University Street West Lafayette, IN 47907-2107 {cao62, pengh,

Audio Feature Extraction for Corpus Analysis

Anja Volk, Sound and Music Technology, 5 Dec 2017. Corpus analysis: What is corpus analysis? Study a large corpus of music for gaining insights on general trends

Effect of room acoustic conditions on masking efficiency

Hyojin Lee a, Graduate school, The University of Tokyo Komaba 4-6-1, Meguro-ku, Tokyo, 153-855, JAPAN Kanako Ueno b, Meiji University, JAPAN Higasimita

Sound Quality PSY 310 Greg Francis. Lecture 32. Sound perception

Prof. Greg Francis Name that tune! Sound perception An integral part of our modern world Billions are spent annually on Creation of new sounds or sound sequences

"The mind is a fire to be kindled, not a vessel to be filled." Plutarch

-21 Special Topics: Music Perception Winter, 2004 TTh 11:30 to 12:50 a.m., MAB 125 Dr. Scott D. Lipscomb, Associate Professor Office

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

Eugene Mikyung Kim Department of Music Technology, Korea National University of Arts eugene@u.northwestern.edu ABSTRACT

A CAPPELLA EAR TRAINING

A METHOD FOR UNDERSTANDING MUSIC THEORY VIA UNACCOMPANIED HARMONY SINGING HELEN RUSSELL FOREWORD TO STUDENTS EMBARKING ON AET COURSE You will be aware by now that participating

Measurement of overtone frequencies of a toy piano and perception of its pitch

PACS: 43.75.Mn ABSTRACT Akira Nishimura Department of Media and Cultural Studies, Tokyo University of Information Sciences,

EXPLAINING AND PREDICTING THE PERCEPTION OF MUSICAL STRUCTURE

JORDAN B. L. SMITH MATHEMUSICAL CONVERSATIONS STUDY DAY, 12 FEBRUARY 2015 RAFFLES INSTITUTION OUTLINE What is musical structure? How do people

Music 175: Pitch II. Tamara Smyth, Department of Music, University of California, San Diego (UCSD) June 2, 2015

Tamara Smyth, trsmyth@ucsd.edu Quantifying Pitch Logarithms We have seen several times so far that what

Application Of Missing Feature Theory To The Recognition Of Musical Instruments In Polyphonic Audio

Jana Eggink and Guy J. Brown Department of Computer Science, University of Sheffield Regent Court, 11

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Vishweshwara Rao, Sachin Pant, Madhumita Bhaskar and Preeti Rao Department of Electrical Engineering, IIT Bombay {vishu, sachinp,

A comparison of the acoustic vowel spaces of speech and song

Linguistic Research 35(2), 381-394 DOI: 10.17250/khisli.35.2.201806.006 Evan D. Bradley (The Pennsylvania State University Brandywine) Bradley,

Topics in Computer Music Instrument Identification. Ioanna Karydi

Presentation overview What is instrument identification? Sound attributes & Timbre Human performance The ideal algorithm Selected approaches

Curriculum Development In the Fairfield Public Schools FAIRFIELD PUBLIC SCHOOLS FAIRFIELD, CONNECTICUT MUSIC THEORY I

Board of Education Approved 04/24/2007 MUSIC THEORY I Statement of Purpose Music is

Harmony and tonality The vertical dimension. HST 725 Lecture 11 Music Perception & Cognition

Harvard-MIT Division of Health Sciences and Technology HST.725: Music Perception and Cognition Prof. Peter Cariani

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF)

"The reason I got into playing and producing music was its power to travel great distances and have an emotional impact on people" Quincey

Influence of tonal context and timbral variation on perception of pitch

Perception & Psychophysics 2002, 64 (2), 198-207 CATHERINE M. WARRIER and ROBERT J. ZATORRE McGill University and Montreal Neurological

An Integrated Music Chromaticism Model

DIONYSIOS POLITIS and DIMITRIOS MARGOUNAKIS Dept. of Informatics, School of Sciences Aristotle University of Thessaloniki University Campus, Thessaloniki, GR-541

Perceptual Considerations in Designing and Fitting Hearing Aids for Music Published on Friday, 14 March 2008 11:01

The components of music shed light on important aspects of hearing perception. To make

CHAPTER 20.2 SPEECH AND MUSICAL SOUNDS

Source: STANDARD HANDBOOK OF ELECTRONIC ENGINEERING Daniel W. Martin, Ronald M. Aarts SPEECH SOUNDS Speech Level and Spectrum Both the sound-pressure level and the

CSC475 Music Information Retrieval

Monophonic pitch extraction George Tzanetakis University of Victoria 2014 Table of Contents I 1 Motivation and Terminology 2 Psychoacoustics 3 F0

Pitch Perception. Roger Shepard

Ecological signals are complex, not simple sine tones, and not always periodic. Just noticeable difference (JND; Fechner) is the minimal physical change detectable

Perceiving Differences and Similarities in Music: Melodic Categorization During the First Years of Life

Author Eugenia Costa-Giomi Volume 8: Number 2 - Spring 2013 Eugenia Costa-Giomi University

Pitch-Synchronous Spectrogram: Principles and Applications

C. Julian Chen Department of Applied Physics and Applied Mathematics May 24, 2018 Outline The traditional spectrogram Observations with the electroglottograph

August Acoustics and Psychoacoustics Barbara Crowe Music Therapy Director. Notes from BC's copyrighted materials for IHTP

The Physics of Sound and Sound Perception. Sound is a word of perception used to report the aural, psychological sensation of physical vibration. Vibration is any form of to-and-fro motion. To perceive sound