EE513 Audio Signals and Systems. Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky

Size: px
Start display at page:

Download "EE513 Audio Signals and Systems. Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky"

Transcription

1 EE513 Audio Signals and Systems Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky

2 Question! If a tree falls in the forest and nobody is there to hear it, will it make a sound? Sound provided by

3 Ambiguity! Merriam-Webster Dictionary: Sound a : a particular auditory impression b:the sensation perceived by the sense of hearing c :mechanical radiant energy that is transmitted by longitudinal pressure waves in a material medium (as air) and is the objective cause of hearing.

4 Electronic Audio Systems Sound Sources Vibrations at 20Hz-20kHz Transmission Media Electoacoustic Transducer Playback Amplification, Signal Conditioning Information Extraction / Measurement Storage Processing for Intended Application

5 Natural Audio Systems

6 Synthetic Audio: Imitating Nature 1780 Wolfgang von Kemplen s Speaking Machine Mid 1800 s Charles Wheatstone Late 1800 s Alexander Graham Bell 1939 Homer Dudley s Voder Thaddeus Cahill s Telharmonium (First Music Synthesizer) 1919 Lev Theremin s Theremin

7 Speech Analysis and Synthesis Communication channels (acoustic and electric) 1874/1876 (Antonio Meucci s) Alexander Graham Bell s Telephone s Homer Dudley s Channel Vocoder first analysis-synthesis system

8 Voice-Coding Models The general speech model: Quasi-Periodic Pulsed Air Air Burst or Continuous flow Voiced Speech Unvoiced Speech Vocal Tract Filter Vocal Radiator Speech sounds can be analyzed by determining the states of the vocal system components (vocal chords, track, lips, tongue ) for each fundamental sound of speech (phoneme).

9 Spectral Analysis Voiced Speech Spectral envelop => vocal tract formants Harmonic peaks => vocal chord pitch

10 Time Analysis Voiced Speech Time envelop => Volume dynamics Oscillations => Vocal chord motion

11 Spectrogram Analysis There shoe She do She bread Then bed Frequency Time

12 Spectogram of CD sound Tell Me Ma - Spectrogram in db Hertz Seconds 15

13 Speech Recognition 1920 s Radio Rex 1950 s (Bell Labs) Digit Recognition Spectral/Formant analysis Filter Banks 1960 s Neural Networks 1970 s ARPA Project for Speech Understanding Applications of spectral analysis methods FFT, Cepstral/homomorphic, LPC 1970 s Application of pattern matching methods DTW, and HMM

14 Speech Recognition 1980 s Standardize Training and Test with Large Corpora (TIMIT) (RM) (DARPA) New Front Ends (feature extractors) more perceptually based Dominance/Development of HMM Backpropagation and Neural Networks Rule-Base AI systems

15 Specification of Speech Recognition Speaker dependent or independent Recognize isolated, continuous, or spot speech Vocabulary Size, Grammar Perplexity, Speaking style Recording conditions

16 Components of Speech Recognition Speech Transduction Acoustic/Electronic Front End Input Speech Detected Speech String Local Match Global Detector Language Model

17 Matlab Examples %% Create and play a 2 second 440 Hz tone in Matlab: fs = 8000; % Set a sampling frequency fq = 440; % frequency to play t = [0:round(2*fs)-1]/fs; % Sampled time axis sig = cos(2*pi*fq*t); % Create sampled signal soundsc(sig,fs) % Play it plot(t,sig); xlabel('seconds'); ylabel('amplitude') wavwrite(sig,fs,'t440.wav') clear % Remove all variables from work space %% Reload tone and weight it with a decaying exponential of time constant.6 seconds tc =.6; % Set time constant [y, fs] = wavread('t440.wav'); % read in wave file t =[0:length(y)-1]'/fs; % Create sampled time axis dw = exp(-t/tc); % Compute sampled decaying exponential dsig = y.*dw; % Multiply sinusoid with decaying exponential soundsc(dsig,fs) plot(t,dsig); xlabel('seconds'); ylabel('amplitude')

18 Matlab Examples Explore demo and help files >> help script SCRIPT About MATLAB scripts and M-files. A SCRIPT file is an external file that contains a sequence of MATLAB statements. By typing the filename, subsequent MATLAB input is obtained from the file. SCRIPT files have a filename extension of ".m" and are often called "M-files". To make a SCRIPT file into a function, see FUNCTION. See also type, echo. Reference page in Help browser doc script In the help window (click on question mark) Go through section on programming and then go to the demo tab and view a few of the demo.

19 Matlab Examples In class examples

20 Matlab Exercise Use the sine/cosine function in Matlab to write a function that generates a minor scale (for testing the function use start tones between 100 and 440 Hz with a sampling rate of 8 khz). Let the Matlab function input arguments be the starting frequency and the time interval for each scale tone in seconds. Let the output be a vector of samples that can be played with Matlab command soundsc(v,8000) (where v is the vector output of your function). The frequency range of a scale covers one octave, which implies the last frequency is twice the starting frequency. On most fixed pitch instruments, 12 semi-tones or half steps make up the notes within an octave. A minor scale sequentially increases by a whole, half, whole, whole, half, whole, and whole (8 notes altogether including the starting note).

21 Matlab Exercise - Scales Just Pythagorean Equal Temperament Interval - 0 (1) 1/1 = 1 1 = 1 2^(0)=1 Interval /15 256/243 2^(1/12) Interval - 2 (2) 10/9 (or 9/8) 9/8 2^(2/12) Interval - 3 (3) 6/5 32/27 2^(3/12) Interval - 4 5/4 81/64 2^(4/12) Interval - 5 (4) 4/3 4/3 2^(5/12) Interval /32 (or 64/45) 1024/729 (or 729/512) 2^(6/12) Interval - 7 (5) 3/2 3/2 2^(7/12) Interval - 8 (6) 8/5 128/81 2^(8/12) Interval - 9 5/3 27/16 2^(9/12) Interval - 10 (7) 7/4 (or 16/19 or 9/5) 16/9 2^(10/12) Interval /8 243/128 2^(11/12) Interval - 12 (8) 2/1 = 2 2/1 = 2 2^(12/12) = 2

22 Matlab Exercise Famous Notes Middle C = Hz (standard tuning) Concert A (A above middle C) = 440 Hz Middle C = 256 Hz (Scientific tuning) Lowest note on piano A=27.5 Hz Highest note on piano C=

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB Laboratory Assignment 3 Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB PURPOSE In this laboratory assignment, you will use MATLAB to synthesize the audio tones that make up a well-known

More information

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound Pitch Perception and Grouping HST.723 Neural Coding and Perception of Sound Pitch Perception. I. Pure Tones The pitch of a pure tone is strongly related to the tone s frequency, although there are small

More information

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam CTP 431 Music and Audio Computing Basic Acoustics Graduate School of Culture Technology (GSCT) Juhan Nam 1 Outlines What is sound? Generation Propagation Reception Sound properties Loudness Pitch Timbre

More information

2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics

2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics 2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics Graduate School of Culture Technology, KAIST Juhan Nam Outlines Introduction to musical tones Musical tone generation - String

More information

Pitch-Synchronous Spectrogram: Principles and Applications

Pitch-Synchronous Spectrogram: Principles and Applications Pitch-Synchronous Spectrogram: Principles and Applications C. Julian Chen Department of Applied Physics and Applied Mathematics May 24, 2018 Outline The traditional spectrogram Observations with the electroglottograph

More information

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics) 1 Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics) Pitch Pitch is a subjective characteristic of sound Some listeners even assign pitch differently depending upon whether the sound was

More information

2. AN INTROSPECTION OF THE MORPHING PROCESS

2. AN INTROSPECTION OF THE MORPHING PROCESS 1. INTRODUCTION Voice morphing means the transition of one speech signal into another. Like image morphing, speech morphing aims to preserve the shared characteristics of the starting and final signals,

More information

Lab P-6: Synthesis of Sinusoidal Signals A Music Illusion. A k cos.! k t C k / (1)

Lab P-6: Synthesis of Sinusoidal Signals A Music Illusion. A k cos.! k t C k / (1) DSP First, 2e Signal Processing First Lab P-6: Synthesis of Sinusoidal Signals A Music Illusion Pre-Lab: Read the Pre-Lab and do all the exercises in the Pre-Lab section prior to attending lab. Verification:

More information

International Journal of Computer Architecture and Mobility (ISSN ) Volume 1-Issue 7, May 2013

International Journal of Computer Architecture and Mobility (ISSN ) Volume 1-Issue 7, May 2013 Carnatic Swara Synthesizer (CSS) Design for different Ragas Shruti Iyengar, Alice N Cheeran Abstract Carnatic music is one of the oldest forms of music and is one of two main sub-genres of Indian Classical

More information

CTP431- Music and Audio Computing Musical Acoustics. Graduate School of Culture Technology KAIST Juhan Nam

CTP431- Music and Audio Computing Musical Acoustics. Graduate School of Culture Technology KAIST Juhan Nam CTP431- Music and Audio Computing Musical Acoustics Graduate School of Culture Technology KAIST Juhan Nam 1 Outlines What is sound? Physical view Psychoacoustic view Sound generation Wave equation Wave

More information

Welcome to Vibrationdata

Welcome to Vibrationdata Welcome to Vibrationdata Acoustics Shock Vibration Signal Processing February 2004 Newsletter Greetings Feature Articles Speech is perhaps the most important characteristic that distinguishes humans from

More information

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high.

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high. Pitch The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high. 1 The bottom line Pitch perception involves the integration of spectral (place)

More information

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring 2009 Week 6 Class Notes Pitch Perception Introduction Pitch may be described as that attribute of auditory sensation in terms

More information

UNIVERSITY OF DUBLIN TRINITY COLLEGE

UNIVERSITY OF DUBLIN TRINITY COLLEGE UNIVERSITY OF DUBLIN TRINITY COLLEGE FACULTY OF ENGINEERING & SYSTEMS SCIENCES School of Engineering and SCHOOL OF MUSIC Postgraduate Diploma in Music and Media Technologies Hilary Term 31 st January 2005

More information

We realize that this is really small, if we consider that the atmospheric pressure 2 is

We realize that this is really small, if we consider that the atmospheric pressure 2 is PART 2 Sound Pressure Sound Pressure Levels (SPLs) Sound consists of pressure waves. Thus, a way to quantify sound is to state the amount of pressure 1 it exertsrelatively to a pressure level of reference.

More information

Analysis of the effects of signal distance on spectrograms

Analysis of the effects of signal distance on spectrograms 2014 Analysis of the effects of signal distance on spectrograms SGHA 8/19/2014 Contents Introduction... 3 Scope... 3 Data Comparisons... 5 Results... 10 Recommendations... 10 References... 11 Introduction

More information

9.35 Sensation And Perception Spring 2009

9.35 Sensation And Perception Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 9.35 Sensation And Perception Spring 29 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. Hearing Kimo Johnson April

More information

Digital audio and computer music. COS 116, Spring 2012 Guest lecture: Rebecca Fiebrink

Digital audio and computer music. COS 116, Spring 2012 Guest lecture: Rebecca Fiebrink Digital audio and computer music COS 116, Spring 2012 Guest lecture: Rebecca Fiebrink Overview 1. Physics & perception of sound & music 2. Representations of music 3. Analyzing music with computers 4.

More information

Simple Harmonic Motion: What is a Sound Spectrum?

Simple Harmonic Motion: What is a Sound Spectrum? Simple Harmonic Motion: What is a Sound Spectrum? A sound spectrum displays the different frequencies present in a sound. Most sounds are made up of a complicated mixture of vibrations. (There is an introduction

More information

Creative Computing II

Creative Computing II Creative Computing II Christophe Rhodes c.rhodes@gold.ac.uk Autumn 2010, Wednesdays: 10:00 12:00: RHB307 & 14:00 16:00: WB316 Winter 2011, TBC The Ear The Ear Outer Ear Outer Ear: pinna: flap of skin;

More information

Phone-based Plosive Detection

Phone-based Plosive Detection Phone-based Plosive Detection 1 Andreas Madsack, Grzegorz Dogil, Stefan Uhlich, Yugu Zeng and Bin Yang Abstract We compare two segmentation approaches to plosive detection: One aproach is using a uniform

More information

Lab 5 Linear Predictive Coding

Lab 5 Linear Predictive Coding Lab 5 Linear Predictive Coding 1 of 1 Idea When plain speech audio is recorded and needs to be transmitted over a channel with limited bandwidth it is often necessary to either compress or encode the audio

More information

PHYSICS OF MUSIC. 1.) Charles Taylor, Exploring Music (Music Library ML3805 T )

PHYSICS OF MUSIC. 1.) Charles Taylor, Exploring Music (Music Library ML3805 T ) REFERENCES: 1.) Charles Taylor, Exploring Music (Music Library ML3805 T225 1992) 2.) Juan Roederer, Physics and Psychophysics of Music (Music Library ML3805 R74 1995) 3.) Physics of Sound, writeup in this

More information

Digital music synthesis using DSP

Digital music synthesis using DSP Digital music synthesis using DSP Rahul Bhat (124074002), Sandeep Bhagwat (123074011), Gaurang Naik (123079009), Shrikant Venkataramani (123079042) DSP Application Assignment, Group No. 4 Department of

More information

SOUND LABORATORY LING123: SOUND AND COMMUNICATION

SOUND LABORATORY LING123: SOUND AND COMMUNICATION SOUND LABORATORY LING123: SOUND AND COMMUNICATION In this assignment you will be using the Praat program to analyze two recordings: (1) the advertisement call of the North American bullfrog; and (2) the

More information

Vibration Measurement and Analysis

Vibration Measurement and Analysis Measurement and Analysis Why Analysis Spectrum or Overall Level Filters Linear vs. Log Scaling Amplitude Scales Parameters The Detector/Averager Signal vs. System analysis The Measurement Chain Transducer

More information

Acoustic Scene Classification

Acoustic Scene Classification Acoustic Scene Classification Marc-Christoph Gerasch Seminar Topics in Computer Music - Acoustic Scene Classification 6/24/2015 1 Outline Acoustic Scene Classification - definition History and state of

More information

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng S. Zhu, P. Ji, W. Kuang and J. Yang Institute of Acoustics, CAS, O.21, Bei-Si-huan-Xi Road, 100190 Beijing,

More information

Laugh when you re winning

Laugh when you re winning Laugh when you re winning Harry Griffin for the ILHAIRE Consortium 26 July, 2013 ILHAIRE Laughter databases Laugh when you re winning project Concept & Design Architecture Multimodal analysis Overview

More information

Getting Started with the LabVIEW Sound and Vibration Toolkit

Getting Started with the LabVIEW Sound and Vibration Toolkit 1 Getting Started with the LabVIEW Sound and Vibration Toolkit This tutorial is designed to introduce you to some of the sound and vibration analysis capabilities in the industry-leading software tool

More information

DSP First Lab 04: Synthesis of Sinusoidal Signals - Music Synthesis

DSP First Lab 04: Synthesis of Sinusoidal Signals - Music Synthesis DSP First Lab 04: Synthesis of Sinusoidal Signals - Music Synthesis Pre-Lab and Warm-Up: You should read at least the Pre-Lab and Warm-up sections of this lab assignment and go over all exercises in the

More information

Analysis, Synthesis, and Perception of Musical Sounds

Analysis, Synthesis, and Perception of Musical Sounds Analysis, Synthesis, and Perception of Musical Sounds The Sound of Music James W. Beauchamp Editor University of Illinois at Urbana, USA 4y Springer Contents Preface Acknowledgments vii xv 1. Analysis

More information

CHAPTER 20.2 SPEECH AND MUSICAL SOUNDS

CHAPTER 20.2 SPEECH AND MUSICAL SOUNDS Source: STANDARD HANDBOOK OF ELECTRONIC ENGINEERING CHAPTER 20.2 SPEECH AND MUSICAL SOUNDS Daniel W. Martin, Ronald M. Aarts SPEECH SOUNDS Speech Level and Spectrum Both the sound-pressure level and the

More information

BBN ANG 141 Foundations of phonology Phonetics 3: Acoustic phonetics 1

BBN ANG 141 Foundations of phonology Phonetics 3: Acoustic phonetics 1 BBN ANG 141 Foundations of phonology Phonetics 3: Acoustic phonetics 1 Zoltán Kiss Dept. of English Linguistics, ELTE z. kiss (elte/delg) intro phono 3/acoustics 1 / 49 Introduction z. kiss (elte/delg)

More information

Vocal-tract Influence in Trombone Performance

Vocal-tract Influence in Trombone Performance Proceedings of the International Symposium on Music Acoustics (Associated Meeting of the International Congress on Acoustics) 25-31 August 2, Sydney and Katoomba, Australia Vocal-tract Influence in Trombone

More information

Music Representations

Music Representations Lecture Music Processing Music Representations Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Book: Fundamentals of Music Processing Meinard Müller Fundamentals

More information

Course Web site:

Course Web site: The University of Texas at Austin Spring 2018 EE 445S Real- Time Digital Signal Processing Laboratory Prof. Evans Solutions for Homework #1 on Sinusoids, Transforms and Transfer Functions 1. Transfer Functions.

More information

Pitch. There is perhaps no aspect of music more important than pitch. It is notoriously

Pitch. There is perhaps no aspect of music more important than pitch. It is notoriously 12 A General Theory of Singing Voice Perception: Pitch / Howell Pitch There is perhaps no aspect of music more important than pitch. It is notoriously prescribed by composers and meaningfully recomposed

More information

Lecture 1: What we hear when we hear music

Lecture 1: What we hear when we hear music Lecture 1: What we hear when we hear music What is music? What is sound? What makes us find some sounds pleasant (like a guitar chord) and others unpleasant (a chainsaw)? Sound is variation in air pressure.

More information

CSC475 Music Information Retrieval

CSC475 Music Information Retrieval CSC475 Music Information Retrieval Monophonic pitch extraction George Tzanetakis University of Victoria 2014 G. Tzanetakis 1 / 32 Table of Contents I 1 Motivation and Terminology 2 Psychacoustics 3 F0

More information

Hybrid active noise barrier with sound masking

Hybrid active noise barrier with sound masking Hybrid active noise barrier with sound masking Xun WANG ; Yosuke KOBA ; Satoshi ISHIKAWA ; Shinya KIJIMOTO, Kyushu University, Japan ABSTRACT In this paper, a hybrid active noise barrier (ANB) with sound

More information

Measurement of overtone frequencies of a toy piano and perception of its pitch

Measurement of overtone frequencies of a toy piano and perception of its pitch Measurement of overtone frequencies of a toy piano and perception of its pitch PACS: 43.75.Mn ABSTRACT Akira Nishimura Department of Media and Cultural Studies, Tokyo University of Information Sciences,

More information

Quarterly Progress and Status Report. Formant frequency tuning in singing

Quarterly Progress and Status Report. Formant frequency tuning in singing Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Formant frequency tuning in singing Carlsson-Berndtsson, G. and Sundberg, J. journal: STL-QPSR volume: 32 number: 1 year: 1991 pages:

More information

A Matlab toolbox for. Characterisation Of Recorded Underwater Sound (CHORUS) USER S GUIDE

A Matlab toolbox for. Characterisation Of Recorded Underwater Sound (CHORUS) USER S GUIDE Centre for Marine Science and Technology A Matlab toolbox for Characterisation Of Recorded Underwater Sound (CHORUS) USER S GUIDE Version 5.0b Prepared for: Centre for Marine Science and Technology Prepared

More information

Automatic music transcription

Automatic music transcription Music transcription 1 Music transcription 2 Automatic music transcription Sources: * Klapuri, Introduction to music transcription, 2006. www.cs.tut.fi/sgn/arg/klap/amt-intro.pdf * Klapuri, Eronen, Astola:

More information

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY Eugene Mikyung Kim Department of Music Technology, Korea National University of Arts eugene@u.northwestern.edu ABSTRACT

More information

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing Book: Fundamentals of Music Processing Lecture Music Processing Audio Features Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Meinard Müller Fundamentals

More information

OCTAVE C 3 D 3 E 3 F 3 G 3 A 3 B 3 C 4 D 4 E 4 F 4 G 4 A 4 B 4 C 5 D 5 E 5 F 5 G 5 A 5 B 5. Middle-C A-440

OCTAVE C 3 D 3 E 3 F 3 G 3 A 3 B 3 C 4 D 4 E 4 F 4 G 4 A 4 B 4 C 5 D 5 E 5 F 5 G 5 A 5 B 5. Middle-C A-440 DSP First Laboratory Exercise # Synthesis of Sinusoidal Signals This lab includes a project on music synthesis with sinusoids. One of several candidate songs can be selected when doing the synthesis program.

More information

HST 725 Music Perception & Cognition Assignment #1 =================================================================

HST 725 Music Perception & Cognition Assignment #1 ================================================================= HST.725 Music Perception and Cognition, Spring 2009 Harvard-MIT Division of Health Sciences and Technology Course Director: Dr. Peter Cariani HST 725 Music Perception & Cognition Assignment #1 =================================================================

More information

Automatic Construction of Synthetic Musical Instruments and Performers

Automatic Construction of Synthetic Musical Instruments and Performers Ph.D. Thesis Proposal Automatic Construction of Synthetic Musical Instruments and Performers Ning Hu Carnegie Mellon University Thesis Committee Roger B. Dannenberg, Chair Michael S. Lewicki Richard M.

More information

Topic 10. Multi-pitch Analysis

Topic 10. Multi-pitch Analysis Topic 10 Multi-pitch Analysis What is pitch? Common elements of music are pitch, rhythm, dynamics, and the sonic qualities of timbre and texture. An auditory perceptual attribute in terms of which sounds

More information

AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY (F0) MARKERS IN SPEECH

AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY (F0) MARKERS IN SPEECH AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY (F0) MARKERS IN SPEECH by Princy Dikshit B.E (C.S) July 2000, Mangalore University, India A Thesis Submitted to the Faculty of Old Dominion University in

More information

Physics. Approximate Timeline. Students are expected to keep up with class work when absent.

Physics. Approximate Timeline. Students are expected to keep up with class work when absent. Physics Approximate Timeline Students are expected to keep up with class work when absent. CHAPTER 15 SOUND Day Plans for the day Assignments for the day 1 15.1 Properties & Detection of Sound Assignment

More information

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS Andrew N. Robertson, Mark D. Plumbley Centre for Digital Music

More information

advanced spectral processing

advanced spectral processing advanced spectral processing Jordi Janer Music Technology Group Universitat Pompeu Fabra, Barcelona jordi.janer @ upf.edu CDSIM UPF May 2014 hkp://mtg.upf.edu/ Outline 1. IntroducNon to spectral processing

More information

Advanced Signal Processing 2

Advanced Signal Processing 2 Advanced Signal Processing 2 Synthesis of Singing 1 Outline Features and requirements of signing synthesizers HMM based synthesis of singing Articulatory synthesis of singing Examples 2 Requirements of

More information

Music Representations

Music Representations Advanced Course Computer Science Music Processing Summer Term 00 Music Representations Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Music Representations Music Representations

More information

The Physics Of Sound. Why do we hear what we hear? (Turn on your speakers)

The Physics Of Sound. Why do we hear what we hear? (Turn on your speakers) The Physics Of Sound Why do we hear what we hear? (Turn on your speakers) Sound is made when something vibrates. The vibration disturbs the air around it. This makes changes in air pressure. These changes

More information

Music 209 Advanced Topics in Computer Music Lecture 1 Introduction

Music 209 Advanced Topics in Computer Music Lecture 1 Introduction Music 209 Advanced Topics in Computer Music Lecture 1 Introduction 2006-1-19 Professor David Wessel (with John Lazzaro) (cnmat.berkeley.edu/~wessel, www.cs.berkeley.edu/~lazzaro) Website: Coming Soon...

More information

Speech and Speaker Recognition for the Command of an Industrial Robot

Speech and Speaker Recognition for the Command of an Industrial Robot Speech and Speaker Recognition for the Command of an Industrial Robot CLAUDIA MOISA*, HELGA SILAGHI*, ANDREI SILAGHI** *Dept. of Electric Drives and Automation University of Oradea University Street, nr.

More information

Figure 1: Feature Vector Sequence Generator block diagram.

Figure 1: Feature Vector Sequence Generator block diagram. 1 Introduction Figure 1: Feature Vector Sequence Generator block diagram. We propose designing a simple isolated word speech recognition system in Verilog. Our design is naturally divided into two modules.

More information

Making music with voice. Distinguished lecture, CIRMMT Jan 2009, Copyright Johan Sundberg

Making music with voice. Distinguished lecture, CIRMMT Jan 2009, Copyright Johan Sundberg Making music with voice MENU: A: The instrument B: Getting heard C: Expressivity The instrument Summary RADIATED SPECTRUM Level Frequency Velum VOCAL TRACT Frequency curve Formants Level Level Frequency

More information

Polytek Reference Manual

Polytek Reference Manual Polytek Reference Manual Table of Contents Installation 2 Navigation 3 Overview 3 How to Generate Sounds and Sequences 4 1) Create a Rhythm 4 2) Write a Melody 5 3) Craft your Sound 5 4) Apply FX 11 5)

More information

Tempo Estimation and Manipulation

Tempo Estimation and Manipulation Hanchel Cheng Sevy Harris I. Introduction Tempo Estimation and Manipulation This project was inspired by the idea of a smart conducting baton which could change the sound of audio in real time using gestures,

More information

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function EE391 Special Report (Spring 25) Automatic Chord Recognition Using A Summary Autocorrelation Function Advisor: Professor Julius Smith Kyogu Lee Center for Computer Research in Music and Acoustics (CCRMA)

More information

2 Autocorrelation verses Strobed Temporal Integration

2 Autocorrelation verses Strobed Temporal Integration 11 th ISH, Grantham 1997 1 Auditory Temporal Asymmetry and Autocorrelation Roy D. Patterson* and Toshio Irino** * Center for the Neural Basis of Hearing, Physiology Department, Cambridge University, Downing

More information

Comparison Parameters and Speaker Similarity Coincidence Criteria:

Comparison Parameters and Speaker Similarity Coincidence Criteria: Comparison Parameters and Speaker Similarity Coincidence Criteria: The Easy Voice system uses two interrelating parameters of comparison (first and second error types). False Rejection, FR is a probability

More information

Music 170: Wind Instruments

Music 170: Wind Instruments Music 170: Wind Instruments Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD) December 4, 27 1 Review Question Question: A 440-Hz sinusoid is traveling in the

More information

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC Vishweshwara Rao, Sachin Pant, Madhumita Bhaskar and Preeti Rao Department of Electrical Engineering, IIT Bombay {vishu, sachinp,

More information

Singing voice synthesis in Spanish by concatenation of syllables based on the TD-PSOLA algorithm

Singing voice synthesis in Spanish by concatenation of syllables based on the TD-PSOLA algorithm Singing voice synthesis in Spanish by concatenation of syllables based on the TD-PSOLA algorithm ALEJANDRO RAMOS-AMÉZQUITA Computer Science Department Tecnológico de Monterrey (Campus Ciudad de México)

More information

Design of Speech Signal Analysis and Processing System. Based on Matlab Gateway

Design of Speech Signal Analysis and Processing System. Based on Matlab Gateway 1 Design of Speech Signal Analysis and Processing System Based on Matlab Gateway Weidong Li,Zhongwei Qin,Tongyu Xiao Electronic Information Institute, University of Science and Technology, Shaanxi, China

More information

Author Index. Absolu, Brandt 165. Montecchio, Nicola 187 Mukherjee, Bhaswati 285 Müllensiefen, Daniel 365. Bay, Mert 93

Author Index. Absolu, Brandt 165. Montecchio, Nicola 187 Mukherjee, Bhaswati 285 Müllensiefen, Daniel 365. Bay, Mert 93 Author Index Absolu, Brandt 165 Bay, Mert 93 Datta, Ashoke Kumar 285 Dey, Nityananda 285 Doraisamy, Shyamala 391 Downie, J. Stephen 93 Ehmann, Andreas F. 93 Esposito, Roberto 143 Gerhard, David 119 Golzari,

More information

ONLINE ACTIVITIES FOR MUSIC INFORMATION AND ACOUSTICS EDUCATION AND PSYCHOACOUSTIC DATA COLLECTION

ONLINE ACTIVITIES FOR MUSIC INFORMATION AND ACOUSTICS EDUCATION AND PSYCHOACOUSTIC DATA COLLECTION ONLINE ACTIVITIES FOR MUSIC INFORMATION AND ACOUSTICS EDUCATION AND PSYCHOACOUSTIC DATA COLLECTION Travis M. Doll Ray V. Migneco Youngmoo E. Kim Drexel University, Electrical & Computer Engineering {tmd47,rm443,ykim}@drexel.edu

More information

The Tone Height of Multiharmonic Sounds. Introduction

The Tone Height of Multiharmonic Sounds. Introduction Music-Perception Winter 1990, Vol. 8, No. 2, 203-214 I990 BY THE REGENTS OF THE UNIVERSITY OF CALIFORNIA The Tone Height of Multiharmonic Sounds ROY D. PATTERSON MRC Applied Psychology Unit, Cambridge,

More information

Semi-automated extraction of expressive performance information from acoustic recordings of piano music. Andrew Earis

Semi-automated extraction of expressive performance information from acoustic recordings of piano music. Andrew Earis Semi-automated extraction of expressive performance information from acoustic recordings of piano music Andrew Earis Outline Parameters of expressive piano performance Scientific techniques: Fourier transform

More information

Music Radar: A Web-based Query by Humming System

Music Radar: A Web-based Query by Humming System Music Radar: A Web-based Query by Humming System Lianjie Cao, Peng Hao, Chunmeng Zhou Computer Science Department, Purdue University, 305 N. University Street West Lafayette, IN 47907-2107 {cao62, pengh,

More information

MASTER'S THESIS. Listener Envelopment

MASTER'S THESIS. Listener Envelopment MASTER'S THESIS 2008:095 Listener Envelopment Effects of changing the sidewall material in a model of an existing concert hall Dan Nyberg Luleå University of Technology Master thesis Audio Technology Department

More information

AUD 6306 Speech Science

AUD 6306 Speech Science AUD 3 Speech Science Dr. Peter Assmann Spring semester 2 Role of Pitch Information Pitch contour is the primary cue for tone recognition Tonal languages rely on pitch level and differences to convey lexical

More information

Communication Theory and Engineering

Communication Theory and Engineering Communication Theory and Engineering Master's Degree in Electronic Engineering Sapienza University of Rome A.A. 2018-2019 Practice work 14 Image signals Example 1 Calculate the aspect ratio for an image

More information

1. Introduction NCMMSC2009

1. Introduction NCMMSC2009 NCMMSC9 Speech-to-Singing Synthesis System: Vocal Conversion from Speaking Voices to Singing Voices by Controlling Acoustic Features Unique to Singing Voices * Takeshi SAITOU 1, Masataka GOTO 1, Masashi

More information

Linear Time Invariant (LTI) Systems

Linear Time Invariant (LTI) Systems Linear Time Invariant (LTI) Systems Superposition Sound waves add in the air without interacting. Multiple paths in a room from source sum at your ear, only changing change phase and magnitude of particular

More information

Digital Signal. Continuous. Continuous. amplitude. amplitude. Discrete-time Signal. Analog Signal. Discrete. Continuous. time. time.

Digital Signal. Continuous. Continuous. amplitude. amplitude. Discrete-time Signal. Analog Signal. Discrete. Continuous. time. time. Discrete amplitude Continuous amplitude Continuous amplitude Digital Signal Analog Signal Discrete-time Signal Continuous time Discrete time Digital Signal Discrete time 1 Digital Signal contd. Analog

More information

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2 To use sound properly, and fully realize its power, we need to do the following: (1) listen (2) understand basics of sound and hearing (3) understand sound's fundamental effects on human communication

More information

LOUDNESS EFFECT OF THE DIFFERENT TONES ON THE TIMBRE SUBJECTIVE PERCEPTION EXPERIMENT OF ERHU

LOUDNESS EFFECT OF THE DIFFERENT TONES ON THE TIMBRE SUBJECTIVE PERCEPTION EXPERIMENT OF ERHU The 21 st International Congress on Sound and Vibration 13-17 July, 2014, Beijing/China LOUDNESS EFFECT OF THE DIFFERENT TONES ON THE TIMBRE SUBJECTIVE PERCEPTION EXPERIMENT OF ERHU Siyu Zhu, Peifeng Ji,

More information

T.R.A.X. SPAT. Real-time Voice and Sonic Modeling Processor. by by. Multiformat Room Acoustic Simulation & Localization Processor

T.R.A.X. SPAT. Real-time Voice and Sonic Modeling Processor. by by. Multiformat Room Acoustic Simulation & Localization Processor SPAT Multiformat Room Acoustic Simulation & Localization Processor by by T.R.A.X. Real-time Voice and Sonic Modeling Processor Flux 2009. All rights reserved Transformer Common Cross Synthesis & Source

More information

Experiment 13 Sampling and reconstruction

Experiment 13 Sampling and reconstruction Experiment 13 Sampling and reconstruction Preliminary discussion So far, the experiments in this manual have concentrated on communications systems that transmit analog signals. However, digital transmission

More information

Robert Alexandru Dobre, Cristian Negrescu

Robert Alexandru Dobre, Cristian Negrescu ECAI 2016 - International Conference 8th Edition Electronics, Computers and Artificial Intelligence 30 June -02 July, 2016, Ploiesti, ROMÂNIA Automatic Music Transcription Software Based on Constant Q

More information

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF)

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF) PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF) "The reason I got into playing and producing music was its power to travel great distances and have an emotional impact on people" Quincey

More information

Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons

Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons Róisín Loughran roisin.loughran@ul.ie Jacqueline Walker jacqueline.walker@ul.ie Michael O Neill University

More information

ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals

ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals Purdue University: ECE438 - Digital Signal Processing with Applications 1 ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals October 6, 2010 1 Introduction It is often desired

More information

Harmonic Analysis of the Soprano Clarinet

Harmonic Analysis of the Soprano Clarinet Harmonic Analysis of the Soprano Clarinet A thesis submitted in partial fulfillment of the requirement for the degree of Bachelor of Science in Physics from the College of William and Mary in Virginia,

More information

EVTA SESSION HELSINKI JUNE 06 10, 2012

EVTA SESSION HELSINKI JUNE 06 10, 2012 EVTA SESSION HELSINKI JUNE 06 10, 2012 Reading Spectrograms FINATS Department of Communication and Arts University of Aveiro Campus Universitário de Santiago 3810-193 Aveiro Portugal ipa Lã (PhD) Department

More information

Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals. By: Ed Doering

Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals. By: Ed Doering Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals By: Ed Doering Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals By: Ed Doering Online:

More information

AN INTRODUCTION TO MUSIC THEORY Revision A. By Tom Irvine July 4, 2002

AN INTRODUCTION TO MUSIC THEORY Revision A. By Tom Irvine   July 4, 2002 AN INTRODUCTION TO MUSIC THEORY Revision A By Tom Irvine Email: tomirvine@aol.com July 4, 2002 Historical Background Pythagoras of Samos was a Greek philosopher and mathematician, who lived from approximately

More information

Experiments on musical instrument separation using multiplecause

Experiments on musical instrument separation using multiplecause Experiments on musical instrument separation using multiplecause models J Klingseisen and M D Plumbley* Department of Electronic Engineering King's College London * - Corresponding Author - mark.plumbley@kcl.ac.uk

More information

Signal processing in the Philips 'VLP' system

Signal processing in the Philips 'VLP' system Philips tech. Rev. 33, 181-185, 1973, No. 7 181 Signal processing in the Philips 'VLP' system W. van den Bussche, A. H. Hoogendijk and J. H. Wessels On the 'YLP' record there is a single information track

More information

Lab P5: Synthesis of Sinusoidal Signals Music Synthesis

Lab P5: Synthesis of Sinusoidal Signals Music Synthesis DSP First, 2e Signal Processing First Lab P5: Synthesis of Sinusoidal Signals Music Synthesis Pre-Lab and Warm-Up: You should read at least the Pre-Lab and Warm-up sections of this lab assignment and go

More information

Announcements. Project Turn-In Process. and URL for project on a Word doc Upload to Catalyst Collect It

Announcements. Project Turn-In Process. and URL for project on a Word doc Upload to Catalyst Collect It Announcements Project Turn-In Process Put name, lab, UW NetID, student ID, and URL for project on a Word doc Upload to Catalyst Collect It 1 Project 1A: Announcements Turn in the Word doc or.txt file before

More information

Signal Stability Analyser

Signal Stability Analyser Signal Stability Analyser o Real Time Phase or Frequency Display o Real Time Data, Allan Variance and Phase Noise Plots o 1MHz to 65MHz medium resolution (12.5ps) o 5MHz and 10MHz high resolution (50fs)

More information

FPFV-285/585 PRODUCTION SOUND Fall 2018 CRITICAL LISTENING Assignment

FPFV-285/585 PRODUCTION SOUND Fall 2018 CRITICAL LISTENING Assignment FPFV-285/585 PRODUCTION SOUND Fall 2018 CRITICAL LISTENING Assignment PREPARATION Track 1) Headphone check -- Left, Right, Left, Right. Track 2) A music excerpt for setting comfortable listening level.

More information