EE513 Audio Signals and Systems. Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky

Similar documents
Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam

2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics

Pitch-Synchronous Spectrogram: Principles and Applications

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

2. AN INTROSPECTION OF THE MORPHING PROCESS

Lab P-6: Synthesis of Sinusoidal Signals A Music Illusion. A k cos.! k t C k / (1)

International Journal of Computer Architecture and Mobility (ISSN ) Volume 1-Issue 7, May 2013

CTP431- Music and Audio Computing Musical Acoustics. Graduate School of Culture Technology KAIST Juhan Nam

Welcome to Vibrationdata

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high.

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

UNIVERSITY OF DUBLIN TRINITY COLLEGE

We realize that this is really small, if we consider that the atmospheric pressure 2 is

Analysis of the effects of signal distance on spectrograms

9.35 Sensation And Perception Spring 2009

Digital audio and computer music. COS 116, Spring 2012 Guest lecture: Rebecca Fiebrink

Simple Harmonic Motion: What is a Sound Spectrum?

Creative Computing II

Phone-based Plosive Detection

Lab 5 Linear Predictive Coding

PHYSICS OF MUSIC. 1.) Charles Taylor, Exploring Music (Music Library ML3805 T )

Digital music synthesis using DSP

SOUND LABORATORY LING123: SOUND AND COMMUNICATION

Vibration Measurement and Analysis

Acoustic Scene Classification

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

Laugh when you re winning

Getting Started with the LabVIEW Sound and Vibration Toolkit

DSP First Lab 04: Synthesis of Sinusoidal Signals - Music Synthesis

Analysis, Synthesis, and Perception of Musical Sounds

CHAPTER 20.2 SPEECH AND MUSICAL SOUNDS

BBN ANG 141 Foundations of phonology Phonetics 3: Acoustic phonetics 1

Vocal-tract Influence in Trombone Performance

Music Representations

Course Web site:

Pitch. There is perhaps no aspect of music more important than pitch. It is notoriously

Lecture 1: What we hear when we hear music

CSC475 Music Information Retrieval

Hybrid active noise barrier with sound masking

Measurement of overtone frequencies of a toy piano and perception of its pitch

Quarterly Progress and Status Report. Formant frequency tuning in singing

A Matlab toolbox for. Characterisation Of Recorded Underwater Sound (CHORUS) USER S GUIDE

Automatic music transcription

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

OCTAVE C 3 D 3 E 3 F 3 G 3 A 3 B 3 C 4 D 4 E 4 F 4 G 4 A 4 B 4 C 5 D 5 E 5 F 5 G 5 A 5 B 5. Middle-C A-440

HST 725 Music Perception & Cognition Assignment #1 =================================================================

Automatic Construction of Synthetic Musical Instruments and Performers

Topic 10. Multi-pitch Analysis

AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY (F0) MARKERS IN SPEECH

Physics. Approximate Timeline. Students are expected to keep up with class work when absent.

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

advanced spectral processing

Advanced Signal Processing 2

Music Representations

The Physics Of Sound. Why do we hear what we hear? (Turn on your speakers)

Music 209 Advanced Topics in Computer Music Lecture 1 Introduction

Speech and Speaker Recognition for the Command of an Industrial Robot

Figure 1: Feature Vector Sequence Generator block diagram.

Making music with voice. Distinguished lecture, CIRMMT Jan 2009, Copyright Johan Sundberg

Polytek Reference Manual

Tempo Estimation and Manipulation

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function

2 Autocorrelation verses Strobed Temporal Integration

Comparison Parameters and Speaker Similarity Coincidence Criteria:

Music 170: Wind Instruments

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Singing voice synthesis in Spanish by concatenation of syllables based on the TD-PSOLA algorithm

Design of Speech Signal Analysis and Processing System. Based on Matlab Gateway

Author Index. Absolu, Brandt 165. Montecchio, Nicola 187 Mukherjee, Bhaswati 285 Müllensiefen, Daniel 365. Bay, Mert 93

ONLINE ACTIVITIES FOR MUSIC INFORMATION AND ACOUSTICS EDUCATION AND PSYCHOACOUSTIC DATA COLLECTION

The Tone Height of Multiharmonic Sounds. Introduction

Semi-automated extraction of expressive performance information from acoustic recordings of piano music. Andrew Earis

Music Radar: A Web-based Query by Humming System

MASTER'S THESIS. Listener Envelopment

AUD 6306 Speech Science

Communication Theory and Engineering

1. Introduction NCMMSC2009

Linear Time Invariant (LTI) Systems

Digital Signal. Continuous. Continuous. amplitude. amplitude. Discrete-time Signal. Analog Signal. Discrete. Continuous. time. time.

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2

LOUDNESS EFFECT OF THE DIFFERENT TONES ON THE TIMBRE SUBJECTIVE PERCEPTION EXPERIMENT OF ERHU

T.R.A.X. SPAT. Real-time Voice and Sonic Modeling Processor. by by. Multiformat Room Acoustic Simulation & Localization Processor

Experiment 13 Sampling and reconstruction

Robert Alexandru Dobre, Cristian Negrescu

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF)

Musical Instrument Identification Using Principal Component Analysis and Multi-Layered Perceptrons

ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals

Harmonic Analysis of the Soprano Clarinet

EVTA SESSION HELSINKI JUNE 06 10, 2012

Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals. By: Ed Doering

AN INTRODUCTION TO MUSIC THEORY Revision A. By Tom Irvine July 4, 2002

Experiments on musical instrument separation using multiplecause

Signal processing in the Philips 'VLP' system

Lab P5: Synthesis of Sinusoidal Signals Music Synthesis

Announcements. Project Turn-In Process. and URL for project on a Word doc Upload to Catalyst Collect It

Signal Stability Analyser

FPFV-285/585 PRODUCTION SOUND Fall 2018 CRITICAL LISTENING Assignment

Transcription:

EE513 Audio Signals and Systems Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky

Question! If a tree falls in the forest and nobody is there to hear it, will it make a sound? Sound provided by http://www.therecordist.com/downloads.html

Ambiguity! Merriam-Webster Dictionary: Sound a : a particular auditory impression b:the sensation perceived by the sense of hearing c :mechanical radiant energy that is transmitted by longitudinal pressure waves in a material medium (as air) and is the objective cause of hearing.

Electronic Audio Systems Sound Sources Vibrations at 20Hz-20kHz Transmission Media Electoacoustic Transducer Playback Amplification, Signal Conditioning Information Extraction / Measurement Storage Processing for Intended Application

Natural Audio Systems

Synthetic Audio: Imitating Nature 1780 Wolfgang von Kemplen s Speaking Machine Mid 1800 s Charles Wheatstone Late 1800 s Alexander Graham Bell 1939 Homer Dudley s Voder http://www.acoustics.hut.fi/~slemmett/wave/track01f.wav 1898 Thaddeus Cahill s Telharmonium (First Music Synthesizer) 1919 Lev Theremin s Theremin

Speech Analysis and Synthesis Communication channels (acoustic and electric) 1874/1876 (Antonio Meucci s) Alexander Graham Bell s Telephone. 1940 s Homer Dudley s Channel Vocoder first analysis-synthesis system

Voice-Coding Models The general speech model: Quasi-Periodic Pulsed Air Air Burst or Continuous flow Voiced Speech Unvoiced Speech Vocal Tract Filter Vocal Radiator Speech sounds can be analyzed by determining the states of the vocal system components (vocal chords, track, lips, tongue ) for each fundamental sound of speech (phoneme).

Spectral Analysis Voiced Speech Spectral envelop => vocal tract formants Harmonic peaks => vocal chord pitch

Time Analysis Voiced Speech Time envelop => Volume dynamics Oscillations => Vocal chord motion

Spectrogram Analysis There shoe She do She bread Then bed Frequency Time

Spectogram of CD sound Tell Me Ma - Spectrogram in db 8000 120 7000 100 6000 80 Hertz 5000 60 4000 3000 40 2000 20 1000 0 0 5 10 Seconds 15

Speech Recognition 1920 s Radio Rex 1950 s (Bell Labs) Digit Recognition Spectral/Formant analysis Filter Banks 1960 s Neural Networks 1970 s ARPA Project for Speech Understanding Applications of spectral analysis methods FFT, Cepstral/homomorphic, LPC 1970 s Application of pattern matching methods DTW, and HMM

Speech Recognition 1980 s Standardize Training and Test with Large Corpora (TIMIT) (RM) (DARPA) New Front Ends (feature extractors) more perceptually based Dominance/Development of HMM Backpropagation and Neural Networks Rule-Base AI systems

Specification of Speech Recognition Speaker dependent or independent Recognize isolated, continuous, or spot speech Vocabulary Size, Grammar Perplexity, Speaking style Recording conditions

Components of Speech Recognition Speech Transduction Acoustic/Electronic Front End Input Speech Detected Speech String Local Match Global Detector Language Model

Matlab Examples %% Create and play a 2 second 440 Hz tone in Matlab: fs = 8000; % Set a sampling frequency fq = 440; % frequency to play t = [0:round(2*fs)-1]/fs; % Sampled time axis sig = cos(2*pi*fq*t); % Create sampled signal soundsc(sig,fs) % Play it plot(t,sig); xlabel('seconds'); ylabel('amplitude') wavwrite(sig,fs,'t440.wav') clear % Remove all variables from work space %% Reload tone and weight it with a decaying exponential of time constant.6 seconds tc =.6; % Set time constant [y, fs] = wavread('t440.wav'); % read in wave file t =[0:length(y)-1]'/fs; % Create sampled time axis dw = exp(-t/tc); % Compute sampled decaying exponential dsig = y.*dw; % Multiply sinusoid with decaying exponential soundsc(dsig,fs) plot(t,dsig); xlabel('seconds'); ylabel('amplitude')

Matlab Examples Explore demo and help files >> help script SCRIPT About MATLAB scripts and M-files. A SCRIPT file is an external file that contains a sequence of MATLAB statements. By typing the filename, subsequent MATLAB input is obtained from the file. SCRIPT files have a filename extension of ".m" and are often called "M-files". To make a SCRIPT file into a function, see FUNCTION. See also type, echo. Reference page in Help browser doc script In the help window (click on question mark) Go through section on programming and then go to the demo tab and view a few of the demo.

Matlab Examples In class examples

Matlab Exercise Use the sine/cosine function in Matlab to write a function that generates a minor scale (for testing the function use start tones between 100 and 440 Hz with a sampling rate of 8 khz). Let the Matlab function input arguments be the starting frequency and the time interval for each scale tone in seconds. Let the output be a vector of samples that can be played with Matlab command soundsc(v,8000) (where v is the vector output of your function). The frequency range of a scale covers one octave, which implies the last frequency is twice the starting frequency. On most fixed pitch instruments, 12 semi-tones or half steps make up the notes within an octave. A minor scale sequentially increases by a whole, half, whole, whole, half, whole, and whole (8 notes altogether including the starting note).

Matlab Exercise - Scales Just Pythagorean Equal Temperament Interval - 0 (1) 1/1 = 1 1 = 1 2^(0)=1 Interval - 1 16/15 256/243 2^(1/12) Interval - 2 (2) 10/9 (or 9/8) 9/8 2^(2/12) Interval - 3 (3) 6/5 32/27 2^(3/12) Interval - 4 5/4 81/64 2^(4/12) Interval - 5 (4) 4/3 4/3 2^(5/12) Interval - 6 45/32 (or 64/45) 1024/729 (or 729/512) 2^(6/12) Interval - 7 (5) 3/2 3/2 2^(7/12) Interval - 8 (6) 8/5 128/81 2^(8/12) Interval - 9 5/3 27/16 2^(9/12) Interval - 10 (7) 7/4 (or 16/19 or 9/5) 16/9 2^(10/12) Interval - 11 15/8 243/128 2^(11/12) Interval - 12 (8) 2/1 = 2 2/1 = 2 2^(12/12) = 2

Matlab Exercise Famous Notes Middle C = 261.626 Hz (standard tuning) Concert A (A above middle C) = 440 Hz Middle C = 256 Hz (Scientific tuning) Lowest note on piano A=27.5 Hz Highest note on piano C= 4186.009