Musical frequency tracking using the methods of conventional and "narrowed" autocorrelation

Similar documents
Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Transcription An Historical Overview

Measurement of overtone frequencies of a toy piano and perception of its pitch

Simple Harmonic Motion: What is a Sound Spectrum?

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

Audio-Based Video Editing with Two-Channel Microphone

Tempo and Beat Analysis

DATA COMPRESSION USING THE FFT

DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS

Robert Alexandru Dobre, Cristian Negrescu

Music Segmentation Using Markov Chain Methods

Melody transcription for interactive applications

Music Radar: A Web-based Query by Humming System

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

CS229 Project Report Polyphonic Piano Transcription

Acoustic Measurements Using Common Computer Accessories: Do Try This at Home. Dale H. Litwhiler, Terrance D. Lovell

Getting Started. Connect green audio output of SpikerBox/SpikerShield using green cable to your headphones input on iphone/ipad.

Upgrading E-learning of basic measurement algorithms based on DSP and MATLAB Web Server. Milos Sedlacek 1, Ondrej Tomiska 2

Onset Detection and Music Transcription for the Irish Tin Whistle

Spectrum Analyser Basics

EE391 Special Report (Spring 2005) Automatic Chord Recognition Using A Summary Autocorrelation Function

Available online at ScienceDirect. Procedia Computer Science 46 (2015 )

PHYSICS OF MUSIC. 1.) Charles Taylor, Exploring Music (Music Library ML3805 T )

Speech and Speaker Recognition for the Command of an Industrial Robot

DIGITAL COMMUNICATION

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

The Effect of Time-Domain Interpolation on Response Spectral Calculations. David M. Boore

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

Department of Electrical & Electronic Engineering Imperial College of Science, Technology and Medicine. Project: Real-Time Speech Enhancement

CSC475 Music Information Retrieval

A NEW LOOK AT FREQUENCY RESOLUTION IN POWER SPECTRAL DENSITY ESTIMATION. Sudeshna Pal, Soosan Beheshti

A FUNCTIONAL CLASSIFICATION OF ONE INSTRUMENT S TIMBRES

Voice & Music Pattern Extraction: A Review

Automatic Laughter Detection

Analysis, Synthesis, and Perception of Musical Sounds

2. AN INTROSPECTION OF THE MORPHING PROCESS

Experiments on musical instrument separation using multiplecause

Beat Tracking based on Multiple-agent Architecture A Real-time Beat Tracking System for Audio Signals

Analysis of local and global timing and pitch change in ordinary

Pitch correction on the human voice

Advanced Techniques for Spurious Measurements with R&S FSW-K50 White Paper

Topic 4. Single Pitch Detection

AN ALGORITHM FOR LOCATING FUNDAMENTAL FREQUENCY (F0) MARKERS IN SPEECH

Processing. Electrical Engineering, Department. IIT Kanpur. NPTEL Online - IIT Kanpur

TOWARD AN INTELLIGENT EDITOR FOR JAZZ MUSIC

Musicians Adjustment of Performance to Room Acoustics, Part III: Understanding the Variations in Musical Expressions

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

Calibrate, Characterize and Emulate Systems Using RFXpress in AWG Series

AN APPROACH FOR MELODY EXTRACTION FROM POLYPHONIC AUDIO: USING PERCEPTUAL PRINCIPLES AND MELODIC SMOOTHNESS

MIE 402: WORKSHOP ON DATA ACQUISITION AND SIGNAL PROCESSING Spring 2003

Music Source Separation

Detection and demodulation of non-cooperative burst signal Feng Yue 1, Wu Guangzhi 1, Tao Min 1

Automatic Rhythmic Notation from Single Voice Audio Sources

Hands-on session on timing analysis

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB

Polyphonic music transcription through dynamic networks and spectral pattern identification

Removal of Decaying DC Component in Current Signal Using a ovel Estimation Algorithm

Analysis of Packet Loss for Compressed Video: Does Burst-Length Matter?

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

JOURNAL OF BUILDING ACOUSTICS. Volume 20 Number

Electrospray-MS Charge Deconvolutions without Compromise an Enhanced Data Reconstruction Algorithm utilising Variable Peak Modelling

Automatic Laughter Detection

PS User Guide Series Seismic-Data Display

1 Ver.mob Brief guide

Available online at International Journal of Current Research Vol. 9, Issue, 08, pp , August, 2017

ACCURATE ANALYSIS AND VISUAL FEEDBACK OF VIBRATO IN SINGING. University of Porto - Faculty of Engineering -DEEC Porto, Portugal

Investigation of Digital Signal Processing of High-speed DACs Signals for Settling Time Testing

Virtual Vibration Analyzer

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication

Reconstruction of Ca 2+ dynamics from low frame rate Ca 2+ imaging data CS229 final project. Submitted by: Limor Bursztyn

Keywords Separation of sound, percussive instruments, non-percussive instruments, flexible audio source separation toolbox

Processes for the Intersection

Lecture 2 Video Formation and Representation

Registration Reference Book

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high.

Spectroscopy on Thick HgI 2 Detectors: A Comparison Between Planar and Pixelated Electrodes

Chapter 4. Displaying Quantitative Data. Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

LOUDNESS EFFECT OF THE DIFFERENT TONES ON THE TIMBRE SUBJECTIVE PERCEPTION EXPERIMENT OF ERHU

Received 27 July ; Perturbations of Synthetic Orchestral Wind-Instrument

MUSICAL INSTRUMENT IDENTIFICATION BASED ON HARMONIC TEMPORAL TIMBRE FEATURES

Realizing Waveform Characteristics up to a Digitizer s Full Bandwidth Increasing the effective sampling rate when measuring repetitive signals

Spectral Sounds Summary

TERRESTRIAL broadcasting of digital television (DTV)

UNIVERSAL SPATIAL UP-SCALER WITH NONLINEAR EDGE ENHANCEMENT

NON-LINEAR EFFECTS MODELING FOR POLYPHONIC PIANO TRANSCRIPTION

MPEG has been established as an international standard

Appendix D. UW DigiScope User s Manual. Willis J. Tompkins and Annie Foong

Guidance For Scrambling Data Signals For EMC Compliance

Reducing False Positives in Video Shot Detection

Figure 1: Feature Vector Sequence Generator block diagram.

ON THE INTERPOLATION OF ULTRASONIC GUIDED WAVE SIGNALS

Analysis of Musical Content in Digital Audio

A review of the implementation of HDTV technology over SDTV technology

Query By Humming: Finding Songs in a Polyphonic Database

Music Database Retrieval Based on Spectral Similarity

Sentiment Extraction in Music

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

Violin Timbre Space Features

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1

Transcription:

Musical frequency tracking using the methods of conventional and "narrowed" autocorrelation Judith C. Brown and Bin Zhang a) Physics Department, Feellesley College, Fee/lesley, Massachusetts 01281 and Media Lab, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139 (Received 20 August 1990; accepted for publication 23 December 1990) In two recent papers, a description is given of a means of obtaining an arbitrarily narrow peak in the calculation of the autocorrelation function [J. C. Brown, and M. S. Puckette, "Calculation of a narrowed autocorrelation function," J. Acoust. Sec. Am. 85, 1595-1601 (1989)] or era narrow valley in the calculation of an inverse autocorrelation [J. C. Brown, and M. S. Puekette, "Musical information from a narrowed autocorrelation function," Proceedings of the 1987 International Conference on Computer Music, Urbana, Illinois, 84-88 (1987)]. These calculations are applied to the determination of the fundamental frequency of musical signals produced by keyboard, wind, and string instruments. These results are compared to frequency tracking results obtained on these sounds with conventional autocorrelation. In so doing it is determined first whether the method of autocorrelation is well-adapted to the problem of tracking the frequency of musical signals, and, second, under what conditions "narrowed" autocorrelation is advantageous. PACS numbers: 43.60.Gk, 43.75.Yy INTRODUCTION Musical frequency tracking has been relatively little explored in comparison to the massivefforts that have been carried out by the speech community for use with various speech encoders for communications purposes. Musical applications have, for the most part, been in the area of intelligent systems, where an accurate frequency tracker is a necessity at the front end. An initial effort at a musical transcription system was made by Piszczalski and Galler (1977). This is a system whose goal is to take an incoming audio signal, process it, and turn out a musical score. Their frequency tracker (Piszczalski and Galler, 1979) examined frequency ratios of spectral components to form a hypothesis for the fundamental frequency. Another musical frequency tracker proposed originally in the same year (Terhardt, 1979; Terhardt et al, 1982) examined submultiples of spectral components to arrive at a fundamental frequency. Both of these methods are similar to the Schroeder (1968) histogram method. In a series of excellent papers going back to the 1970s, the group at CCRMA in the Music Department at Stanford has worked on a musical transcription system. Various frequency trackers are discussed by Moorer (1975), Schloss (1985), Foster et al. (1982), Mont-Reynaud (1985), and Chafe et al. (1985, 1986). A recent report by Serra and Wood (1988) summarizes current work at CCRMA and gives extensive references to earlier work. Barry Vercoe at MIT has worked on a machine intelligence problem called "the synthetic performer." Its goal is Present address: GW Instruments, 35 Medford St., Somerville, MA 02143. to be able to replace a member of a live ensemble of performers with a computer so that the remaining members cannot tell the difference. Early work by Vercoe (1984) and Vercoe and Puckette (1985) used a frequency tracker based on key sensors of a flute. More recently, as reported in a segment on the TV show "Discover," a spectral method similar to that of Schroeder (1968) and Amuedo (1985) has been used to track the audio signal of a violin. I. BACKGROUND It has been shown previously that the following equation (Brown and Puckette, 1989) can be used for the calculation of a narrowed autocorrelation function: S (r) = If(t) +f(t + r) +f(t + 2r) + --- +fit+ (N-- 1)r]l'. (1) In the discussion, we showed that terms of the form f(t)f(t + 2r),f( t)f( t + 3r)... etc., in addition to the "ordinary" autocorrelation term f( t )f( t + r) give rise to a narrowing of the autocorrelation function. Here, f(t) is the time wave analyzed, t is the time, and r is the autocorrelation time. For a periodic function, peaks occur at values of the autocorrelation time equal to multiples of the period T. When iv is equal to 2 in the above equation, the conventional or standard autocorrelation function is given by the cross term with the other two terms contributing a constant shift. The equation can be normalized with the value ofsn (r) 2 for r = 0 so that the values of SN(r) 2 range from 0-1. Most important, the width of the peaks measured from the maximum to the first zero is T/N for the time wave of a single harmonic component. The equation for the inverted autocorrelation function was discussed by Brown and Puckette (1987). It was shown that the function 2346 d. Acoust. Sec. Am. 89 (5). May 1991 0001-4966/91/052346-09500.80 1991 Acoustical Society of America 2346

p(t,r) = (f(t) +f(t + r) +f(t + 2 ) +..f[t+ (N-- 1 is the periodic function having period r which best approximatesf (t) over the interval Nr in the mean-square sense. If we consider the difference E(r) = {If(t) --p(t,r) ]2), (2) then E(r) will have minima for r equal to multiples of the period if f (t) is a periodic function. The average over time indicated by the angle brackets is taken over a time equal to or greater than r. Again we normalize so that the function varies from 0 to 1. In the discussion that follows, we will refer to calculations with Eq. (2) as those of inverted autocorrelation. Where necessary for distinction, calculations using Eq. ( 1 ) will be referred to as "erect" or standard autocorrelation. Within each of these categories, the calculations with N = 2 will be called conventional, and those with N> 2 will be called "narrowed." An important property of musical sounds is that, for the most part, they have harmonic spectral components. The consequence for the autocorrelation function is that one of the peaks of each of the higher components occurs at lhe same position as that of the fundamental. For example, the second harmonic has a period equal to half of the fundamental so its peaks occur at T/2, 2(T/2), 3( T/2)... and so on. The second peak of the second harmonic, then, will coincide with the first peak of the fundamental with similar reasoning for the other harmonics. Thus a large peak corresponding to the sum of all spectral components should occur at the period of the fundamental (and all integral multiples of the period of the fundamental). This is the property that makes the method of autocorrelation appear to be a good one for frequency tracking of musical signals. This property, of course, also holds true for the valleys for the inverse autocorrelation of Eq. (2). II. CALCULATIONS Examples of the conventional and narrowed autocorrelation functions for scales played by a piano, a violin, and a flute are shown in Figs. 1-6. Each curve is a 200-point autocorrelation function of sound from acoustic instruments sampled at 32000 samples per second. An average is taken over 500 sample so each curve represents approximately 15 ms of sound. The time in the sound is given on they axis with the autocorrelation time in samples on the x axis. Equation ( 1 ) was used for these calculations with N = 2 for the conventional autocorrelation and N = 5 for the narrowed auto- correlation. Examples calculated for the inverted autocorrelation function with Eq. (2) are not included. The piano sound analyzed in Figs. I and 2 was from a 9- foot Bosendorfer which radiated surprisingly little energy in the higher harmonics. Since the peak widths are proportional to the period, a sound with higher harmonics present will have an innately narrow peak. As this is not the case here, this piano sound represents a good case to see the effect of 1.2 1.6 AUTOCO R ELATION TIME (SAMPLES) FIG. 1. Conventional autocorrelation for a C major piano scale over several octaves. This portion is from F3 to E5. 2347 J. Acoust. Soc. Am., Vol. 89, No. 5, May 1991 J.C. Brown and B. Zhang: Musical pitch tracking 2347

AUTOCO R ELATION TIM E (SAMPLES) FIG. 2. Narrowed autocorrelation for the sams piano sound as that of Fig. 1. 200 AUTOCO R ELATION TIME (SAMPLES) FIG. 3. Conventional autocorrelation of a flute scale from C3 to C5. 2348 J. Acoust. Soc. Am., Vol. 89, No. 5, May 1991 J. C. Brown and B. Zhang: Musical pitch tracking 2348

.:'.8-..9-1.9- {. -.5- l. - {.7-.3-0 Y'5 5 75 100 AUTOCOR ELATION TIME I 125 150 (SAMPLES) J79 200 FIG. 4. Narrowed autocorrelation of a flute scale from C3 to C5. narrowing on a broad peak. This is quite striking and is shown in Figs. 1 and 2. The comparison of the flute scale in Figs. 3 and 4 and the violin scale in Figs. 5 and 6 show successively narrower natu- ral peak widths (Figs. 3 and 5). The effect of narrowing Fig. 6 compared to Fig. 5 is quite apparent and is visually preferable. One might, however, suspect that a computer could pick the natural peaks of Fig. 5 as easily as the nar- o LU { I t I I I o 5 50 75 0o,.5,50 I 175 200 I A U TOC O R ELATION T IM E (SA M P L E S) FIG. 5. Conventional autocorrelation oœa violin scale from G3 to GS. 2349 J. Acoust. Soc, Am., Vol. 89, No. 5, May 1991 J. C. Brown and B. Zhang: Musical pitch tracking 2349

z.- o z 3.6-3.5-1.4-3.3-3.2- I I I I AUTOCOR ELATION TIME I I I I (SAMPLES) FIG. 6. Narrowed autocorrelation of a violin scale from G3 to G5. rowed peaks of Fig. 6 since the peaks in Fig. 5 are already rather narrow. The problem of determining the fundamental frequency for each of the curves of these figures is to find the position of the peak corresponding to one period of the fundamental. For the inverted autocorrelation, the problem is to identify the corresponding minimum. The problem is complicated by the fact that we have calculated a discrete autocorrelation, and there may not be a sample at the exact postion of the maximum (or minimum). An example is given in Fig. 7 where the valley corresponding to one period (occurring between samples 88 and 89) has a value greater than that of the valley corresponding to two periods (occurring at sample 179). To circumvent this problem we have an adjustable parameter to denote how much higher (or lower) a peak (or ]00 150 AUTOCOR ELATION TIME (SAMPLES) FIG. 7. Example of the narrowed inverted autocorrelation function for a violin sound where the peak at autocorrelation time equal one period is low due to discrete sampling. valley) must be to replace the previous winner. This was 0.03 for inverted autocorrelation for both narrowed (N> 2) and conventional (N = 2) cases. For "erect" autocorrelation a percentage difference was used. An exhaustive study revealed that a lower value (approximately 15%) should be used for conventional than for narrowed (25%-35%) calculations. There was a small variation of the optimum value among the instruments. The larger value for the narrowed autocorrelation means that there is a greater difference in peak heights of the winning peak (sum of all harmonics) compared to other combinations of harmonics for the narrowed case. Since relative heights of individual harmonics are theoretically the same for both cases, this must result from less overlap of the narrower peaks, and is an advantage of the narrowed calculation. We have also tried several methods of curve fitting for obtaining a better value of the maximum including a qua- dratic fit, cubic fit, linear extrapolation of the two points on either side of the maximum, and calculation of the maximum by considering sums of two adjacent points. This latter method was used for the narrowed autocorrelation for the violin as its results were as good as those of the other methods, and it is more efficient computationally. Other curves gave good results with no fitting procedure. Another useful parameter is based on the normalization of these functions. Errors in frequency determination occur almost exclusively at note transitions where there is a mixture of both notes. There is often a change in amplitude of the waveform in these regions as well, and this affects the nor- malization. Both of these effects contribute to cause the peaks (or valleys for inverted autocorrelation) differ from the ideal value of 1 (or 0). Our programs have the option of 2350 J. Acoust. Soc. Am., Vol. 89, No. 5, May 1991 J.C. Brown and B. Zhang: Musical pitch tracking 2350

returning a zero meaning no frequency determination tf a peak was greater than 1.2 or less than 0.8 or ira valley was greater than 0.1. This property can be used by an intelligent system to signal note changes, if desired. It is to be noted above that the quantity E(r) from Eq. (2) is expected to be closer to the ideal value 0 than the quantity Ss ( -) 2 from Eq. ( 1 ) is to I at - equal to an integral number of periods. For example, iff(t) differs fromf(t ur T) by 10%, then E(T) in Eq. (2) is (0. I)2 _-- 0.01, whereas in Eq. (1), S.,( -) 2 is equal to!.2. This is an advantage of in- verted autocorrelation over the usual form. Once determined, the period in samples for a given frame was passed to an array which converted the period to a midinote. The array could be "tuned" so that the tuning of the instrument studied corresponded to the appropriate indices of the array. III. RESULTS The frequency tracking program was run on the sounds graphed in Figs. 1-6 as representatives of the keyboard, wind, and string families of instruments. For each sound sample, we determined the maximum, subjecto the con- ERECT o INVERTED E t i rn e FIG. 8. Frequency tracking results using four autocorrelation methods on the violin. 2351 J. Acoust. Soc. Am., Vol. 89, No. 5, May 1991 J. C. Brown and B. Zhang: Musical pitch tracking 2351

straints previously described, for the autocorrelation function and for the narrowed autocorrelation function. We similarly determined the minimum for the inverted and narrowed inverted autocorrelation functions. These periods were then converted to midinotes to map the results to musical notes. Our results are shown as graphs of midinote againstime in Figs. 8-10. Since each of the instruments was playing a scale, perfect results would consist of a sequential set of horizontalines rising by one or two midinotes corresponding to a half or a whole step in the scale. Thus each error can be recognized as a point off the appropriate horizontal line. Errors are summarized in Table I. The poorest frequency tracking results of any of our calculations were obtained using narrowed inverted autocorrelation on the violin as shown at the bottom of Fig. 8. Conventional (N = 2) inverted did the best with less than a 3% error. Both narrowed and conventional "erect" autocorrelation did well for this case, with conventional slightly better. The violin spectrum is the richest of the instruments we studied with many high harmonics present. As mentioned previously, this means an autocorrelation line width that is already narrow, and it is not surprising that the calculations with narrowing are poor. This is due to the difficulties with discrete sampling discussed previously. Consistent with this ERECT '"l INVERTED E t i rn e FIG. 9. Frequency tracking results using four autocorrelation methods on the piano. 2352 J. Acoust. Soc. Am., Vol. 89, No. 5, May 1991 J.C. Brown and B. Zhang: Musical pitch tracking 2352

mechanism, all errors have a midinote that is too low, usually by an octave. This means that the peak (or valley) corresponding to the period was missed and the second peak (or valley), corresponding to two periods, was reported. We should note that, with the exception of the narrowed inverted results, both these and the following results could be perfect simply by requiring the frequency reported by two adjacent frames to agree. This was not done as it wotdd eliminate the basis for comparisons in the results. The results on the piano in Fig. 9 again show errors in transition regions. There is no significant difference in the results of narrowed over conventional for the inverted autocorrelation. Narrowed erect autocorrelation does best of all. Two errors on the graph using conventional erect autocorrelation are marked where an average of the two notes in a transition is obtained. Correct results are obtained by the narrowed calculation, and the mechanism for this improvement will be discussed further in the following section. ERECT INVERTED E t i rn e FIG. 10. Frequency tracking results using four autocorrelation methods on the flute. 2353 J. Acoust. Soc. Am., Vol. 89, No. 5, May 1991 J.C. Brown and B. Zhang: Musical pitch tracking 2353

TABLE I. Summary of errors by frequency trackers. Instrument Method Terms Frames Errors Violin inverted N' = 2 147 5 N= 4 26 erect N = 2 6 N---5 9 Piano inverted N = 2 528 10 N=4 9 erect N = 2 11 N=5 8 Flute inverted N = 2 246 5 N=4 3 erect N = 2 2 N=5 I cal, the spectra in this study varied widely and must thus indicate success for this method for a broad variety of musical sounds. With the exception of the narrowed inverted calculation for the violin sound, perfect results could have been obtained for the sounds in this study simply by requiring that results from two successive frames agree. ACKNOWLEDGMENTS JCB is very grateful to the Marilyn Brachman Hoffman Committee of Wellesley College for a fellowship for released time during which some of this work was accomplished. We would also like to thank Ken Malsky for some of the early work on frequency tracking with the inverted autocorrelation function. The results on the flute in Fig. 10 were the best of the instruments we studied. Again narrowed autocorrelation does a little better than conventional for both calculations with erect doing slightly better than inverted. Frames marked with arrows give an average of two notes in transition regions with narrowed doing better than conventional at reporting a single correct note. IV. DISCUSSION AND CONCLUSIONS As mentioned, conventional autocorrelation returns the average of two notes in some transition regions, whereas narrowed autocorrelation is better able to choose between the two notes. This mechanism was discussed in Brown and Puckette (1989). A simulation of a signal in the presence of low-amplitude noise was carried out. The signal consisted of a pure tone with another pure tone with a nearby frequency serving as noise. It was found that the presence of the noise shifted the position of the peak due to the signal for the conventional autocorrelation. With sufficient narrowing, however, the single shifted peak splits into two recognizable components with the signal peak at its proper position. It is this mechanism that is responsible for some of the improved results of narrowed over conventional autocorrelation in the transition regions for the flute and the piano. The disadvantages of narrowed autocorrelation are twofold. First, the longer analysis time means less is known about the exact time for which the calculated midinote applies; i.e., we have the usual time/frequency trade-off. Second, the calculation is a little more expensive computationally as we have an extra N-- 2 additions, where N is the number of terms included in Eq. ( 1 ) or (2). Finally, it should be emphasized that both conventional and narrowed autocorrelation have proven to be excellent frequency trackers for the musical sounds of this study. While single examples of musical instruments can be atypi- It is easy to verify that for the case N = 2 with a sinusold, one oblains a function identical to the usual autocorrelation but with the maxima and minima interchanged. Amuedo, J. (1985). "Periodicity Estimation by Hypothesis-Directed Search," ICASSP-IEEE International Conference on Acoustics, Speech, and Signal Processing, 395-398. Brown, J. C., and Puckette, M. S. (1987). "Musical Information from a Narrowed Autocorrelation Function," Proceedings of the 1987 International Conference Computer Music, Urbana, Illinois, pp. 84-88. Brown, J. C., and Puckene, M. S. (1989). "Calculation of a Narrowed Autocorrelation Function," J. Acoust. Soc. Am. 85, 1595-1601. Chafe, C., Jaffe, D., Kashima, K., Mont-Reynaud, B., and Smith, J. (1985). "Techniques for Note Identification in Polyphonic Music," Proceedings of the International Conference of Computer Music, Vancouver, B.C., pp. 399-405. Chafe, C., and Jaffe, D. (1986). "Source Separation and Note Identification in Polyphonic Music," Proc. ICASSP, Tokyo. Foster, S., Schloss, W. A., and Rockmore, A. J. (1982). "Toward an lnlelligent Editor of Digital Audio: Signal Processing Methods," Cornput. Music J. 6, 42-5 I. Mont-Reynaud, B. (1985). "Problem-solving Strategies in a Music Transcription System," Proe. IJCAI, 916-918. Moorer, James A. (1975). "On the Segmentation and Analysis of Continuous Musical Sound by Digital Computer," Ph.D. dissertation, Stanford Department of Music Rep. No. STAN-M3. Piszczalski, M., and Galler, B. F. (1977). "Automatic Music Transcription," Cornput. Music J. 1, 24-31. Piszczalski, M., and Galler, B. F. (1979}."Predicting Musical Pitch from Component Frequency Ratios," J. Acoust. Soc. Am. 66, 710-720. Schloss, W. A. (1985). "On the Automatic Transcription of Percussive Music--from Acoustic Signal to High-Level Analysis," Ph.D. thesis, Department of Music Rep. No. STAN-M-27, Stanford University. Schroeder, M. R. (1968). "Period Histogram and Product Spectrum: New Methods for Fundamental-Frequency Measurements," J. Acoust. Soc. Am. 43, 829-834. Serra, X., and Wood, P. (1988}. "Overview CCRMA (Recent Work)," Department of Music Tech. Rep. STAN-M-44, Stanford University. Terhardt, E. (1979). "Calculating Virtual Pitch," Hear. Res. 1, 155-182. Terhardt, E., Stoll, G., and Swarm, M. (1982). "Algorithms of Extraction of Pitch and Pitch Saliance from Complex Tonal Signals," J. Aeoust. Soc. Am. 71, 679-688. Vercoe B., and Puckette, M. (1985). "Synthetic Rehearsal: Training the Synthetic Performer," Proceedings of the ICMC, pp. 275-289. Vercoe, B. (1984). "The Synthetic Performer in the Context of Live Performance," Proceedings of the ICMC, pp. 199-200. 2354 J. Acoust. Soc. Am., Vol. 89, No. 5, May 1991 J.C. Brown and B. Zhang: Musical pitch tracking 2354