ELEC 484 Project Pitch Synchronous Overlap-Add

Joshua Patton
University of Victoria, BC, Canada

This report discusses steps towards implementing a real-time audio system based on the Pitch Synchronous Overlap and Add (PSOLA) algorithm. This time-domain algorithm, along with its formant-preserving variant (PSOLAF), is explored to produce the desired pitch manipulation effects. Background information is provided, together with the motivation for using PSOLAF pitch shifting over less complex methods such as Time Stretch and Resample and Delay Line Modulation. An ideal solution for implementing the system is discussed, along with a timeline of events to completion and some possible audio test clips for evaluation.

1.0 Introduction

The importance of pitch manipulation in digital audio processing and effects cannot be overstated. Applications for pitch shifting include vocoders (such as those in cell phones), creation of realistic choir effects from a single singer, high-end audio playback equipment, audio editing and recording software, and voice disguising [SPL00]. The major motivation behind this project was to demonstrate a reliable way to modify the pitch of an audio source for any of these applications. PSOLA methods offer some of the best sound reproduction with the fewest drawbacks and are contrasted briefly with several other ways to modify the pitch of an audio signal.

2.0 Pitch Related Methods

There are several key ways to modify a source signal's pitch. The methods below are related to pitch shifting and cause a change in pitch, but they are not well suited to modern applications for the reasons given in each subsection.

2.1 Variable Speed Replay

This method of pitch shifting is very straightforward: the original sound is played back at an increased or decreased rate, which shifts the pitch.
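As a rough illustration of variable speed replay, the following Python sketch reads the input at a fractional position that advances by a speed factor c per output sample. Linear interpolation between neighbouring samples is an assumption made here (the report does not say how fractional positions are handled), and the function name and the 200 Hz test tone are hypothetical. Python is used only to keep the sketch self-contained; it is not part of the project's MATLAB code.

```python
import math


def variable_speed_replay(x, c):
    """Read x at c times the original rate, using linear interpolation."""
    if c <= 0:
        raise ValueError("c must be positive")
    n_out = int((len(x) - 1) / c) + 1
    y = []
    for n in range(n_out):
        pos = n * c                       # fractional read position in x
        i = int(pos)
        frac = pos - i
        nxt = x[i + 1] if i + 1 < len(x) else x[i]
        y.append((1.0 - frac) * x[i] + frac * nxt)
    return y


fs = 8000
tone = [math.sin(2 * math.pi * 200 * n / fs) for n in range(800)]  # 200 Hz, 0.1 s
fast = variable_speed_replay(tone, 2.0)   # c > 1: shorter clip, higher pitch
slow = variable_speed_replay(tone, 0.5)   # c < 1: longer clip, lower pitch
```

With c = 2 the clip is half as long and every output sample is an input sample taken two steps apart, so the 200 Hz tone is heard at 400 Hz. Duration and pitch cannot be changed independently with this method, which is the drawback the section describes.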
In equation form, $x_{replay}(n) = x_{in}(c \cdot n)$, where c < 1 gives time expansion and c > 1 gives time compression.

Figure 1: VSR Leading to Time and Spectral Envelope Distortion [DAFX]

Figure 1 shows the detrimental effects on the signal, mainly that the duration of the clip is expanded or compressed depending on the pitch shift. This type of shifting also changes the spectral envelope, which makes the signal sound like a chipmunk when compressed (c > 1) and more like a baritone when expanded (c < 1). These effects are undesirable for practical use.

2.2 Delay-Line Modulation

This method has been described in several publications and can be implemented in several ways [BB89, DAFX]. The first of the proposed methods implements a pitch shift using two sawtooth waves, set half a period apart, to control time-varying delay lines. The resulting output waveforms are multiplied by a cross-fade function and divided into blocks. When the blocks are read faster or slower, the pitch goes up or down accordingly. The downside is a fair amount of distortion, and the output signal becomes more prone to noise.

Figure 2: Pitch Shifting by Delay Line Modulation

Alternatively, an overlap and add scheme that does not require estimation of the fundamental frequency can be employed using three in-phase time-varying delay lines. Each line is used on a block that overlaps 2/3 of the next full block length. The result gives the same desired effect [DZ99].

2.3 SOLA Time Stretch and Resample

This method takes the original signal, applies the SOLA algorithm described in Section 2.4, and then does a linear resample to obtain an output signal of the same duration but with a shifted pitch. Resampling is done at the rate alpha*f_s, where alpha is the time-stretch constant.

2.4 Synchronous Overlap Add (SOLA)

This algorithm underlies all further study and must be understood before the more complex algorithms that are to be implemented in the real-time system. Synchronous overlap and add is done in several steps [MEJ86, RW85]:

Step 1: Separate the input signal into segments of fixed length and overlap, as shown in Figure 3 below.

Figure 3: SOLA Time Manipulation

Step 2: Shift the overlapping segments by the scaling factor (alpha).

Step 3: Search the overlapping samples for the discrete time lag of maximum similarity. At that lag, weight the samples by a fade-in/fade-out function to avoid transients, then add them together to create the final signal of changed duration.

3.0 Background

The goal of pitch shifting is to move the pitch of an audio signal up or down without losing its information, which is carried by the frequency content and the harmonic ratios. If done correctly, the new audio signal has the same length and sounds like the original signal, but at the desired pitch.

3.1 Pitch Detection/Marking

Detection and marking of pitches for the input sound are crucial to the next two algorithms.
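The SOLA steps of Section 2.4 can be sketched in Python as follows. This is a simplified illustration, not the method of [MEJ86, RW85] in detail: the segment length, hops, lag range, linear cross-fade, and the function names `best_lag` and `sola` are all assumptions chosen for the example.

```python
import math


def best_lag(out, grain, start, max_lag, overlap):
    """Lag k in [0, max_lag] that maximises the normalised cross-correlation
    between out[start+k : start+k+overlap] and grain[:overlap]."""
    ref = grain[:overlap]
    ref_energy = sum(v * v for v in ref)
    best_k, best_score = 0, -float("inf")
    for k in range(max_lag + 1):
        seg = out[start + k:start + k + overlap]
        energy = sum(v * v for v in seg) * ref_energy
        corr = sum(a * b for a, b in zip(seg, ref))
        score = corr / math.sqrt(energy) if energy > 0 else 0.0
        if score > best_score:
            best_k, best_score = k, score
    return best_k


def sola(x, alpha, seg_len=400, hop_in=200, max_lag=80):
    """Time-stretch x by roughly alpha without changing its pitch."""
    hop_out = int(round(alpha * hop_in))
    if not max_lag < hop_out < seg_len - max_lag:
        raise ValueError("alpha is out of range for these segment settings")
    out = list(x[:seg_len])
    n_seg = (len(x) - seg_len) // hop_in
    for i in range(1, n_seg + 1):
        grain = x[i * hop_in:i * hop_in + seg_len]    # Step 1: fixed-length segment
        start = i * hop_out                           # Step 2: shift by alpha
        overlap = len(out) - start - max_lag          # samples available at every lag
        k = best_lag(out, grain, start, max_lag, overlap)  # Step 3: best lag
        pos = start + k
        n_ov = len(out) - pos
        for j in range(n_ov):                         # linear fade-out / fade-in
            w = (j + 1) / (n_ov + 1)
            out[pos + j] = (1.0 - w) * out[pos + j] + w * grain[j]
        out.extend(grain[n_ov:])
    return out


x = [math.sin(2 * math.pi * n / 80) for n in range(4000)]  # period of 80 samples
y = sola(x, 1.5)
```

For this periodic test tone the lag search always finds a phase-aligned position, so the stretched output is the same sinusoid continued for about 1.45 times the original duration. Real signals will not align this cleanly, which is where the distortion of plain SOLA comes from.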
For input signals of constant pitch, the desired pitch marks can be found at the time index where the signal reaches its maximum amplitude. For more complicated signals involving multiple instruments and vocals, however, this becomes a much more involved task. The main problem then lends itself to finding a way to separate the different pitch periods of the input signal in order to accurately determine the pitch marks for each segment.

3.2 Pitch Synchronous Overlap Add (PSOLA)

This method implements the SOLA algorithm and the time-domain resample in a similar manner to that described in Section 2.3. The major difference between the two lies in the re-sampling, where an interpolation is used between pitch marks to create the desired pitch effect, as described by Moulines et al. [HMC89, MC90]. Voice and speech processing are the applications at which this particular algorithm excels.

Based on the assumption that the input can be characterized by a series of pitches, PSOLA remains a two-step process. In the first step, known as analysis, the input sound is segmented into its harmonic, non-harmonic and transient parts and then characterized by pitches. In the second step, known as synthesis, various transformations can be applied to the signal through a parameter set [SPL00]. These two phases are done as follows, with illustrations below for clarification:

I. Analysis:

1. Determine the pitch period. Divide the signal into small blocks where the pitch is considered constant, then do pitch detection on each block in succession.
2. Use a Hanning window centered on the pitch mark to extract each block, with a length of two pitch periods. This provides a smooth transition between blocks through a fade-in/fade-out effect [BJ95].

Figure 4: PSOLA Pitch Analysis [DAFX]

II. Synthesis:

1. Choose the analysis segment identified by its corresponding time marking.
2. Use the Overlap and Add algorithm, where the scaling factor (alpha) decides whether the time signal is to be expanded or compressed. A scaling factor less than 1 results in segments being discarded, giving time compression, while a scaling factor greater than 1 causes segments to be repeated, giving time expansion.
3. Finally, find the new time index in order to centre the next synthesis segment and preserve the pitch.

Figure 5: PSOLA Synthesis (time stretching) [DAFX]

The effect of this process is a shift in pitch. This is accomplished by linearly interpolating the time-stretched signal to recreate values between the samples and then re-sampling to get the desired pitch. This approach is used rather than the simple re-sampling seen in the SOLA algorithm and should offer much improved sound quality over the previously discussed methods.

3.3 PSOLA with Formant Preservation (PSOLAF)

Formant preservation is similar to resampling in the time domain, the difference being that frequency re-sampling is applied to the short-time spectral envelope rather than to the entire signal. The spectral envelope is defined as the line that goes through all the harmonic amplitudes, as seen below in Figure 6.

Figure 6: PSOLA Pitch Shifting: Frequency Re-sampling of Spectral Envelope [DAFX]

All harmonics are scaled by the scaling factor, but their amplitudes are determined by sampling the spectral envelope. For good results during analysis, pitch marks must be placed pitch-synchronously, in accordance with the local maxima of each windowed function [SPL00].

Figure 7: PSOLA Analysis (pitch shifting) [DAFX]

Preserving the formants of the signal effectively preserves the voice or instrument identity after synthesis has been completed [ML95]. Figure 7 above shows that PSOLA analysis applied to pitch shifting is identical to the analysis for time stretching. Figure 8 below shows the difference during synthesis between the time-stretch and resample method and pure pitch shifting.

Figure 8: PSOLA Synthesis (pitch shifting) [DAFX]

It is apparent during synthesis that, rather than purely adding or removing segments in blocks and thereby stretching the time, the process adds or removes segments by overlapping Hanning windows, preserving the duration of the signal while modifying its pitch.

4.0 Discussion and Results

The project's final realization was achieved with some difficulties along the way, which are examined below.

4.1 PSOLA Final Implementation

The bulk of the frustration came from trying to implement pitch scaling using the psola.m file from the DAFX text together with the timescale and resampling method shown above. The m-file TimescaleResamplePSOLA.m simply calls the psola function with different alpha values to set the amount of time scaling. However, there were problems with mismatched matrix dimensions and with index ranges, both at the Hanning window and during resampling of some signals, which caused an outright failure to process the signal for reasons that were unclear.
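The time-scale-then-resample approach described above can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the DAFX psola.m itself: pitch marks are assumed to be known and evenly spaced, the final resample uses plain linear interpolation, and the names `hann`, `psola_stretch` and `resample_linear` are hypothetical. The bounds checks show one possible safeguard against out-of-range grain indices; whether such indices were the actual cause of the failures reported here is not established.

```python
import math


def hann(n):
    """Hann window of length n with non-zero end points."""
    return [0.5 * (1.0 - math.cos(2.0 * math.pi * (k + 1) / (n + 1))) for k in range(n)]


def psola_stretch(x, marks, alpha):
    """Time-stretch x by alpha by repeating or dropping two-period grains."""
    periods = [b - a for a, b in zip(marks, marks[1:])]
    periods.append(periods[-1])
    # Keep only marks whose two-period grain lies fully inside the signal.
    grains = [(m, p) for m, p in zip(marks, periods) if m - p >= 0 and m + p < len(x)]
    n_out = math.ceil(len(x) * alpha)
    out = [0.0] * n_out
    tk = float(grains[0][1])                      # first synthesis pitch mark
    while round(tk) < n_out:
        m, pit = min(grains, key=lambda g: abs(alpha * g[0] - tk))
        start, stop = round(tk) - pit, round(tk) + pit
        if stop >= n_out:
            break
        if start >= 0:                            # skip grains starting before sample 0
            win = hann(2 * pit + 1)
            for k in range(2 * pit + 1):
                out[start + k] += x[m - pit + k] * win[k]
        tk += pit                                 # keep the original pitch period
    return out


def resample_linear(y, n_out):
    """Resample y to n_out samples by linear interpolation (needs n_out >= 2)."""
    step = (len(y) - 1) / (n_out - 1)
    res = []
    for n in range(n_out):
        pos = n * step
        i = min(int(pos), len(y) - 2)
        frac = pos - i
        res.append((1.0 - frac) * y[i] + frac * y[i + 1])
    return res


fs = 8000
x = [math.sin(2 * math.pi * 100 * n / fs) for n in range(4000)]  # 100 Hz tone
marks = list(range(20, 4000, 80))                 # one mark per period, at the peaks
stretched = psola_stretch(x, marks, 1.5)          # 1.5 times longer, same pitch
shifted = resample_linear(stretched, len(x))      # original length, pitch times 1.5
```

Because the final step resamples the whole waveform, the spectral envelope moves with the pitch, which is the effect discussed next.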
The output of the psola algorithm is indeed shifted in pitch, but it does not preserve the character of the original signal. This can be heard on the x1.wav clip, where the higher-pitched voice sounds chipmunk-like and the lower-pitched one sounds very baritone. These effects were successfully overcome using formant preservation, as seen in the next section.
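To illustrate why re-spacing grains avoids the chipmunk effect, the Python sketch below shifts pitch by placing unmodified two-period grains closer together (spacing of period/beta) instead of resampling them. It is a simplified stand-in for the idea, not the psolaf1.m listing: the synthetic "voice" (a decaying 1 kHz resonance excited every 80 samples), the pitch marks, and the names `psola_pitch_shift` and `dft_mag` are assumptions made for the example.

```python
import cmath
import math


def hann(n):
    """Hann window of length n with non-zero end points."""
    return [0.5 * (1.0 - math.cos(2.0 * math.pi * (k + 1) / (n + 1))) for k in range(n)]


def psola_pitch_shift(x, marks, beta):
    """Shift pitch by beta; each grain is copied unchanged, only its spacing changes."""
    periods = [b - a for a, b in zip(marks, marks[1:])]
    periods.append(periods[-1])
    grains = [(m, p) for m, p in zip(marks, periods) if m - p >= 0 and m + p < len(x)]
    out = [0.0] * len(x)                          # duration is preserved
    tk = float(grains[0][0])
    while round(tk) < len(x):
        m, pit = min(grains, key=lambda g: abs(g[0] - tk))
        start, stop = round(tk) - pit, round(tk) + pit
        if start >= 0 and stop < len(x):
            win = hann(2 * pit + 1)
            for k in range(2 * pit + 1):
                out[start + k] += x[m - pit + k] * win[k]
        tk += pit / beta                          # new pitch period
    return out


def dft_mag(v, freq, fs):
    """Magnitude of the DFT of v evaluated at a single frequency in Hz."""
    return abs(sum(s * cmath.exp(-2j * math.pi * freq * n / fs) for n, s in enumerate(v)))


fs, period = 8000, 80                             # 100 Hz pitch
pulse = [math.exp(-j / 15.0) * math.sin(2 * math.pi * 1000 * j / fs) for j in range(period)]
x = pulse * 50                                    # 4000 samples, resonance at 1 kHz
marks = [k * period + 2 for k in range(50)]       # peak of each pitch period
y = psola_pitch_shift(x, marks, 1.5)              # new pitch of about 150 Hz

near_formant = dft_mag(y, 1050, fs)               # harmonic closest to the 1 kHz resonance
moved_formant = dft_mag(y, 1500, fs)              # where resampling would put the resonance
```

In this example the strongest energy of the shifted signal stays near the original 1 kHz resonance rather than moving to 1.5 kHz. The gamma parameter in psolaf1.m goes one step further and lets the grains themselves be stretched, so the formants can be moved deliberately.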

4.2 PSOLA with Formant Preservation Final Implementation

This method was successful overall in producing the desired pitch scaling. The produced sounds are almost identical to the unmodified original, with a slight addition of noise or clipping that may be due to the Hanning windows. When scaling either up or down, the integrity of the source is preserved well, so the resulting signal sounds like the source but at a higher or lower pitch depending on the alpha parameter. An alpha value greater than 1 results in a higher pitch, while a fractional alpha less than 1 results in a lower pitch. Changing the gamma parameter offered another range of options that was explored only briefly.

Test files and outputs are available, ranging from very simple and short tones to longer clips including vocals. The parameters used in the test code to generate the resulting sounds can be found in the Matlab script file PSOLA_Formant.m, available in the appendix and on my website. Also available are several original .wav files from the DAFX text and the modified ones, in .zip format to save space. The original files used in testing are:

1) la.wav
2) flute2.wav
3) moore_guitar.wav
4) x1.wav

Sound files and m-files can be accessed at: /Elec484/Elec484.html

Several sound files that were tested did not work with the algorithm. These included some proposed in the initial report submission as well as extra samples of music from my own library. As before with the PSOLA algorithm, the error message seemed to be related to pitch marks. This conclusion is an educated guess that the pitch marker program that was developed is not sophisticated enough to place the marks properly for complex signals with many harmonics. It could also be that many of these signals, which included multiple instruments and the like, did not have any primary harmonics to work on, and this could have led to the errors incurred.
Another explanation may be that too many pitch marks were found (erroneously), so that the shifted Hanning window could not operate properly on the signal. This is where the psolaf1.m program failed with the more complex signals, and where the psola.m program failed for those signals as well as others. Since all the signals that did run through psolaf1.m had fairly distinct pitches, it is reasonable to expect that the algorithm would work for other signals as well, provided that their pitch marks are determined with very good accuracy.

5.0 Conclusions

Considering the limited time available for this project, much further work could be done in this area. That said, the produced sound files show that the project was successful in realizing a system that can modify pitch while maintaining the integrity of the original sound signal and source.

6.0 Future Considerations

Although this project was intended to be implemented as a real-time system, it was impossible to do so given the time available and the problems encountered. With further resources and a better understanding of how to transfer programs from a Matlab environment to a real-time system, this PSOLA with formant preservation program would be implemented in Marsyas. Due to limitations in time and ability this did not occur. More importantly, accurate pitch detection should be considered a high priority, as the better methods that preserve the message quality need input pitch marks on which to centre some sort of windowing method. Without properly placed marks this project is not very useful for any real-world application.
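One low-cost way to investigate the pitch-mark failures discussed in Section 4.2 would be to check the marks before synthesis. The Python sketch below is a hypothetical helper, not part of the project code: the name `check_pitch_marks`, the period limits, and the exact checks are assumptions. It flags marks that are not strictly increasing, periods outside a plausible range, and marks whose two-period grain would leave the signal.

```python
def check_pitch_marks(marks, n_samples, min_period=20, max_period=800):
    """Return a list of (index, problem) pairs for suspicious pitch marks."""
    problems = []
    for i in range(len(marks) - 1):
        period = marks[i + 1] - marks[i]
        if period <= 0:
            problems.append((i, "marks not strictly increasing"))
            continue
        if not min_period <= period <= max_period:
            problems.append((i, "implausible period %d" % period))
        if marks[i] - period < 0 or marks[i] + period >= n_samples:
            problems.append((i, "grain leaves the signal"))
    return problems


clean = check_pitch_marks([100, 180, 260, 340], 1000)
suspect = check_pitch_marks([10, 90, 95, 400], 450)
```

An empty result does not prove that the marks are pitch-synchronous, but a non-empty one points to marks that are likely to break the windowing step. Note that the psola listings already drop a first or last mark whose grain would leave the signal, so a flag on those two positions alone need not indicate a failure.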

7.0 References

[BB89] K. Bogdanowicz and R. Blecher. Using multiple processors for real-time audio effects. In AES 7th International Conference.

[BJ95] R. Bristow-Johnson. A detailed analysis of a time-domain formant-corrected pitch shifting algorithm. J. Audio Eng. Soc., 43(5).

[DAFX] U. Zölzer. Digital Audio Effects. John Wiley and Sons.

[DZ99] S. Disch and U. Zölzer. Modulation and delay line based digital audio effects. In Proc. DAFX-99 Digital Audio Effects Workshop, pp. 4-8, Trondheim, December.

[HMC89] C. Hamon, E. Moulines and F. Charpentier. A diphone synthesis system based on time-domain prosodic modifications of speech. In Proc. ICASSP.

[MC90] E. Moulines and F. Charpentier. Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication, 9(5/6).

[MEJ86] J. Makhoul and A. El-Jaroudi. Time-scale modification in medium to low rate speech coding. In Proc. ICASSP.

[ML95] E. Moulines and J. Laroche. Non-parametric techniques for pitch-scale and time-scale modification of speech. Speech Communication, 16.

[RW85] S. Roucos and A.M. Wilgus. High quality time-scale modification for speech. In Proc. ICASSP.

[SPL00] N. Schnell, G. Peeters, S. Lemouton, P. Manoury and X. Rodet. Synthesizing a choir in real-time using Pitch Synchronous Overlap Add (PSOLA). Ircam Centre Georges-Pompidou, pp. 1-4.

Appendix: PSOLA_Formant.m

% Pitch Shifting by PSOLA with Formant Preservation
% Josh Patton
% PSOLA_Formant.m
% Files required:
%   psolaf1.m
%   pitchmarker.m
% Note: the statements that compute m (pitchmarker) and y (psolaf1)
% before each wavwrite call do not appear in this listing.
clear all
close all
clc

%% la.wav
[x,fs,nbits]=wavread('la.wav');
gamma=2;
wavwrite(y, fs, 'la_gamma2.wav');
beta=(3/2);
wavwrite(y, fs, 'la_high.wav');
beta=(3/4);
wavwrite(y, fs, 'la_low.wav');

%% flute2.wav
[x,fs,nbits]=wavread('flute2.wav');
gamma=2;
wavwrite(y, fs, 'flute2_gamma2.wav');
beta=(3/2);
wavwrite(y, fs, 'flute2_high.wav');
beta=(3/4);
wavwrite(y, fs, 'flute2_low.wav');

%% moore_guitar.wav
[x,fs,nbits]=wavread('moore_guitar.wav');
wavwrite(y, fs, 'moore_guitar_gamma1.wav');
gamma=2;
wavwrite(y, fs, 'moore_guitar_gamma2.wav');
beta=(3/2);
wavwrite(y, fs, 'moore_guitar_high.wav');
beta=(3/4);
wavwrite(y, fs, 'moore_guitar_low.wav');

%% x1.wav
[x,fs,nbits]=wavread('x1.wav');
wavwrite(y, fs, 'x1_gamma1.wav');
gamma=2;
wavwrite(y, fs, 'x1_gamma2.wav');
beta=(3/2);
wavwrite(y, fs, 'x1_high.wav');
beta=(3/4);
wavwrite(y, fs, 'x1_low.wav');

Appendix: psolaf1.m

% psolaf1.m
% This function file performs pitch synchronous overlap add pitch shifting
% with formant preservation, using pitch marks from an external source.
% Based on psolaf.m from DAFX.
function out=psolaf1(in,m,alpha,beta,gamma)
%...
% gamma newformantfreq/oldformantfreq
%...
P = diff(m); %compute pitch periods
if m(1)<=P(1), %remove first pitch mark
    m=m(2:length(m));
    P=P(2:length(P));
end
if m(length(m))+P(length(P))>length(in) %remove last pitch mark
    m=m(1:length(m)-1);
else
    P=[P P(length(P))];
end
Lout=ceil(length(in)*alpha);
out=zeros(1,Lout); %output signal
tk = P(1)+1; %output pitch mark
while round(tk)<Lout
    [minimum i]=min(abs(alpha*m-tk)); % find analysis segment
    pit=P(i); pitStr=floor(pit/gamma);
    gr=in(m(i)-pit:m(i)+pit).*hanning(2*pit+1);
    gr=interp1(-pit:1:pit,gr,-pitStr*gamma:gamma:pit); % stretch segm.
    iniGr=round(tk)-pitStr; endGr=round(tk)+pitStr;
    if endGr>Lout, break; end
    out(iniGr:endGr)=out(iniGr:endGr)+gr; % overlap new segment
    tk=tk+pit/beta;
end %while

Appendix: TimescaleResamplePSOLA.m

% Pitch Shifting by PSOLA Time Stretching and Resampling
% Josh Patton
% TimescaleResamplePSOLA.m
% Files required:
%   psola.m
%   pitchmarker.m

%% test one x1
[x,fs,nbits]=wavread('x1.wav');
m=pitchmarker(x); % assumed: this call is not in the original listing
beta=1;           % assumed: pure time scaling, value not in the original listing
alpha=(3/2);
y=psola(x,m,alpha,beta);
y=resample(y,length(x),length(y));
wavwrite(y, fs, 'psola_high_x1.wav');
alpha=(3/4);
y=psola(x,m,alpha,beta);
y=resample(y,length(x),length(y));
wavwrite(y, fs, 'psola_low_x1.wav');

%% test moore_guitar
[x,fs,nbits]=wavread('moore_guitar.wav');
m=pitchmarker(x); % assumed: this call is not in the original listing
alpha=1.5;
y=psola(x,m,alpha,beta);
y=resample(y,length(x),length(y));
wavwrite(y, fs, 'psola_high_moore_guitar.wav');
alpha=0.75;
y=psola(x,m,alpha,beta);
y=resample(y,length(x),length(y));
wavwrite(y, fs, 'psola_low_moore_guitar.wav');

Appendix: psola.m

%psola.m
%from DAFX
%Josh Patton
function out=psola(in,m,alpha,beta)
% in    input signal
% m     pitch marks (from PitchMarker.m function)
% alpha time stretching factor
% beta  pitch shifting factor
P = diff(m); %compute pitch periods
if m(1)<=P(1), %remove first pitch mark
    m=m(2:length(m));
    P=P(2:length(P));
end
if m(length(m))+P(length(P))>length(in) %remove last pitch mark
    m=m(1:length(m)-1);
else
    P=[P P(length(P))];
end
Lout=ceil(length(in)*alpha);
out=zeros(1,Lout); %output signal
tk = P(1)+1; %output pitch mark
while round(tk)<Lout
    [minimum i] = min( abs(alpha*m - tk) ); %find analysis segment
    pit=P(i);
    st=m(i)-pit;
    en=m(i)+pit;
    gr = in(st:en).* hanning(2*pit+1);
    iniGr=round(tk)-pit;
    endGr=round(tk)+pit;
    if endGr>Lout, break; end
    out(iniGr:endGr) = out(iniGr:endGr)+gr'; %overlap new segment
    tk=tk+pit/beta;
end %while

Appendix: pitchmarker.m

% pitchmarker.m
% Josh Patton
% Finds all the pitch marks in the input file and returns the
% markings in a matrix
function [ pitch ] = pitchmarker(blk_section)
%% test from within (comment out the above function line)
%[x,fs,bit]=wavread('moore_guitar.wav');
%blk_section=x;

%% Detection
% initial setup
mark=[1:length(blk_section)]*0;
last_pos=1;
place=1;
blk_size=300;
i=1;
while last_pos+floor(blk_size*1.7) < length(blk_section)
    % grabs the next block to examine
    temp=blk_section(last_pos+50:last_pos+floor(blk_size*1.7));
    % finds the high point in the block
    [mag,place]=max(temp);
    % check for a signal in the current block
    if mag < 0.01
        place=length(temp);
        mode = 0;
        mark(place+last_pos+50)=1;
        pitch(i)=place+last_pos+50;
    else
        mode = 1;
    end
    % check for pitch mark before current pitch mark
    while mode == 1
        % find the largest point in block from start to current pitch mark
        [mag2,place2]=max(temp(1:place-50));
        % check if high mark has great enough magnitude to be a pitch mark
        if mag2 > 0.90*mag
            mag=mag2;
            place=place2;
        else
            mode = 0;
            mark(place+last_pos+50)=1;
            pitch(i)=place+last_pos+50;
        end
    end
    % next block to look at is 50 samples after current block
    blk_size=place+50;
    % makes sure next blk_size is of large enough size
    if blk_size < 150
        blk_size=150;
    end
    last_pos=place+last_pos+50;
    i=i+1;
end

%% Plotting if needed
% figure(1)
% hold on
% plot(mark)
% plot(blk_section,'r')


More information

A New "Duration-Adapted TR" Waveform Capture Method Eliminates Severe Limitations

A New Duration-Adapted TR Waveform Capture Method Eliminates Severe Limitations 31 st Conference of the European Working Group on Acoustic Emission (EWGAE) Th.3.B.4 More Info at Open Access Database www.ndt.net/?id=17567 A New "Duration-Adapted TR" Waveform Capture Method Eliminates

More information

System Identification

System Identification System Identification Arun K. Tangirala Department of Chemical Engineering IIT Madras July 26, 2013 Module 9 Lecture 2 Arun K. Tangirala System Identification July 26, 2013 16 Contents of Lecture 2 In

More information

SINGING EXPRESSION TRANSFER FROM ONE VOICE TO ANOTHER FOR A GIVEN SONG. Sangeon Yong, Juhan Nam

SINGING EXPRESSION TRANSFER FROM ONE VOICE TO ANOTHER FOR A GIVEN SONG. Sangeon Yong, Juhan Nam SINGING EXPRESSION TRANSFER FROM ONE VOICE TO ANOTHER FOR A GIVEN SONG Sangeon Yong, Juhan Nam Graduate School of Culture Technology, KAIST {koragon2, juhannam}@kaist.ac.kr ABSTRACT We present a vocal

More information

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC Vishweshwara Rao, Sachin Pant, Madhumita Bhaskar and Preeti Rao Department of Electrical Engineering, IIT Bombay {vishu, sachinp,

More information

Single Channel Speech Enhancement Using Spectral Subtraction Based on Minimum Statistics

Single Channel Speech Enhancement Using Spectral Subtraction Based on Minimum Statistics Master Thesis Signal Processing Thesis no December 2011 Single Channel Speech Enhancement Using Spectral Subtraction Based on Minimum Statistics Md Zameari Islam GM Sabil Sajjad This thesis is presented

More information

CONTENT-BASED MELODIC TRANSFORMATIONS OF AUDIO MATERIAL FOR A MUSIC PROCESSING APPLICATION

CONTENT-BASED MELODIC TRANSFORMATIONS OF AUDIO MATERIAL FOR A MUSIC PROCESSING APPLICATION CONTENT-BASED MELODIC TRANSFORMATIONS OF AUDIO MATERIAL FOR A MUSIC PROCESSING APPLICATION Emilia Gómez, Gilles Peterschmitt, Xavier Amatriain, Perfecto Herrera Music Technology Group Universitat Pompeu

More information

CSC475 Music Information Retrieval

CSC475 Music Information Retrieval CSC475 Music Information Retrieval Monophonic pitch extraction George Tzanetakis University of Victoria 2014 G. Tzanetakis 1 / 32 Table of Contents I 1 Motivation and Terminology 2 Psychacoustics 3 F0

More information

A Composition for Clarinet and Real-Time Signal Processing: Using Max on the IRCAM Signal Processing Workstation

A Composition for Clarinet and Real-Time Signal Processing: Using Max on the IRCAM Signal Processing Workstation A Composition for Clarinet and Real-Time Signal Processing: Using Max on the IRCAM Signal Processing Workstation Cort Lippe IRCAM, 31 rue St-Merri, Paris, 75004, France email: lippe@ircam.fr Introduction.

More information

AN AUDIO effect is a signal processing technique used

AN AUDIO effect is a signal processing technique used IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 1 Adaptive Digital Audio Effects (A-DAFx): A New Class of Sound Transformations Vincent Verfaille, Member, IEEE, Udo Zölzer, Member, IEEE, and

More information

Comparison Parameters and Speaker Similarity Coincidence Criteria:

Comparison Parameters and Speaker Similarity Coincidence Criteria: Comparison Parameters and Speaker Similarity Coincidence Criteria: The Easy Voice system uses two interrelating parameters of comparison (first and second error types). False Rejection, FR is a probability

More information

SYNTHESIS FROM MUSICAL INSTRUMENT CHARACTER MAPS

SYNTHESIS FROM MUSICAL INSTRUMENT CHARACTER MAPS Published by Institute of Electrical Engineers (IEE). 1998 IEE, Paul Masri, Nishan Canagarajah Colloquium on "Audio and Music Technology"; November 1998, London. Digest No. 98/470 SYNTHESIS FROM MUSICAL

More information

6.111 Final Project: Digital Debussy- A Hardware Music Composition Tool. Jordan Addison and Erin Ibarra November 6, 2014

6.111 Final Project: Digital Debussy- A Hardware Music Composition Tool. Jordan Addison and Erin Ibarra November 6, 2014 6.111 Final Project: Digital Debussy- A Hardware Music Composition Tool Jordan Addison and Erin Ibarra November 6, 2014 1 Purpose Professional music composition software is expensive $150-$600, typically

More information

Appendix D. UW DigiScope User s Manual. Willis J. Tompkins and Annie Foong

Appendix D. UW DigiScope User s Manual. Willis J. Tompkins and Annie Foong Appendix D UW DigiScope User s Manual Willis J. Tompkins and Annie Foong UW DigiScope is a program that gives the user a range of basic functions typical of a digital oscilloscope. Included are such features

More information

Why Engineers Ignore Cable Loss

Why Engineers Ignore Cable Loss Why Engineers Ignore Cable Loss By Brig Asay, Agilent Technologies Companies spend large amounts of money on test and measurement equipment. One of the largest purchases for high speed designers is a real

More information

Music Radar: A Web-based Query by Humming System

Music Radar: A Web-based Query by Humming System Music Radar: A Web-based Query by Humming System Lianjie Cao, Peng Hao, Chunmeng Zhou Computer Science Department, Purdue University, 305 N. University Street West Lafayette, IN 47907-2107 {cao62, pengh,

More information

Agilent PN Time-Capture Capabilities of the Agilent Series Vector Signal Analyzers Product Note

Agilent PN Time-Capture Capabilities of the Agilent Series Vector Signal Analyzers Product Note Agilent PN 89400-10 Time-Capture Capabilities of the Agilent 89400 Series Vector Signal Analyzers Product Note Figure 1. Simplified block diagram showing basic signal flow in the Agilent 89400 Series VSAs

More information

Calibrate, Characterize and Emulate Systems Using RFXpress in AWG Series

Calibrate, Characterize and Emulate Systems Using RFXpress in AWG Series Calibrate, Characterize and Emulate Systems Using RFXpress in AWG Series Introduction System designers and device manufacturers so long have been using one set of instruments for creating digitally modulated

More information

TERRESTRIAL broadcasting of digital television (DTV)

TERRESTRIAL broadcasting of digital television (DTV) IEEE TRANSACTIONS ON BROADCASTING, VOL 51, NO 1, MARCH 2005 133 Fast Initialization of Equalizers for VSB-Based DTV Transceivers in Multipath Channel Jong-Moon Kim and Yong-Hwan Lee Abstract This paper

More information

Elasticity Imaging with Ultrasound JEE 4980 Final Report. George Michaels and Mary Watts

Elasticity Imaging with Ultrasound JEE 4980 Final Report. George Michaels and Mary Watts Elasticity Imaging with Ultrasound JEE 4980 Final Report George Michaels and Mary Watts University of Missouri, St. Louis Washington University Joint Engineering Undergraduate Program St. Louis, Missouri

More information

Removal of Decaying DC Component in Current Signal Using a ovel Estimation Algorithm

Removal of Decaying DC Component in Current Signal Using a ovel Estimation Algorithm Removal of Decaying DC Component in Current Signal Using a ovel Estimation Algorithm Majid Aghasi*, and Alireza Jalilian** *Department of Electrical Engineering, Iran University of Science and Technology,

More information

Automatic Rhythmic Notation from Single Voice Audio Sources

Automatic Rhythmic Notation from Single Voice Audio Sources Automatic Rhythmic Notation from Single Voice Audio Sources Jack O Reilly, Shashwat Udit Introduction In this project we used machine learning technique to make estimations of rhythmic notation of a sung

More information

Audio Processing Exercise

Audio Processing Exercise Name: Date : Audio Processing Exercise In this exercise you will learn to load, playback, modify, and plot audio files. Commands for loading and characterizing an audio file To load an audio file (.wav)

More information

TEPZZ A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (51) Int Cl.: H04S 7/00 ( ) H04R 25/00 (2006.

TEPZZ A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (51) Int Cl.: H04S 7/00 ( ) H04R 25/00 (2006. (19) TEPZZ 94 98 A_T (11) EP 2 942 982 A1 (12) EUROPEAN PATENT APPLICATION (43) Date of publication: 11.11. Bulletin /46 (1) Int Cl.: H04S 7/00 (06.01) H04R /00 (06.01) (21) Application number: 141838.7

More information

TEPZZ 94 98_A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (43) Date of publication: Bulletin 2015/46

TEPZZ 94 98_A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (43) Date of publication: Bulletin 2015/46 (19) TEPZZ 94 98_A_T (11) EP 2 942 981 A1 (12) EUROPEAN PATENT APPLICATION (43) Date of publication: 11.11.1 Bulletin 1/46 (1) Int Cl.: H04S 7/00 (06.01) H04R /00 (06.01) (21) Application number: 1418384.0

More information

Experiments on musical instrument separation using multiplecause

Experiments on musical instrument separation using multiplecause Experiments on musical instrument separation using multiplecause models J Klingseisen and M D Plumbley* Department of Electronic Engineering King's College London * - Corresponding Author - mark.plumbley@kcl.ac.uk

More information

Iterative Direct DPD White Paper

Iterative Direct DPD White Paper Iterative Direct DPD White Paper Products: ı ı R&S FSW-K18D R&S FPS-K18D Digital pre-distortion (DPD) is a common method to linearize the output signal of a power amplifier (PA), which is being operated

More information

Signal processing in the Philips 'VLP' system

Signal processing in the Philips 'VLP' system Philips tech. Rev. 33, 181-185, 1973, No. 7 181 Signal processing in the Philips 'VLP' system W. van den Bussche, A. H. Hoogendijk and J. H. Wessels On the 'YLP' record there is a single information track

More information

PRELIMINARY INFORMATION. Professional Signal Generation and Monitoring Options for RIFEforLIFE Research Equipment

PRELIMINARY INFORMATION. Professional Signal Generation and Monitoring Options for RIFEforLIFE Research Equipment Integrated Component Options Professional Signal Generation and Monitoring Options for RIFEforLIFE Research Equipment PRELIMINARY INFORMATION SquareGENpro is the latest and most versatile of the frequency

More information

y POWER USER MUSIC PRODUCTION and PERFORMANCE With the MOTIF ES Mastering the Sample SLICE function

y POWER USER MUSIC PRODUCTION and PERFORMANCE With the MOTIF ES Mastering the Sample SLICE function y POWER USER MUSIC PRODUCTION and PERFORMANCE With the MOTIF ES Mastering the Sample SLICE function Phil Clendeninn Senior Product Specialist Technology Products Yamaha Corporation of America Working with

More information

The Measurement Tools and What They Do

The Measurement Tools and What They Do 2 The Measurement Tools The Measurement Tools and What They Do JITTERWIZARD The JitterWizard is a unique capability of the JitterPro package that performs the requisite scope setup chores while simplifying

More information

6.UAP Project. FunPlayer: A Real-Time Speed-Adjusting Music Accompaniment System. Daryl Neubieser. May 12, 2016

6.UAP Project. FunPlayer: A Real-Time Speed-Adjusting Music Accompaniment System. Daryl Neubieser. May 12, 2016 6.UAP Project FunPlayer: A Real-Time Speed-Adjusting Music Accompaniment System Daryl Neubieser May 12, 2016 Abstract: This paper describes my implementation of a variable-speed accompaniment system that

More information

Query By Humming: Finding Songs in a Polyphonic Database

Query By Humming: Finding Songs in a Polyphonic Database Query By Humming: Finding Songs in a Polyphonic Database John Duchi Computer Science Department Stanford University jduchi@stanford.edu Benjamin Phipps Computer Science Department Stanford University bphipps@stanford.edu

More information

Tempo and Beat Analysis

Tempo and Beat Analysis Advanced Course Computer Science Music Processing Summer Term 2010 Meinard Müller, Peter Grosche Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Tempo and Beat Analysis Musical Properties:

More information

Module 8 : Numerical Relaying I : Fundamentals

Module 8 : Numerical Relaying I : Fundamentals Module 8 : Numerical Relaying I : Fundamentals Lecture 28 : Sampling Theorem Objectives In this lecture, you will review the following concepts from signal processing: Role of DSP in relaying. Sampling

More information

Onset Detection and Music Transcription for the Irish Tin Whistle

Onset Detection and Music Transcription for the Irish Tin Whistle ISSC 24, Belfast, June 3 - July 2 Onset Detection and Music Transcription for the Irish Tin Whistle Mikel Gainza φ, Bob Lawlor*, Eugene Coyle φ and Aileen Kelleher φ φ Digital Media Centre Dublin Institute

More information

Automatic LP Digitalization Spring Group 6: Michael Sibley, Alexander Su, Daphne Tsatsoulis {msibley, ahs1,

Automatic LP Digitalization Spring Group 6: Michael Sibley, Alexander Su, Daphne Tsatsoulis {msibley, ahs1, Automatic LP Digitalization 18-551 Spring 2011 Group 6: Michael Sibley, Alexander Su, Daphne Tsatsoulis {msibley, ahs1, ptsatsou}@andrew.cmu.edu Introduction This project was originated from our interest

More information

Voice & Music Pattern Extraction: A Review

Voice & Music Pattern Extraction: A Review Voice & Music Pattern Extraction: A Review 1 Pooja Gautam 1 and B S Kaushik 2 Electronics & Telecommunication Department RCET, Bhilai, Bhilai (C.G.) India pooja0309pari@gmail.com 2 Electrical & Instrumentation

More information

A few white papers on various. Digital Signal Processing algorithms. used in the DAC501 / DAC502 units

A few white papers on various. Digital Signal Processing algorithms. used in the DAC501 / DAC502 units A few white papers on various Digital Signal Processing algorithms used in the DAC501 / DAC502 units Contents: 1) Parametric Equalizer, page 2 2) Room Equalizer, page 5 3) Crosstalk Cancellation (XTC),

More information

Topic 10. Multi-pitch Analysis

Topic 10. Multi-pitch Analysis Topic 10 Multi-pitch Analysis What is pitch? Common elements of music are pitch, rhythm, dynamics, and the sonic qualities of timbre and texture. An auditory perceptual attribute in terms of which sounds

More information

Music 209 Advanced Topics in Computer Music Lecture 4 Time Warping

Music 209 Advanced Topics in Computer Music Lecture 4 Time Warping Music 209 Advanced Topics in Computer Music Lecture 4 Time Warping 2006-2-9 Professor David Wessel (with John Lazzaro) (cnmat.berkeley.edu/~wessel, www.cs.berkeley.edu/~lazzaro) www.cs.berkeley.edu/~lazzaro/class/music209

More information

Musical Hit Detection

Musical Hit Detection Musical Hit Detection CS 229 Project Milestone Report Eleanor Crane Sarah Houts Kiran Murthy December 12, 2008 1 Problem Statement Musical visualizers are programs that process audio input in order to

More information

Expressive Singing Synthesis based on Unit Selection for the Singing Synthesis Challenge 2016

Expressive Singing Synthesis based on Unit Selection for the Singing Synthesis Challenge 2016 Expressive Singing Synthesis based on Unit Selection for the Singing Synthesis Challenge 2016 Jordi Bonada, Martí Umbert, Merlijn Blaauw Music Technology Group, Universitat Pompeu Fabra, Spain jordi.bonada@upf.edu,

More information

DATA COMPRESSION USING THE FFT

DATA COMPRESSION USING THE FFT EEE 407/591 PROJECT DUE: NOVEMBER 21, 2001 DATA COMPRESSION USING THE FFT INSTRUCTOR: DR. ANDREAS SPANIAS TEAM MEMBERS: IMTIAZ NIZAMI - 993 21 6600 HASSAN MANSOOR - 993 69 3137 Contents TECHNICAL BACKGROUND...

More information

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY Eugene Mikyung Kim Department of Music Technology, Korea National University of Arts eugene@u.northwestern.edu ABSTRACT

More information

Reference Guide Version 1.0

Reference Guide Version 1.0 Reference Guide Version 1.0 1 1) Introduction Thank you for purchasing Monster MIX. If this is the first time you install Monster MIX you should first refer to Sections 2, 3 and 4. Those chapters of the

More information

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication Proceedings of the 3 rd International Conference on Control, Dynamic Systems, and Robotics (CDSR 16) Ottawa, Canada May 9 10, 2016 Paper No. 110 DOI: 10.11159/cdsr16.110 A Parametric Autoregressive Model

More information

Combining Instrument and Performance Models for High-Quality Music Synthesis

Combining Instrument and Performance Models for High-Quality Music Synthesis Combining Instrument and Performance Models for High-Quality Music Synthesis Roger B. Dannenberg and Istvan Derenyi dannenberg@cs.cmu.edu, derenyi@cs.cmu.edu School of Computer Science, Carnegie Mellon

More information

Design of a Speaker Recognition Code using MATLAB

Design of a Speaker Recognition Code using MATLAB Design of a Speaker Recognition Code using MATLAB E. Darren Ellis Department of Computer and Electrical Engineering University of Tennessee, Knoxville Tennessee 37996 (Submitted: 09 May 2001) This project

More information

hit), and assume that longer incidental sounds (forest noise, water, wind noise) resemble a Gaussian noise distribution.

hit), and assume that longer incidental sounds (forest noise, water, wind noise) resemble a Gaussian noise distribution. CS 229 FINAL PROJECT A SOUNDHOUND FOR THE SOUNDS OF HOUNDS WEAKLY SUPERVISED MODELING OF ANIMAL SOUNDS ROBERT COLCORD, ETHAN GELLER, MATTHEW HORTON Abstract: We propose a hybrid approach to generating

More information

ACCURATE ANALYSIS AND VISUAL FEEDBACK OF VIBRATO IN SINGING. University of Porto - Faculty of Engineering -DEEC Porto, Portugal

ACCURATE ANALYSIS AND VISUAL FEEDBACK OF VIBRATO IN SINGING. University of Porto - Faculty of Engineering -DEEC Porto, Portugal ACCURATE ANALYSIS AND VISUAL FEEDBACK OF VIBRATO IN SINGING José Ventura, Ricardo Sousa and Aníbal Ferreira University of Porto - Faculty of Engineering -DEEC Porto, Portugal ABSTRACT Vibrato is a frequency

More information

SMS Composer and SMS Conductor: Applications for Spectral Modeling Synthesis Composition and Performance

SMS Composer and SMS Conductor: Applications for Spectral Modeling Synthesis Composition and Performance SMS Composer and SMS Conductor: Applications for Spectral Modeling Synthesis Composition and Performance Eduard Resina Audiovisual Institute, Pompeu Fabra University Rambla 31, 08002 Barcelona, Spain eduard@iua.upf.es

More information

Hugo Technology. An introduction into Rob Watts' technology

Hugo Technology. An introduction into Rob Watts' technology Hugo Technology An introduction into Rob Watts' technology Copyright Rob Watts 2014 About Rob Watts Audio chip designer both analogue and digital Consultant to silicon chip manufacturers Designer of Chord

More information

Erasing 9840 and 9940 tapes

Erasing 9840 and 9940 tapes Erasing 9840 and 9940 tapes Erasing data tapes was fairly simple in the past. Bulk erasers, also known as degausers, did a good job of demagnetizing the tapes and erasing all data. With newer tapes, such

More information

Real-time Granular Sampling Using the IRCAM Signal Processing Workstation. Cort Lippe IRCAM, 31 rue St-Merri, Paris, 75004, France

Real-time Granular Sampling Using the IRCAM Signal Processing Workstation. Cort Lippe IRCAM, 31 rue St-Merri, Paris, 75004, France Cort Lippe 1 Real-time Granular Sampling Using the IRCAM Signal Processing Workstation Cort Lippe IRCAM, 31 rue St-Merri, Paris, 75004, France Running Title: Real-time Granular Sampling [This copy of this

More information

Toward a Computationally-Enhanced Acoustic Grand Piano

Toward a Computationally-Enhanced Acoustic Grand Piano Toward a Computationally-Enhanced Acoustic Grand Piano Andrew McPherson Electrical & Computer Engineering Drexel University 3141 Chestnut St. Philadelphia, PA 19104 USA apm@drexel.edu Youngmoo Kim Electrical

More information

Rec. ITU-R BT RECOMMENDATION ITU-R BT * WIDE-SCREEN SIGNALLING FOR BROADCASTING

Rec. ITU-R BT RECOMMENDATION ITU-R BT * WIDE-SCREEN SIGNALLING FOR BROADCASTING Rec. ITU-R BT.111-2 1 RECOMMENDATION ITU-R BT.111-2 * WIDE-SCREEN SIGNALLING FOR BROADCASTING (Signalling for wide-screen and other enhanced television parameters) (Question ITU-R 42/11) Rec. ITU-R BT.111-2

More information

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1 02/18 Using the new psychoacoustic tonality analyses 1 As of ArtemiS SUITE 9.2, a very important new fully psychoacoustic approach to the measurement of tonalities is now available., based on the Hearing

More information

An Introduction to the Spectral Dynamics Rotating Machinery Analysis (RMA) package For PUMA and COUGAR

An Introduction to the Spectral Dynamics Rotating Machinery Analysis (RMA) package For PUMA and COUGAR An Introduction to the Spectral Dynamics Rotating Machinery Analysis (RMA) package For PUMA and COUGAR Introduction: The RMA package is a PC-based system which operates with PUMA and COUGAR hardware to

More information

ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals

ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals Purdue University: ECE438 - Digital Signal Processing with Applications 1 ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals October 6, 2010 1 Introduction It is often desired

More information

Music for Alto Saxophone & Computer

Music for Alto Saxophone & Computer Music for Alto Saxophone & Computer by Cort Lippe 1997 for Stephen Duke 1997 Cort Lippe All International Rights Reserved Performance Notes There are four classes of multiphonics in section III. The performer

More information

Digital Signal Processing

Digital Signal Processing COMP ENG 4TL4: Digital Signal Processing Notes for Lecture #1 Friday, September 5, 2003 Dr. Ian C. Bruce Room CRL-229, Ext. 26984 ibruce@mail.ece.mcmaster.ca Office Hours: TBA Instructor: Teaching Assistants:

More information

Interacting with a Virtual Conductor

Interacting with a Virtual Conductor Interacting with a Virtual Conductor Pieter Bos, Dennis Reidsma, Zsófia Ruttkay, Anton Nijholt HMI, Dept. of CS, University of Twente, PO Box 217, 7500AE Enschede, The Netherlands anijholt@ewi.utwente.nl

More information