advanced spectral processing

Size: px
Start display at page:

Download "advanced spectral processing"

Transcription

1 advanced spectral processing Jordi Janer Music Technology Group Universitat Pompeu Fabra, Barcelona upf.edu CDSIM UPF May 2014 hkp://mtg.upf.edu/

2 Outline 1. IntroducNon to spectral processing 2. Decomposing sound signals

3 1- IntroducNon to spectral processing

4 Simple Periodic Waves (sine waves) Characterized by: period: T amplitude A phase φ Fundamental frequency in cycles per second, or Hz F 0 =1/T A y(0)=a sin(φ) Time (s) T y = A sin(2πf 0 t+φ) (Many slides come from materials from Dan Jurafsky) CDSIM UPF May 2014

5 Simple periodic waves Frequency: 5 cycles in.5 seconds = 10 cycles/second = 10 Hz Amplitude: 1 Phase: at time 0 seconds, y(0)=a sin(2π10t+φ)=sin(φ)=0 φ=πk, k! φ=0 Equation: y(t) = A sin(20πt) CDSIM UPF May 2014

6 (more) Basic facts about sound waves f = c/λ where c = speed of sound, and λ = wave length (longitud d ona) in meters λ c=3440 cm/s ( 345 m/s) at 21 degrees Celsius at sea level Example: with λ=10m, frequency f=34,5hz CDSIM UPF May 2014

7 Speech sound waves A likle piece from the waveform of a vowel Y axis: Amplitude = amount of air pressure at that Nme point PosiNve is compression Zero is normal air pressure, neganve is rarefacnon (expansion) X axis: Nme. CDSIM UPF May 2014

8 Fundamental frequency The fundamental frequency (or F0) is the lowest frequency of a periodic (voiced) waveform, produced by any particular instrument (our vocal folds are like a complicated instrument) It is also called the first harmonic, in comparison with its integer multiples called second, third, etc. harmonics Fundamental Frequency = first harmonic 2 nd harmonic 3 rd harmonic 4 th harmonic 5 th harmonic 6 th harmonic 7 th harmonic CDSIM UPF May 2014

9 Fundamental frequency In speech, see for example the waveform of a vowel The fundamental frequency could be computed as the number of repennons/second of the wave: Above vowel has 10 reps in secs - > freq. is 10/ = 258 Hz This is the speed at which vocal folds move, hence voicing speed Each peak corresponds to an opening of the vocal folds CDSIM UPF May 2014

10 Pitch Pitch is defined as the perceived fundamental frequency of a sound F0 and pitch are different concepts: F0 corresponds to a physically measurable frequency Pitch corresponds to a perceivable frequency The relanonship between pitch and F0 is not linear human pitch perception is most accurate between 100Hz and 1000Hz. Linear in this range: At F0 1 =200Hz, if Pitch 2 =Pitch 1 /2 then F Hz Logarithmic above 1000Hz: At F0 1 =5KHz if Pitch 2 =Pitch 1 /2 then F0 2 <2KHz SNll, in the literature many Nmes F0 and pitch are treated as the same CDSIM UPF May 2014

11 F0 tracking F0 can be computed using several techniques, and using tools like PRAAT CDSIM UPF May 2014

12 Frequency analysis Waves have different frequencies Hz Time (s) Hz Time (s) CDSIM UPF May 2014

13 Frequency analysis Complex waves: Adding a 100 Hz and 1000 Hz wave together Time (s) CDSIM UPF May 2014

14 Spectrum Frequency components (100 and 1000 Hz) on x- axis Amplitude 100 Frequency in Hz 1000 CDSIM UPF May 2014

15 Fourier transform analysis Fourier analysis: any wave can be represented as the (infinite) sum of sine waves of different frequencies (amplitude, phase) For connnuous signals: For discrete signals: When N is finite (and relanvely short) we call the resulnng signal the short term spectrum (STFT) CDSIM UPF May 2014

16 Spectrum example 40 Magnitude (in db) Frequency (Hz) Spectrum of one instant in an actual soundwave: many components across the frequency range Each frequency component of the wave is separated CDSIM UPF May 2014

17 Formants Formants are defined as the spectra peaks of the sound spectrum envelope Formants are independent of the F0 frequency, as they are defined over the envelope of the spectrum They are created by the pass of the sound through the vocal tract CDSIM UPF May 2014

18 Seeing formants: the spectrogram CDSIM UPF May 2014

19 Example What about Helium voice? hkp:// CDSIM UPF May 2014

20 1. IntroducNon to acousnc signals 2. Spectral analysis 3. ApplicaNons of spectral processing

21 Spectrogram CDSIM UPF May 2014

22 Spectrogram Time- frequency representanon Short- Nme windowing Fast Fourier Transform (FFT) Available tools: Sonic Visualizer (for music analysis) Praat (for speech analysis) Other resources: Live spectrogram: hkp://labrosa.ee.columbia.edu/expo/ CDSIM UPF May 2014

23 Window size Understanding Time- Frequency resolunon Long windows: good freq resolunon Short windows: good temporal resolunon CDSIM UPF May 2014

24 Observing test signals Two near tones Noise burst Chirp Pure tones Harmonic richness (square/saw) Low tone SonicVisualizer h.p://mtg.upf.edu/~jjaner/teaching/cdsim2014/test_various_signals.wav CDSIM UPF May 2014

25 ApplicaNons of spectral processing technologies for the analysis of sound and music technologies for the transforma9on of sound and music technologies for the synthesis of sound and music CDSIM UPF May 2014

26 Analysis Skore automanc singing voice ranng CDSIM UPF May 2014

27 Transforming signals Approaches for spectral transformanons: SMS: hkp://mtg.upf.edu/sms Phase Vocoder Basic transformanons Pitch transposinon Harmonic/noise decomposinon Time- stretching (Matlab internal MTG sosware) CDSIM UPF May 2014

28 Transforming signals Basic transformanons Original Pitch transposinon Harmonic/noise decomposinon Time- stretching (50x) CDSIM UPF May 2014

29 TransformaNon Time scaling DetecNon of transients RepeNNon/Removal of spectral frames Demo: Fast Remixing Original fast Nme- varying remix Swing detecnon Tempo detecnon at 8 th note level Change swing factor Demo: video CDSIM UPF May 2014

30 Synthesis Sample- based (Violin) Gesture modelling to provide a more realisnc synthesis Voice- driven synthesis Voice analysis is used to control the synthesis of a violin sound CDSIM UPF May 2014

31 2- Decomposing sound signals Signal decomposinon and Source separanon

32 source separanon CDSIM UPF May 2014

33 The objecnve Music is distributed as mixdowns in various formats Users aim to further manipulate music signals in mulnple applicanon contexts (karaoke, soloing, remixing, etc.) * from mulntrack originals

34 The problem Music signals are complex Variety of music styles and instrumentanons Modern producnon techniques go beyond linear combinanon of recorded acousnc sources (FX s, digital synth, etc.)

35 Background I ExisNng generic SS approaches: Spectral subtrac9on IntuiNve Well- studied (industrial interest) Good for speech/stanonary noise reducnon Less appropriate for music signals

36 Background II ExisNng music- specific approaches I: Pan- frequency masks o Assumes non- overlapping signals in Nme- frequency bins o Stereo signals are required o Amplitude rano between L and R FFT bins o 2D user interface Examples o Good for simple excerpts o Bad for complex mixes * Loses brightness, vocals less reduced due to reverb, flute is also removed,.,

37 Background III ExisNng music- specific approaches II: Non- nega9ve Matrix Factoriza9on (NMF) Magnitude spectrogram (non- neganve) DecomposiNon as matrix product W (spectral basis) and H (gain acivaions over Ime) Spectrum frame explained as linear combinanon of R basis. MinimizaNon problem that finds W and H: min(d (V, WH)) H W V

38 NMF details Non- nega9ve Matrix Factoriza9on I 3 spectral basis W 1 overlapping note H: acivaion gains

39 NMF details Non- nega9ve Matrix Factoriza9on I 3 spectral basis W 2 overlapping notes H: acivaion gains

40 NMF challenges Predominant instrument separanon (pitch/nmbre analysis) Completeness of instrument removal (akack/sustain, residual/breathing noise, unvoiced consonants, ) Percussive instruments separanon (Transient detecnon, wideband spectrum) Polyphonic instrument separanon (blind and score- informed)

41 Vocals/Background separanon Music print decomposinon: song containing a region without target (e.g. vocals), basis model learnt from the user- selected music- print Music print (without vocals) Region with vocals

42 Vocals/Background separanon Music print decomposinon: Demos: original mute Background excerpt Basis decomposinon W H W bgd Input Basis decomposinon [W bgd, W other ] [H bgd,h oth er ] Wiener filtering (W bgd, H bgd )/(W M) output mute

43 Vocals/Background separanon Music print decomposinon: Demos: original solo Background excerpt Basis decomposinon W H W bgd Input Basis decomposinon [W bgd, W other ] [H bgd,h oth er ] Wiener filtering (W other, H other )/(W M) output solo

44 Vocals/Background separanon Music print decomposinon: not always possible accompaniment (music print) changes throughout the song target always present in some secnons

45 Vocals/Background separanon Solu9on à Predominant Pitch detec9on e.g MELODIA (J. Salomon, MTG) SeparaNon à Binary mask from pitch informanon Simplest approach Nme- frequency mask 1 s at harmonic posinons, 0 s rest Can be combined with pan- frequency mask Demos Voice is properly removed/akenuated Bass guitar is comb- filtered, and horns akenuated Soloing produces more arnfacts original mute solo

46 Vocals/Background removal Advanced separa9on approaches Special treatment for vocals: source / filter models Breathiness residual (noise added on formant envelope) Demos: Original Solo version without residual Solo version with residual

47 Vocals/Background removal Advanced separa9on approaches Special treatment for vocals Breathiness residual (noise added on formant envelope) Unvoiced FricaIve modelling /s/, /f/, /sh/, supervised basis from solo phoneme recordings o Demos: Original Solo version /s/ are missing Solo version /s/ are present Spectrogram of the fricanve recording used to train the spectral basis.

48 Piano decomposinon/retouch Using instrument- specific NMF dicnonaries Piano model of 88 notes (W matrix is pre- learned). Retouch use- case: Amateur recording with errors. The user can select and correct individual notes aser decomposinon/ separanon. Original (ref) Original (played with errors) Separated notes Corrected remix

49 Score- informed separanon Mul9ple sources in an orchestral recording Score data is used to ininalize acnvanons matrix H Video Demo: Isolated instruments: violin, cello, oboe, bassoon, flute

50 Other potennal applicanons

51 Other potennal applicanons Singer replacement Original Vocals mute Vocaloid Clara Vocaloid Clara Mix Drums enhancement Original Drums+6dB Drums- 6dB Step- remixer for drums user- supervised transients (onsets Nme and instrument) Original All Drums Single Instrument

52 Other potennal applicanons (piano) Mono- to- stereo upmixing Input Mozart K331 recording (RWC dataset) Output Upmixing from Mono les/right hands are panned in stereo

53 Other potennal applicanons (piano) Automa9c accompaniment Input Mozart K331 recording (RWC dataset) Output automanc object detecnon String ensemble resynthesis synth solo (Kontakt) mixture

54 Thanks! Jordi Janer Music Technology Group Universitat Pompeu Fabra, Barcelona upf.edu CDSIM UPF May 2014 hkp://mtg.upf.edu/~jjaner

Simple Harmonic Motion: What is a Sound Spectrum?

Simple Harmonic Motion: What is a Sound Spectrum? Simple Harmonic Motion: What is a Sound Spectrum? A sound spectrum displays the different frequencies present in a sound. Most sounds are made up of a complicated mixture of vibrations. (There is an introduction

More information

UNIVERSITY OF DUBLIN TRINITY COLLEGE

UNIVERSITY OF DUBLIN TRINITY COLLEGE UNIVERSITY OF DUBLIN TRINITY COLLEGE FACULTY OF ENGINEERING & SYSTEMS SCIENCES School of Engineering and SCHOOL OF MUSIC Postgraduate Diploma in Music and Media Technologies Hilary Term 31 st January 2005

More information

Experiments on musical instrument separation using multiplecause

Experiments on musical instrument separation using multiplecause Experiments on musical instrument separation using multiplecause models J Klingseisen and M D Plumbley* Department of Electronic Engineering King's College London * - Corresponding Author - mark.plumbley@kcl.ac.uk

More information

Automatic music transcription

Automatic music transcription Music transcription 1 Music transcription 2 Automatic music transcription Sources: * Klapuri, Introduction to music transcription, 2006. www.cs.tut.fi/sgn/arg/klap/amt-intro.pdf * Klapuri, Eronen, Astola:

More information

Digital music synthesis using DSP

Digital music synthesis using DSP Digital music synthesis using DSP Rahul Bhat (124074002), Sandeep Bhagwat (123074011), Gaurang Naik (123079009), Shrikant Venkataramani (123079042) DSP Application Assignment, Group No. 4 Department of

More information

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB Laboratory Assignment 3 Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB PURPOSE In this laboratory assignment, you will use MATLAB to synthesize the audio tones that make up a well-known

More information

BBN ANG 141 Foundations of phonology Phonetics 3: Acoustic phonetics 1

BBN ANG 141 Foundations of phonology Phonetics 3: Acoustic phonetics 1 BBN ANG 141 Foundations of phonology Phonetics 3: Acoustic phonetics 1 Zoltán Kiss Dept. of English Linguistics, ELTE z. kiss (elte/delg) intro phono 3/acoustics 1 / 49 Introduction z. kiss (elte/delg)

More information

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF)

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF) PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF) "The reason I got into playing and producing music was its power to travel great distances and have an emotional impact on people" Quincey

More information

Music Source Separation

Music Source Separation Music Source Separation Hao-Wei Tseng Electrical and Engineering System University of Michigan Ann Arbor, Michigan Email: blakesen@umich.edu Abstract In popular music, a cover version or cover song, or

More information

2. AN INTROSPECTION OF THE MORPHING PROCESS

2. AN INTROSPECTION OF THE MORPHING PROCESS 1. INTRODUCTION Voice morphing means the transition of one speech signal into another. Like image morphing, speech morphing aims to preserve the shared characteristics of the starting and final signals,

More information

Topic 10. Multi-pitch Analysis

Topic 10. Multi-pitch Analysis Topic 10 Multi-pitch Analysis What is pitch? Common elements of music are pitch, rhythm, dynamics, and the sonic qualities of timbre and texture. An auditory perceptual attribute in terms of which sounds

More information

Pitch-Synchronous Spectrogram: Principles and Applications

Pitch-Synchronous Spectrogram: Principles and Applications Pitch-Synchronous Spectrogram: Principles and Applications C. Julian Chen Department of Applied Physics and Applied Mathematics May 24, 2018 Outline The traditional spectrogram Observations with the electroglottograph

More information

Lecture 9 Source Separation

Lecture 9 Source Separation 10420CS 573100 音樂資訊檢索 Music Information Retrieval Lecture 9 Source Separation Yi-Hsuan Yang Ph.D. http://www.citi.sinica.edu.tw/pages/yang/ yang@citi.sinica.edu.tw Music & Audio Computing Lab, Research

More information

Drum Source Separation using Percussive Feature Detection and Spectral Modulation

Drum Source Separation using Percussive Feature Detection and Spectral Modulation ISSC 25, Dublin, September 1-2 Drum Source Separation using Percussive Feature Detection and Spectral Modulation Dan Barry φ, Derry Fitzgerald^, Eugene Coyle φ and Bob Lawlor* φ Digital Audio Research

More information

Further Topics in MIR

Further Topics in MIR Tutorial Automatisierte Methoden der Musikverarbeitung 47. Jahrestagung der Gesellschaft für Informatik Further Topics in MIR Meinard Müller, Christof Weiss, Stefan Balke International Audio Laboratories

More information

Tempo and Beat Analysis

Tempo and Beat Analysis Advanced Course Computer Science Music Processing Summer Term 2010 Meinard Müller, Peter Grosche Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Tempo and Beat Analysis Musical Properties:

More information

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes hello Jay Biernat Third author University of Rochester University of Rochester Affiliation3 words jbiernat@ur.rochester.edu author3@ismir.edu

More information

MELODY EXTRACTION FROM POLYPHONIC AUDIO OF WESTERN OPERA: A METHOD BASED ON DETECTION OF THE SINGER S FORMANT

MELODY EXTRACTION FROM POLYPHONIC AUDIO OF WESTERN OPERA: A METHOD BASED ON DETECTION OF THE SINGER S FORMANT MELODY EXTRACTION FROM POLYPHONIC AUDIO OF WESTERN OPERA: A METHOD BASED ON DETECTION OF THE SINGER S FORMANT Zheng Tang University of Washington, Department of Electrical Engineering zhtang@uw.edu Dawn

More information

THE importance of music content analysis for musical

THE importance of music content analysis for musical IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 1, JANUARY 2007 333 Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With

More information

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam GCT535- Sound Technology for Multimedia Timbre Analysis Graduate School of Culture Technology KAIST Juhan Nam 1 Outlines Timbre Analysis Definition of Timbre Timbre Features Zero-crossing rate Spectral

More information

CSC475 Music Information Retrieval

CSC475 Music Information Retrieval CSC475 Music Information Retrieval Monophonic pitch extraction George Tzanetakis University of Victoria 2014 G. Tzanetakis 1 / 32 Table of Contents I 1 Motivation and Terminology 2 Psychacoustics 3 F0

More information

Lecture 10 Harmonic/Percussive Separation

Lecture 10 Harmonic/Percussive Separation 10420CS 573100 音樂資訊檢索 Music Information Retrieval Lecture 10 Harmonic/Percussive Separation Yi-Hsuan Yang Ph.D. http://www.citi.sinica.edu.tw/pages/yang/ yang@citi.sinica.edu.tw Music & Audio Computing

More information

Low-Latency Instrument Separation in Polyphonic Audio Using Timbre Models

Low-Latency Instrument Separation in Polyphonic Audio Using Timbre Models Low-Latency Instrument Separation in Polyphonic Audio Using Timbre Models Ricard Marxer, Jordi Janer, and Jordi Bonada Universitat Pompeu Fabra, Music Technology Group, Roc Boronat 138, Barcelona {ricard.marxer,jordi.janer,jordi.bonada}@upf.edu

More information

Voice & Music Pattern Extraction: A Review

Voice & Music Pattern Extraction: A Review Voice & Music Pattern Extraction: A Review 1 Pooja Gautam 1 and B S Kaushik 2 Electronics & Telecommunication Department RCET, Bhilai, Bhilai (C.G.) India pooja0309pari@gmail.com 2 Electrical & Instrumentation

More information

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller) Topic 11 Score-Informed Source Separation (chroma slides adapted from Meinard Mueller) Why Score-informed Source Separation? Audio source separation is useful Music transcription, remixing, search Non-satisfying

More information

Measurement of overtone frequencies of a toy piano and perception of its pitch

Measurement of overtone frequencies of a toy piano and perception of its pitch Measurement of overtone frequencies of a toy piano and perception of its pitch PACS: 43.75.Mn ABSTRACT Akira Nishimura Department of Media and Cultural Studies, Tokyo University of Information Sciences,

More information

9.35 Sensation And Perception Spring 2009

9.35 Sensation And Perception Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 9.35 Sensation And Perception Spring 29 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. Hearing Kimo Johnson April

More information

Making music with voice. Distinguished lecture, CIRMMT Jan 2009, Copyright Johan Sundberg

Making music with voice. Distinguished lecture, CIRMMT Jan 2009, Copyright Johan Sundberg Making music with voice MENU: A: The instrument B: Getting heard C: Expressivity The instrument Summary RADIATED SPECTRUM Level Frequency Velum VOCAL TRACT Frequency curve Formants Level Level Frequency

More information

Expressive Singing Synthesis based on Unit Selection for the Singing Synthesis Challenge 2016

Expressive Singing Synthesis based on Unit Selection for the Singing Synthesis Challenge 2016 Expressive Singing Synthesis based on Unit Selection for the Singing Synthesis Challenge 2016 Jordi Bonada, Martí Umbert, Merlijn Blaauw Music Technology Group, Universitat Pompeu Fabra, Spain jordi.bonada@upf.edu,

More information

ADSR AMP. ENVELOPE. Moog Music s Guide To Analog Synthesized Percussion. The First Step COMMON VOLUME ENVELOPES

ADSR AMP. ENVELOPE. Moog Music s Guide To Analog Synthesized Percussion. The First Step COMMON VOLUME ENVELOPES Moog Music s Guide To Analog Synthesized Percussion Creating tones for reproducing the family of instruments in which sound arises from the striking of materials with sticks, hammers, or the hands. The

More information

Single Channel Vocal Separation using Median Filtering and Factorisation Techniques

Single Channel Vocal Separation using Median Filtering and Factorisation Techniques Single Channel Vocal Separation using Median Filtering and Factorisation Techniques Derry FitzGerald, Mikel Gainza, Audio Research Group, Dublin Institute of Technology, Kevin St, Dublin 2, Ireland Abstract

More information

Robert Alexandru Dobre, Cristian Negrescu

Robert Alexandru Dobre, Cristian Negrescu ECAI 2016 - International Conference 8th Edition Electronics, Computers and Artificial Intelligence 30 June -02 July, 2016, Ploiesti, ROMÂNIA Automatic Music Transcription Software Based on Constant Q

More information

EE513 Audio Signals and Systems. Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky

EE513 Audio Signals and Systems. Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky EE513 Audio Signals and Systems Introduction Kevin D. Donohue Electrical and Computer Engineering University of Kentucky Question! If a tree falls in the forest and nobody is there to hear it, will it

More information

A METHOD OF MORPHING SPECTRAL ENVELOPES OF THE SINGING VOICE FOR USE WITH BACKING VOCALS

A METHOD OF MORPHING SPECTRAL ENVELOPES OF THE SINGING VOICE FOR USE WITH BACKING VOCALS A METHOD OF MORPHING SPECTRAL ENVELOPES OF THE SINGING VOICE FOR USE WITH BACKING VOCALS Matthew Roddy Dept. of Computer Science and Information Systems, University of Limerick, Ireland Jacqueline Walker

More information

2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics

2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics 2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics Graduate School of Culture Technology, KAIST Juhan Nam Outlines Introduction to musical tones Musical tone generation - String

More information

Lab 5 Linear Predictive Coding

Lab 5 Linear Predictive Coding Lab 5 Linear Predictive Coding 1 of 1 Idea When plain speech audio is recorded and needs to be transmitted over a channel with limited bandwidth it is often necessary to either compress or encode the audio

More information

Data Driven Music Understanding

Data Driven Music Understanding Data Driven Music Understanding Dan Ellis Laboratory for Recognition and Organization of Speech and Audio Dept. Electrical Engineering, Columbia University, NY USA http://labrosa.ee.columbia.edu/ 1. Motivation:

More information

FPFV-285/585 PRODUCTION SOUND Fall 2018 CRITICAL LISTENING Assignment

FPFV-285/585 PRODUCTION SOUND Fall 2018 CRITICAL LISTENING Assignment FPFV-285/585 PRODUCTION SOUND Fall 2018 CRITICAL LISTENING Assignment PREPARATION Track 1) Headphone check -- Left, Right, Left, Right. Track 2) A music excerpt for setting comfortable listening level.

More information

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2 To use sound properly, and fully realize its power, we need to do the following: (1) listen (2) understand basics of sound and hearing (3) understand sound's fundamental effects on human communication

More information

Music Information Retrieval

Music Information Retrieval Music Information Retrieval When Music Meets Computer Science Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Berlin MIR Meetup 20.03.2017 Meinard Müller

More information

A prototype system for rule-based expressive modifications of audio recordings

A prototype system for rule-based expressive modifications of audio recordings International Symposium on Performance Science ISBN 0-00-000000-0 / 000-0-00-000000-0 The Author 2007, Published by the AEC All rights reserved A prototype system for rule-based expressive modifications

More information

Lecture 15: Research at LabROSA

Lecture 15: Research at LabROSA ELEN E4896 MUSIC SIGNAL PROCESSING Lecture 15: Research at LabROSA 1. Sources, Mixtures, & Perception 2. Spatial Filtering 3. Time-Frequency Masking 4. Model-Based Separation Dan Ellis Dept. Electrical

More information

Note on Posted Slides. Noise and Music. Noise and Music. Pitch. PHY205H1S Physics of Everyday Life Class 15: Musical Sounds

Note on Posted Slides. Noise and Music. Noise and Music. Pitch. PHY205H1S Physics of Everyday Life Class 15: Musical Sounds Note on Posted Slides These are the slides that I intended to show in class on Tue. Mar. 11, 2014. They contain important ideas and questions from your reading. Due to time constraints, I was probably

More information

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications Matthias Mauch Chris Cannam György Fazekas! 1 Matthias Mauch, Chris Cannam, George Fazekas Problem Intonation in Unaccompanied

More information

Audio Source Separation: "De-mixing" for Production

Audio Source Separation: De-mixing for Production Audio Source Separation: "De-mixing" for Production De-mixing The Beatles at the Hollywood Bowl using Sound Source Separation James Clarke Abbey Road Studios Overview Historical Background Sound Source

More information

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound Pitch Perception and Grouping HST.723 Neural Coding and Perception of Sound Pitch Perception. I. Pure Tones The pitch of a pure tone is strongly related to the tone s frequency, although there are small

More information

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing

Book: Fundamentals of Music Processing. Audio Features. Book: Fundamentals of Music Processing. Book: Fundamentals of Music Processing Book: Fundamentals of Music Processing Lecture Music Processing Audio Features Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Meinard Müller Fundamentals

More information

MIE 402: WORKSHOP ON DATA ACQUISITION AND SIGNAL PROCESSING Spring 2003

MIE 402: WORKSHOP ON DATA ACQUISITION AND SIGNAL PROCESSING Spring 2003 MIE 402: WORKSHOP ON DATA ACQUISITION AND SIGNAL PROCESSING Spring 2003 OBJECTIVE To become familiar with state-of-the-art digital data acquisition hardware and software. To explore common data acquisition

More information

An Overview of Lead and Accompaniment Separation in Music

An Overview of Lead and Accompaniment Separation in Music Rafii et al.: An Overview of Lead and Accompaniment Separation in Music 1 An Overview of Lead and Accompaniment Separation in Music Zafar Rafii, Member, IEEE, Antoine Liutkus, Member, IEEE, Fabian-Robert

More information

Physics and Neurophysiology of Hearing

Physics and Neurophysiology of Hearing Physics and Neurophysiology of Hearing H.G. Dosch, Inst. Theor. Phys. Heidelberg I Signal and Percept II The Physics of the Ear III From the Ear to the Cortex IV Electrophysiology Part I: Signal and Percept

More information

Keywords Separation of sound, percussive instruments, non-percussive instruments, flexible audio source separation toolbox

Keywords Separation of sound, percussive instruments, non-percussive instruments, flexible audio source separation toolbox Volume 4, Issue 4, April 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Investigation

More information

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication Journal of Energy and Power Engineering 10 (2016) 504-512 doi: 10.17265/1934-8975/2016.08.007 D DAVID PUBLISHING A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations

More information

Spectrum Analyser Basics

Spectrum Analyser Basics Hands-On Learning Spectrum Analyser Basics Peter D. Hiscocks Syscomp Electronic Design Limited Email: phiscock@ee.ryerson.ca June 28, 2014 Introduction Figure 1: GUI Startup Screen In a previous exercise,

More information

Supervised Musical Source Separation from Mono and Stereo Mixtures based on Sinusoidal Modeling

Supervised Musical Source Separation from Mono and Stereo Mixtures based on Sinusoidal Modeling Supervised Musical Source Separation from Mono and Stereo Mixtures based on Sinusoidal Modeling Juan José Burred Équipe Analyse/Synthèse, IRCAM burred@ircam.fr Communication Systems Group Technische Universität

More information

Linear Time Invariant (LTI) Systems

Linear Time Invariant (LTI) Systems Linear Time Invariant (LTI) Systems Superposition Sound waves add in the air without interacting. Multiple paths in a room from source sum at your ear, only changing change phase and magnitude of particular

More information

Experimenting with Musically Motivated Convolutional Neural Networks

Experimenting with Musically Motivated Convolutional Neural Networks Experimenting with Musically Motivated Convolutional Neural Networks Jordi Pons 1, Thomas Lidy 2 and Xavier Serra 1 1 Music Technology Group, Universitat Pompeu Fabra, Barcelona 2 Institute of Software

More information

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high.

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high. Pitch The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high. 1 The bottom line Pitch perception involves the integration of spectral (place)

More information

Chapter 1. Introduction to Digital Signal Processing

Chapter 1. Introduction to Digital Signal Processing Chapter 1 Introduction to Digital Signal Processing 1. Introduction Signal processing is a discipline concerned with the acquisition, representation, manipulation, and transformation of signals required

More information

Welcome to Vibrationdata

Welcome to Vibrationdata Welcome to Vibrationdata Acoustics Shock Vibration Signal Processing February 2004 Newsletter Greetings Feature Articles Speech is perhaps the most important characteristic that distinguishes humans from

More information

DTS Neural Mono2Stereo

DTS Neural Mono2Stereo WAVES DTS Neural Mono2Stereo USER GUIDE Table of Contents Chapter 1 Introduction... 3 1.1 Welcome... 3 1.2 Product Overview... 3 1.3 Sample Rate Support... 4 Chapter 2 Interface and Controls... 5 2.1 Interface...

More information

/$ IEEE

/$ IEEE 564 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 3, MARCH 2010 Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals Jean-Louis Durrieu,

More information

Gaussian Mixture Model for Singing Voice Separation from Stereophonic Music

Gaussian Mixture Model for Singing Voice Separation from Stereophonic Music Gaussian Mixture Model for Singing Voice Separation from Stereophonic Music Mine Kim, Seungkwon Beack, Keunwoo Choi, and Kyeongok Kang Realistic Acoustics Research Team, Electronics and Telecommunications

More information

Module 8 : Numerical Relaying I : Fundamentals

Module 8 : Numerical Relaying I : Fundamentals Module 8 : Numerical Relaying I : Fundamentals Lecture 28 : Sampling Theorem Objectives In this lecture, you will review the following concepts from signal processing: Role of DSP in relaying. Sampling

More information

S I N E V I B E S FRACTION AUDIO SLICING WORKSTATION

S I N E V I B E S FRACTION AUDIO SLICING WORKSTATION S I N E V I B E S FRACTION AUDIO SLICING WORKSTATION INTRODUCTION Fraction is a plugin for deep on-the-fly remixing and mangling of sound. It features 8x independent slicers which record and repeat short

More information

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication

A Parametric Autoregressive Model for the Extraction of Electric Network Frequency Fluctuations in Audio Forensic Authentication Proceedings of the 3 rd International Conference on Control, Dynamic Systems, and Robotics (CDSR 16) Ottawa, Canada May 9 10, 2016 Paper No. 110 DOI: 10.11159/cdsr16.110 A Parametric Autoregressive Model

More information

Analysis, Synthesis, and Perception of Musical Sounds

Analysis, Synthesis, and Perception of Musical Sounds Analysis, Synthesis, and Perception of Musical Sounds The Sound of Music James W. Beauchamp Editor University of Illinois at Urbana, USA 4y Springer Contents Preface Acknowledgments vii xv 1. Analysis

More information

CHAPTER 20.2 SPEECH AND MUSICAL SOUNDS

CHAPTER 20.2 SPEECH AND MUSICAL SOUNDS Source: STANDARD HANDBOOK OF ELECTRONIC ENGINEERING CHAPTER 20.2 SPEECH AND MUSICAL SOUNDS Daniel W. Martin, Ronald M. Aarts SPEECH SOUNDS Speech Level and Spectrum Both the sound-pressure level and the

More information

1. Introduction NCMMSC2009

1. Introduction NCMMSC2009 NCMMSC9 Speech-to-Singing Synthesis System: Vocal Conversion from Speaking Voices to Singing Voices by Controlling Acoustic Features Unique to Singing Voices * Takeshi SAITOU 1, Masataka GOTO 1, Masashi

More information

PHYSICS OF MUSIC. 1.) Charles Taylor, Exploring Music (Music Library ML3805 T )

PHYSICS OF MUSIC. 1.) Charles Taylor, Exploring Music (Music Library ML3805 T ) REFERENCES: 1.) Charles Taylor, Exploring Music (Music Library ML3805 T225 1992) 2.) Juan Roederer, Physics and Psychophysics of Music (Music Library ML3805 R74 1995) 3.) Physics of Sound, writeup in this

More information

A Matlab toolbox for. Characterisation Of Recorded Underwater Sound (CHORUS) USER S GUIDE

A Matlab toolbox for. Characterisation Of Recorded Underwater Sound (CHORUS) USER S GUIDE Centre for Marine Science and Technology A Matlab toolbox for Characterisation Of Recorded Underwater Sound (CHORUS) USER S GUIDE Version 5.0b Prepared for: Centre for Marine Science and Technology Prepared

More information

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES Vishweshwara Rao and Preeti Rao Digital Audio Processing Lab, Electrical Engineering Department, IIT-Bombay, Powai,

More information

Getting Started with the LabVIEW Sound and Vibration Toolkit

Getting Started with the LabVIEW Sound and Vibration Toolkit 1 Getting Started with the LabVIEW Sound and Vibration Toolkit This tutorial is designed to introduce you to some of the sound and vibration analysis capabilities in the industry-leading software tool

More information

CTP431- Music and Audio Computing Musical Acoustics. Graduate School of Culture Technology KAIST Juhan Nam

CTP431- Music and Audio Computing Musical Acoustics. Graduate School of Culture Technology KAIST Juhan Nam CTP431- Music and Audio Computing Musical Acoustics Graduate School of Culture Technology KAIST Juhan Nam 1 Outlines What is sound? Physical view Psychoacoustic view Sound generation Wave equation Wave

More information

Automatic Construction of Synthetic Musical Instruments and Performers

Automatic Construction of Synthetic Musical Instruments and Performers Ph.D. Thesis Proposal Automatic Construction of Synthetic Musical Instruments and Performers Ning Hu Carnegie Mellon University Thesis Committee Roger B. Dannenberg, Chair Michael S. Lewicki Richard M.

More information

Lecture 7: Music

Lecture 7: Music Matthew Schwartz Lecture 7: Music Why do notes sound good? In the previous lecture, we saw that if you pluck a string, it will excite various frequencies. The amplitude of each frequency which is excited

More information

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods

Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods Drum Sound Identification for Polyphonic Music Using Template Adaptation and Matching Methods Kazuyoshi Yoshii, Masataka Goto and Hiroshi G. Okuno Department of Intelligence Science and Technology National

More information

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam CTP 431 Music and Audio Computing Basic Acoustics Graduate School of Culture Technology (GSCT) Juhan Nam 1 Outlines What is sound? Generation Propagation Reception Sound properties Loudness Pitch Timbre

More information

Digital Signal. Continuous. Continuous. amplitude. amplitude. Discrete-time Signal. Analog Signal. Discrete. Continuous. time. time.

Digital Signal. Continuous. Continuous. amplitude. amplitude. Discrete-time Signal. Analog Signal. Discrete. Continuous. time. time. Discrete amplitude Continuous amplitude Continuous amplitude Digital Signal Analog Signal Discrete-time Signal Continuous time Discrete time Digital Signal Discrete time 1 Digital Signal contd. Analog

More information

Tempo and Beat Tracking

Tempo and Beat Tracking Tutorial Automatisierte Methoden der Musikverarbeitung 47. Jahrestagung der Gesellschaft für Informatik Tempo and Beat Tracking Meinard Müller, Christof Weiss, Stefan Balke International Audio Laboratories

More information

Advanced Signal Processing 2

Advanced Signal Processing 2 Advanced Signal Processing 2 Synthesis of Singing 1 Outline Features and requirements of signing synthesizers HMM based synthesis of singing Articulatory synthesis of singing Examples 2 Requirements of

More information

Video-based Vibrato Detection and Analysis for Polyphonic String Music

Video-based Vibrato Detection and Analysis for Polyphonic String Music Video-based Vibrato Detection and Analysis for Polyphonic String Music Bochen Li, Karthik Dinesh, Gaurav Sharma, Zhiyao Duan Audio Information Research Lab University of Rochester The 18 th International

More information

Analysis of the effects of signal distance on spectrograms

Analysis of the effects of signal distance on spectrograms 2014 Analysis of the effects of signal distance on spectrograms SGHA 8/19/2014 Contents Introduction... 3 Scope... 3 Data Comparisons... 5 Results... 10 Recommendations... 10 References... 11 Introduction

More information

ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals

ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals Purdue University: ECE438 - Digital Signal Processing with Applications 1 ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals October 6, 2010 1 Introduction It is often desired

More information

Music 209 Advanced Topics in Computer Music Lecture 1 Introduction

Music 209 Advanced Topics in Computer Music Lecture 1 Introduction Music 209 Advanced Topics in Computer Music Lecture 1 Introduction 2006-1-19 Professor David Wessel (with John Lazzaro) (cnmat.berkeley.edu/~wessel, www.cs.berkeley.edu/~lazzaro) Website: Coming Soon...

More information

Lecture 1: What we hear when we hear music

Lecture 1: What we hear when we hear music Lecture 1: What we hear when we hear music What is music? What is sound? What makes us find some sounds pleasant (like a guitar chord) and others unpleasant (a chainsaw)? Sound is variation in air pressure.

More information

Music Representations

Music Representations Lecture Music Processing Music Representations Meinard Müller International Audio Laboratories Erlangen meinard.mueller@audiolabs-erlangen.de Book: Fundamentals of Music Processing Meinard Müller Fundamentals

More information

EVALUATION OF A SCORE-INFORMED SOURCE SEPARATION SYSTEM

EVALUATION OF A SCORE-INFORMED SOURCE SEPARATION SYSTEM EVALUATION OF A SCORE-INFORMED SOURCE SEPARATION SYSTEM Joachim Ganseman, Paul Scheunders IBBT - Visielab Department of Physics, University of Antwerp 2000 Antwerp, Belgium Gautham J. Mysore, Jonathan

More information

ADDING (INJECTING) NOISE TO IMPROVE RESULTS.

ADDING (INJECTING) NOISE TO IMPROVE RESULTS. D. Lee Fugal DIGITAL SIGNAL PROCESSING PRACTICAL TECHNIQUES, TIPS, AND TRICKS ADDING (INJECTING) NOISE TO IMPROVE RESULTS. 1 DITHERING 2 DITHERING -1 Dithering comes from the word Didder meaning to tremble,

More information

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1 02/18 Using the new psychoacoustic tonality analyses 1 As of ArtemiS SUITE 9.2, a very important new fully psychoacoustic approach to the measurement of tonalities is now available., based on the Hearing

More information

Fourier Transforms 1D

Fourier Transforms 1D Fourier Transforms 1D 3D Image Processing Torsten Möller Overview Recap Function representations shift-invariant spaces linear, time-invariant (LTI) systems complex numbers Fourier Transforms Transform

More information

Introduction! User Interface! Bitspeek Versus Vocoders! Using Bitspeek in your Host! Change History! Requirements!...

Introduction! User Interface! Bitspeek Versus Vocoders! Using Bitspeek in your Host! Change History! Requirements!... version 1.5 Table of Contents Introduction!... 3 User Interface!... 4 Bitspeek Versus Vocoders!... 6 Using Bitspeek in your Host!... 6 Change History!... 9 Requirements!... 9 Credits and Contacts!... 10

More information

Creative Computing II

Creative Computing II Creative Computing II Christophe Rhodes c.rhodes@gold.ac.uk Autumn 2010, Wednesdays: 10:00 12:00: RHB307 & 14:00 16:00: WB316 Winter 2011, TBC The Ear The Ear Outer Ear Outer Ear: pinna: flap of skin;

More information

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics) 1 Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics) Pitch Pitch is a subjective characteristic of sound Some listeners even assign pitch differently depending upon whether the sound was

More information

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST)

Computational Models of Music Similarity. Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST) Computational Models of Music Similarity 1 Elias Pampalk National Institute for Advanced Industrial Science and Technology (AIST) Abstract The perceived similarity of two pieces of music is multi-dimensional,

More information

TIMBRE REPLACEMENT OF HARMONIC AND DRUM COMPONENTS FOR MUSIC AUDIO SIGNALS

TIMBRE REPLACEMENT OF HARMONIC AND DRUM COMPONENTS FOR MUSIC AUDIO SIGNALS 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) TIMBRE REPLACEMENT OF HARMONIC AND DRUM COMPONENTS FOR MUSIC AUDIO SIGNALS Tomohio Naamura, Hiroazu Kameoa, Kazuyoshi

More information

Recognising Cello Performers using Timbre Models

Recognising Cello Performers using Timbre Models Recognising Cello Performers using Timbre Models Chudy, Magdalena; Dixon, Simon For additional information about this publication click this link. http://qmro.qmul.ac.uk/jspui/handle/123456789/5013 Information

More information

Onset Detection and Music Transcription for the Irish Tin Whistle

Onset Detection and Music Transcription for the Irish Tin Whistle ISSC 24, Belfast, June 3 - July 2 Onset Detection and Music Transcription for the Irish Tin Whistle Mikel Gainza φ, Bob Lawlor*, Eugene Coyle φ and Aileen Kelleher φ φ Digital Media Centre Dublin Institute

More information

NanoGiant Oscilloscope/Function-Generator Program. Getting Started

NanoGiant Oscilloscope/Function-Generator Program. Getting Started Getting Started Page 1 of 17 NanoGiant Oscilloscope/Function-Generator Program Getting Started This NanoGiant Oscilloscope program gives you a small impression of the capabilities of the NanoGiant multi-purpose

More information

6.5 Percussion scalograms and musical rhythm

6.5 Percussion scalograms and musical rhythm 6.5 Percussion scalograms and musical rhythm 237 1600 566 (a) (b) 200 FIGURE 6.8 Time-frequency analysis of a passage from the song Buenos Aires. (a) Spectrogram. (b) Zooming in on three octaves of the

More information

Soundprism: An Online System for Score-Informed Source Separation of Music Audio Zhiyao Duan, Student Member, IEEE, and Bryan Pardo, Member, IEEE

Soundprism: An Online System for Score-Informed Source Separation of Music Audio Zhiyao Duan, Student Member, IEEE, and Bryan Pardo, Member, IEEE IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, VOL. 5, NO. 6, OCTOBER 2011 1205 Soundprism: An Online System for Score-Informed Source Separation of Music Audio Zhiyao Duan, Student Member, IEEE,

More information