An interdisciplinary approach to audio effect classification

Similar documents
AN AUDIO effect is a signal processing technique used

Analysis, Synthesis, and Perception of Musical Sounds

Advance Certificate Course In Audio Mixing & Mastering.

Linear Time Invariant (LTI) Systems

S I N E V I B E S FRACTION AUDIO SLICING WORKSTATION

Fraction by Sinevibes audio slicing workstation

A prototype system for rule-based expressive modifications of audio recordings

CONTENT-BASED MELODIC TRANSFORMATIONS OF AUDIO MATERIAL FOR A MUSIC PROCESSING APPLICATION

About Giovanni De Poli. What is Model. Introduction. di Poli: Methodologies for Expressive Modeling of/for Music Performance

A Need for Universal Audio Terminologies and Improved Knowledge Transfer to the Consumer

A METHOD OF MORPHING SPECTRAL ENVELOPES OF THE SINGING VOICE FOR USE WITH BACKING VOCALS

Tempo and Beat Analysis

Polytek Reference Manual

Reference Manual. Using this Reference Manual...2. Edit Mode...2. Changing detailed operator settings...3

1 Introduction to PSQM

GCT535- Sound Technology for Multimedia Timbre Analysis. Graduate School of Culture Technology KAIST Juhan Nam

Digital Signal Processing Detailed Course Outline

Music Representations

Automatic Construction of Synthetic Musical Instruments and Performers

2018 Fall CTP431: Music and Audio Computing Fundamentals of Musical Acoustics

A Composition for Clarinet and Real-Time Signal Processing: Using Max on the IRCAM Signal Processing Workstation

Edit Menu. To Change a Parameter Place the cursor below the parameter field. Rotate the Data Entry Control to change the parameter value.

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF)

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

Music 209 Advanced Topics in Computer Music Lecture 4 Time Warping

Engineering in Recording

Toward a Computationally-Enhanced Acoustic Grand Piano

cryo user manual & license agreement

Digital audio and computer music. COS 116, Spring 2012 Guest lecture: Rebecca Fiebrink

Multirate Digital Signal Processing

For sforzando. User Manual

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB

Implementation of an 8-Channel Real-Time Spontaneous-Input Time Expander/Compressor

ELEC 484 Project Pitch Synchronous Overlap-Add

Music Complexity Descriptors. Matt Stabile June 6 th, 2008

VISUALIZING AND CONTROLLING SOUND WITH GRAPHICAL INTERFACES

Topic 4. Single Pitch Detection

Proceedings of Meetings on Acoustics

TYING SEMANTIC LABELS TO COMPUTATIONAL DESCRIPTORS OF SIMILAR TIMBRES

ADSR AMP. ENVELOPE. Moog Music s Guide To Analog Synthesized Percussion. The First Step COMMON VOLUME ENVELOPES

CTP431- Music and Audio Computing Musical Acoustics. Graduate School of Culture Technology KAIST Juhan Nam

Prosoniq Magenta Realtime Resynthesis Plugin for VST

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

HST 725 Music Perception & Cognition Assignment #1 =================================================================

Scoregram: Displaying Gross Timbre Information from a Score

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam

2 2. Melody description The MPEG-7 standard distinguishes three types of attributes related to melody: the fundamental frequency LLD associated to a t

AUD 6306 Speech Science

Credits:! Product Idea: Tilman Hahn Product Design: Tilman Hahn & Dietrich Pank Product built by: Dietrich Pank Gui Design: Benjamin Diez

MHSIB.5 Composing and arranging music within specified guidelines a. Creates music incorporating expressive elements.

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

Timbre perception

UNIVERSITY OF DUBLIN TRINITY COLLEGE

Tempo and Beat Tracking

It is increasingly possible either to

Tiptop audio z-dsp.

Vocal Processor. Operating instructions. English

This project will work with two different areas in digital signal processing: Image Processing Sound Processing

The Tone Height of Multiharmonic Sounds. Introduction

Perceptual dimensions of short audio clips and corresponding timbre features

Cristina Bachmann, Heiko Bischoff, Marion Bröer, Sabine Pfeifer The information in this document is subject to change without notice and does not

Short Set. The following musical variables are indicated in individual staves in the score:

Presents. Crystal Glasses V3. for NI KONTAKT 4+ Go to Index: 2

Cathedral user guide & reference manual

COMBINING SOUND- AND PITCH-BASED NOTATION FOR TEACHING AND COMPOSITION

Perceptual Considerations in Designing and Fitting Hearing Aids for Music Published on Friday, 14 March :01

FX Basics. Time Effects STOMPBOX DESIGN WORKSHOP. Esteban Maestre. CCRMA Stanford University July 2011

Combining Instrument and Performance Models for High-Quality Music Synthesis

2. AN INTROSPECTION OF THE MORPHING PROCESS

LEARNING TO CONTROL A REVERBERATOR USING SUBJECTIVE PERCEPTUAL DESCRIPTORS

Efficient Computer-Aided Pitch Track and Note Estimation for Scientific Applications. Matthias Mauch Chris Cannam György Fazekas

RetroMod 106 USER GUIDE. TRACKTION COMPANY

Acoustic Scene Classification

Creating a Feature Vector to Identify Similarity between MIDI Files

Spectral toolkit: practical music technology for spectralism-curious composers MICHAEL NORRIS

MUSI-6201 Computational Music Analysis

GESTURALLY-CONTROLLED DIGITAL AUDIO EFFECTS. Marcelo M. Wanderley and Philippe Depalle

Crash Course in Digital Signal Processing

T.R.A.X. SPAT. Real-time Voice and Sonic Modeling Processor. by by. Multiformat Room Acoustic Simulation & Localization Processor

Rhythm and Melody Aspects of Language and Music

Distributed Digital Music Archives and Libraries (DDMAL)

A SYSTEM FOR MUSICAL IMPROVISATION COMBINING SONIC GESTURE RECOGNITION AND GENETIC ALGORITHMS

Banff Sketches. for MIDI piano and interactive music system Robert Rowe

R H Y T H M G E N E R A T O R. User Guide. Version 1.3.0

Reference Manual. How to Use This Manual. Contents List. Search Function. Previous Page View/Next Page View

Singing voice synthesis in Spanish by concatenation of syllables based on the TD-PSOLA algorithm

Pitch Perception. Roger Shepard

Evaluating trained singers tone quality and the effect of changing focus of attention on performance

SYNTHESIS FROM MUSICAL INSTRUMENT CHARACTER MAPS

Music 175: Pitch II. Tamara Smyth, Department of Music, University of California, San Diego (UCSD) June 2, 2015

Topic 1. Auditory Scene Analysis

TOWARDS IMPROVING ONSET DETECTION ACCURACY IN NON- PERCUSSIVE SOUNDS USING MULTIMODAL FUSION

Information in this manual is subject to change without notice and does not represent a commitment on the part of Applied Acoustics Systems DVM Inc.

Timbre blending of wind instruments: acoustics and perception

Synthesizing a choir in real-time using Pitch Synchronous Overlap Add (PSOLA)

Introductions to Music Information Retrieval

SINGING PITCH EXTRACTION BY VOICE VIBRATO/TREMOLO ESTIMATION AND INSTRUMENT PARTIAL DELETION

The NORD MODULAR G2 demo software

Transcription:

An interdisciplinary approach to audio effect classification Vincent Verfaille, Catherine Guastavino Caroline Traube, SPCL / CIRMMT, McGill University GSLIS / CIRMMT, McGill University LIAM / OICM, Université de Montréal C I R M M T Centre for Interdisciplinary Research in Music Media and Technology Montréal, Qc, Canada Sept. 18, 2006

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 1 Introduction Motivation audio effets: tools used by composers, performers, sound engineers to modify sounds = "effect" = technique (cause) vs. "effect" on perception [Verfaille et al., IEEE-TASLP, 2006] generally classified on the basis of underlying techniques musicians rely on perceptual attributes = gap between techniques & perception = poor communication between researchers and artists

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 2 Introduction An interdisciplinary approach Goal: to link various types of classifications based on: underlying techniques type of control perceptual attributes intersection between: digital signal processing acoustics auditory perception and cognition psycholinguistics

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 3 Existing discipline-specific classifications Based on: 1 underlying techniques [Moore, 1990; Orfanidis, 1996; DePoli et al., 1996; Roads, 1996; Zoelzer, 2002] 1.1 analog technologies 1.2 implementation techniques 1.3 domain of application / processing type 1.4 operations applied to a model 2 type of control 3 perceptual attributes

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 4 1. Classifications based on underlying techniques 1. Classifications based on underlying techniques 1.1 Analog technologies mechanics/acoustics e.g. musical instruments, effects due to room acoustics electromechanics e.g. vinyls: pitch-shifting by changing rotation speed electromagnetics e.g. magnetic tapes: flanging electronics e.g. filters, vocoder, ring modulators

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 5 1. Classifications based on underlying techniques 1. Classifications based on underlying techniques 1.2 Implementation techniques, from [Zoelzer, 2002] filters delays modulators and demodulators nonlinear processing spatial effects time-segment processing, e.g. SOLA, PSOLA time-frequency processing, e.g. phase vocoder source-filter processing, e.g. LPC spectral processing, e.g. sin + noise time and frequency warping

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 6 1. Classifications based on underlying techniques 1. Classifications based on underlying techniques 1.3 Domain of application and processing type time domain: block processing (e.g. OLA, SOLA, PSOLA) sample processing (e.g. delay line, nonlinear processing) frequency domain (block processing): frequency domain synthesis (IFFT) (e.g. phase vocoder) time domain synthesis (oscillator bank) time and frequency domain (e.g. phase vocoder + LPC) = choice depends on the artifacts

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 7 1. Classifications based on underlying techniques 1. Classifications based on underlying techniques 1.4 Operations applied to a model e.g. source-filter model based audio effects: [Verfaille & Depalle, DAFx-04] basic operations: scale, shift, warp, multiply, interpolate applied to the filter, the source or both components Filter Source Signal Components Warp Multiply Identity. Warp Scale Shift Interp Scale Shift Math. Operators Gender Change Equalizer Spectral Panning Vocoding Donald Duck Robotization Robotization -Shift Ring-Mod. Inharmonizer Audio Effects

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 8 1. Classifications based on underlying techniques 1. Classifications based on underlying techniques Pros: see technical similarities of various effects better understand / implement multi-effects Cons: audio effects may appear in more than one class steep learning curve for non-dsp experts non-intuitive for musicians

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 9 2. Classification based on the control type 2. Classification based on the control type from [Verfaille, 2003; Verfaille et al., JNMR 2006] constant variable, provided by: wave generators: periodic or low frequency oscillator (LFO) other generators: gestural control: realtime user-defined automation: offline user-defined adaptive: sound-defined

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 10 2. Classification based on the control type 2. Classification based on the type of control Pros: complements previous classifications appeals to developers, performers and composers defines a general framework to design new audio effects, e.g. adaptive audio effects [Verfaille et al., IEEE-TASLP, 2006] Cons: useful mainly in a HCI & real-time context no link to implementation techniques / perception

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification Quizz: what do you hear? Quizz: what do you hear? Sound examples from [Verfaille, 2003] bell from Varèse s Poème Électronique P freq.-dependent tremoli controlled by C(f ) = fν=0 S(t, ν) = tremolo? flanging? both? implementation technique + control type Sylvain Boeuf s Like Someone In Love adaptive time-scaling + synchronization points (both) control type + sound feature = performed differently We need to take perception into account 11

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 12 3. Classification based on perceptual attributes 3. Classification based on perceptual attributes Modified perceptual attribute(s) [Amatriain et al., JNMR, 2003] pitch: e.g. melody, intonation, harmony loudness: e.g. dynamics, tremolo time: e.g. duration, rhythm space: e.g. localization, room effect timbre: e.g. formants, brightness, texture

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 13 3. Classification based on perceptual attributes 3. Classification based on perceptual attributes Examples of effects modifying timbre: [Verfaille et al., JNMR, 2006] DAFx name Perceptual Attr. Control Main Other chorus T random equalizer T L filter T L flanger T P LFO spectrum shift T P adaptive ring modulation T P A comb filter T L,P resonant filter T L,P wah-wah T L,P

amplification Time Timbre compressor timbral metamorphosis Loudness Timbre Timbre expander noise gate gender change Timbre Timbre limiter Timbre contrast nuance change Rhythm tremolo spectral tremolo Loudness Formants timbre morphing spectral envelope warping vocoder effect cross synthesis Loudness hybridization Timbre granular delay echo Room spectral interpolation reverberation distance mutation height scaling Voice quality Timbre panning azimuth spec. panning Loudness Doppler 3D binaural Localization Space spec. env. modifications shifting warping harmonics generator subharmonics generator scaling 3D transaural directivity change Directivity Directivity Localization Leslie / Rotary Audio Effects Timbre Spectrum shifting Harmonicity ring modulation SSB modulation Formants spectral ring modulation Formants Formants no formant preservation pitch-shifting spectral warping Harmonicity detune warping denoising Harmonicity inharmonizer Harmonicity autotune harmonizer declicking enhancer Brightness Time resampling distorsion Time prosody change Loudness Timbre intonation change Quality fuzz vibrato robotization vibrato preservation formants preservation tremolo preservation time-scaling Duration voice quality whisperization hoarseness martianization attack preservation resampling Time centroid change Brightness comb filter swing change Rhythm Timbre time-shuffling Filter resonant filter telephon effect Timbre inversion chorus, flanger, phase, wah-wah

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 15 3. Classification based on perceptual attributes 3. Classification based on perceptual attributes Pros: complements to previous classifications appeals to all listeners represents artifacts (e.g. time-scaling) Cons: one effect can modify several attributes (control-dependent) difficult to find a graphical representation

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 16 Interdisciplinary audio effect classification What for? Interdisciplinary audio effect classification = links discipline-specific classifications: semantic descriptors perceptual attributes control type operation / processing applied processing domains digital implementation techniques

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 17 Interdisciplinary audio effect classification An example with chorus Interdisciplinary audio effect classification Chorus implementations: white noise controlling delay line(s) length modulation mixing pitch-shifted & time-scaled versions Semantic Descriptors Warm Sound Several Performers Perceptual Attribute Control Type Timbre White Noise Chorus Effect Applied Processing Transposition Time-Scaling Resampling Processing Domain Digital Implementation Technique Time- Frequency Phase Vocoder Time SOLA Time Delay Line

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification Interdisciplinary audio effect classification An example with adaptive-time scaling Interdisciplinary audio effect classification Adaptive time-scaling implementation: sound-defined control timbre duration Warm Sound Semantic Descriptors Several Performers Perceptual Attribute Duration Control Type Adaptive A-time-scaling Effect Applied Processing Transposition Time-Scaling Resampling Processing Domain TimeFrequency Time Time Digital Implementation Technique Phase Vocoder SOLA Delay Line 18

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 19 Interdisciplinary audio effect classification Pros and cons Interdisciplinary audio effect classification Pros: combines different standpoints links layers of discipline-specific features compact representation of audio effects Cons: using a shoehorn to fit an elephant in a glass collaborative efforts

V. Verfaille, C. Guastavino & C. Traube An interdisciplinary approach to audio effect classification 20 Conclusions Conclusions review existing classifications introduce transverse classification: from signal processing to semantics best meet the need of a wider variety of users implications for teaching and knowledge sharing design of more intuitive user interfaces future directions: correlate verbal descriptors and lower-level attributes develop navigation tools (Wiki, trees) retrieve information

An interdisciplinary approach to audio effect classification Vincent Verfaille, Catherine Guastavino Caroline Traube, SPCL / CIRMMT, McGill University GSLIS / CIRMMT, McGill University LIAM / OICM, Université de Montréal C I R M M T Centre for Interdisciplinary Research in Music Media and Technology Montréal, Qc, Canada Sept. 18, 2006