MUSI-6201 Computational Music Analysis

Similar documents
CTP431- Music and Audio Computing Musical Acoustics. Graduate School of Culture Technology KAIST Juhan Nam

Music Representations

REAL-TIME VISUALISATION OF LOUDNESS ALONG DIFFERENT TIME SCALES

The Physics Of Sound. Why do we hear what we hear? (Turn on your speakers)

Psychoacoustics. lecturer:

TO HONOR STEVENS AND REPEAL HIS LAW (FOR THE AUDITORY STSTEM)

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2

MUSI-6201 Computational Music Analysis

DIFFERENCES IN TRAFFIC NOISE MEASUREMENTS WITH SLM AND BINAURAL RECORDING HEAD

We realize that this is really small, if we consider that the atmospheric pressure 2 is

Chapter 24. Meeting 24, Dithering and Mastering

Vibration Measurement and Analysis

Experiments on tone adjustments

Loudness of transmitted speech signals for SWB and FB applications

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF)

CTP 431 Music and Audio Computing. Basic Acoustics. Graduate School of Culture Technology (GSCT) Juhan Nam

Sonoris Meter VST 2.0

Why do some concert halls render music more expressive and impressive than others?

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high.


Meter Madness Mike Rivers

Lab P-6: Synthesis of Sinusoidal Signals A Music Illusion. A k cos.! k t C k / (1)

Music Representations

Music Complexity Descriptors. Matt Stabile June 6 th, 2008

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

Experiment 9 Analog/Digital Conversion

Musical Signal Processing with LabVIEW Introduction to Audio and Musical Signals. By: Ed Doering

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

Math and Music: The Science of Sound

Analysing Room Impulse Responses with Psychoacoustical Algorithms: A Preliminary Study

F600A COMPRESSOR Operating Manual

9.35 Sensation And Perception Spring 2009

CHAPTER 3 AUDIO MIXER DIGITAL AUDIO PRODUCTION [IP3038PA]

Quarterly Progress and Status Report. An attempt to predict the masking effect of vowel spectra

Course Web site:

Collection of Setups for Measurements with the R&S UPV and R&S UPP Audio Analyzers. Application Note. Products:

DK Meter Audio & Loudness Metering Complete. Safe & Sound

Signal Processing. Case Study - 3. It s Too Loud. Hardware. Sound Levels

Getting Started with the LabVIEW Sound and Vibration Toolkit

Electrical Stimulation of the Cochlea to Reduce Tinnitus. Richard S. Tyler, Ph.D. Overview

Pitch Perception and Grouping. HST.723 Neural Coding and Perception of Sound

Abbey Road TG Mastering Chain User Guide

Temporal summation of loudness as a function of frequency and temporal pattern

The Cocktail Party Effect. Binaural Masking. The Precedence Effect. Music 175: Time and Space

Lecture 2 What we hear: Basic dimensions of auditory experience

R G Alcorn, W C Beattie. The Queen s University of Belfast

Audio Editing. Developed by. Allama Iqbal Open University, Islamabad, Pakistan. In association with

Standard Definition. Commercial File Delivery. Technical Specifications

What is proximity, how do early reflections and reverberation affect it, and can it be studied with LOC and existing binaural data?

UNIVERSITY OF DUBLIN TRINITY COLLEGE

TABLE OF CONTENTS TC SUPPORT INTERACTIVE LM5 & LM5D -TDM / 1

Proceedings of Meetings on Acoustics

LDM24 Digital Monitoring Adapter Manual

Why We Measure Loudness

Contents. Welcome to LCAST. System Requirements. Compatibility. Installation and Authorization. Loudness Metering. True-Peak Metering

DRAFT RELEASE FOR BETA EVALUATION ONLY

Evaluation of the Technical Level of Saxophone Performers by Considering the Evolution of Spectral Parameters of the Sound

UNIT 1: QUALITIES OF SOUND. DURATION (RHYTHM)

La Salle University. I. Listening Answer the following questions about the various works we have listened to in the course so far.

Syrah. Flux All 1rights reserved

Perceptual Mixing for Musical Production

Music 175: Pitch II. Tamara Smyth, Department of Music, University of California, San Diego (UCSD) June 2, 2015

MASTER'S THESIS. Listener Envelopment

Query By Humming: Finding Songs in a Polyphonic Database

RECOMMENDATION ITU-R BT *, ** Planning criteria for digital terrestrial television services in the VHF/UHF bands

A SEMANTIC DIFFERENTIAL STUDY OF LOW AMPLITUDE SUPERSONIC AIRCRAFT NOISE AND OTHER TRANSIENT SOUNDS

Concert halls conveyors of musical expressions

TECHNICAL MEDIA SPECIFICATION ON THE FILE BASED SUBMISSION OF MATERIALS TO BE AIRED

LOUDNESS EFFECT OF THE DIFFERENT TONES ON THE TIMBRE SUBJECTIVE PERCEPTION EXPERIMENT OF ERHU

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

DTS Neural Mono2Stereo

Sound Measurement. V2: 10 Nov 2011 WHITE PAPER. IMAGE PROCESSING TECHNIQUES

DETECTING ENVIRONMENTAL NOISE WITH BASIC TOOLS

Absolute Perceived Loudness of Speech

2 MHz Lock-In Amplifier

Automatic music transcription

Tempo and Beat Tracking

Fourier Transforms 1D

Tempo and Beat Analysis

Noise evaluation based on loudness-perception characteristics of older adults

Interior and Motorbay sound quality evaluation of full electric and hybrid-electric vehicles based on psychoacoustics

Version 1.2 User Manual

Ch. 1: Audio/Image/Video Fundamentals Multimedia Systems. School of Electrical Engineering and Computer Science Oregon State University

SpectraFoo v4.0 User Manual METRIC HALO Copyright Metric Halo Distribution, Inc. All rights reserved.

Relation between violin timbre and harmony overtone

Using the ITU BS and CBS Loudness Meters to Measure Automatic Loudness Controller Performance

Audio Metering Measurements, Standards, and Practice (2 nd Edition) Eddy Bøgh Brixen

INSTRUCTION SHEET FOR NOISE MEASUREMENT

Foundations and Theory

Amplitude and Loudness 1

COZI TV: Commercials: commercial instructions for COZI TV to: Diane Hernandez-Feliciano Phone:

Lecture 2 Video Formation and Representation

EUROPA I PREAMPLIFIER QUICK START GUIDE Dave Hill Designs version

Semi-automated extraction of expressive performance information from acoustic recordings of piano music. Andrew Earis

The Lecture Contains: Frequency Response of the Human Visual System: Temporal Vision: Consequences of persistence of vision: Objectives_template

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

AM-4 Audio Monitor. Videoquip Research Limited 595 Middlefield Road, Unit #4 Scarborough, Ontario, Canada. MIV 3S2

EBU Digital AV Sync and Operational Test Pattern

Audio Feature Extraction for Corpus Analysis

Transcription:

MUSI-6201 Computational Music Analysis Part 5.1: Intensity alexander lerch November 4, 2015

instantaneous features overview text book Chapter 4: Intensity (pp. 71 78) sources: slides (latex) & Matlab github repository lecture content loudness perception and decibels dynamics in music instantaneous features

instantaneous features overview text book Chapter 4: Intensity (pp. 71 78) sources: slides (latex) & Matlab github repository lecture content loudness perception and decibels dynamics in music instantaneous features

instantaneous features overview text book Chapter 4: Intensity (pp. 71 78) sources: slides (latex) & Matlab github repository lecture content loudness perception and decibels dynamics in music instantaneous features

instantaneous features overview text book Chapter 4: Intensity (pp. 71 78) sources: slides (latex) & Matlab github repository lecture content loudness perception and decibels dynamics in music instantaneous features

introduction intensity-related descriptors commonly used waveform view 1 1 1 x(t) (pop) 0.5 0-0.5 x(t) (stringquartet) 0.5 0-0.5 x(t) (speech) 0.5 0-0.5-1 -1-1 0 5 10 0 5 10 0 5 10 t [s] t [s] t [s] level monitoring (PPM, VU,... ) terms and definitions

introduction intensity-related descriptors commonly used waveform view 1 1 1 x(t) (pop) 0.5 0-0.5 x(t) (stringquartet) 0.5 0-0.5 x(t) (speech) 0.5 0-0.5-1 -1-1 0 5 10 0 5 10 0 5 10 t [s] t [s] t [s] level monitoring (PPM, VU,... ) terms and definitions magnitude intensity envelope level volume velocity loudness

human perception 1/2 perception has non-linear relation to intensity: model: logarithmic relation v db (n) = 20 log 10 ( v(n) v 0 v 0 : reference constant (0 db point) digital: v 0 = 1 dbfs scaling factor: 1 db JNDL )

human perception 1/2 perception has non-linear relation to intensity: model: logarithmic relation v db (n) = 20 log 10 ( v(n) v 0 v 0 : reference constant (0 db point) digital: v 0 = 1 dbfs scaling factor: 1 db JNDL )

human perception 1/2 perception has non-linear relation to intensity: model: logarithmic relation v db (n) = 20 log 10 ( v(n) v 0 v 0 : reference constant (0 db point) digital: v 0 = 1 dbfs scaling factor: 1 db JNDL )

excursion: level computation if v(n) = 0 : computation of log 10 (0) work-arounds a add constant ɛ v db (n) = 20 log 10 (v(n) + ɛ) b add if statement v trunc (n) = { v(n), if v(n) ɛ ɛ, otherwise

excursion: level computation if v(n) = 0 : computation of log 10 (0) work-arounds a add constant ɛ v db v db [db] 20 15 10 5 v db (n) = 20 log 10 (v(n) + ɛ) ǫ = 1e 01 ǫ = 1e 02 ǫ = 1e 03 ǫ = 1e 04 matlab source: matlab/displaylogepsilon.m 0-40 -35-30 -25-20 -15-10 -5 0 v db [db]

excursion: level computation if v(n) = 0 : computation of log 10 (0) work-arounds a add constant ɛ v db (n) = 20 log 10 (v(n) + ɛ) b add if statement v trunc (n) = { v(n), if v(n) ɛ ɛ, otherwise

human perception 2/2 decibel scale is not loudness scale: equal-sized steps on the decibel scale not perceived as equal-sized loudness steps perceptual loudness depends on frequency cochlear resolution masking effects

human perception 2/2 decibel scale is not loudness scale: equal-sized steps on the decibel scale not perceived as equal-sized loudness steps perceptual loudness depends on frequency cochlear resolution masking effects

human perception 2/2 120 SPL [db] 100 80 60 40 20 0 90 phon 80 phon 60 phon 40 phon 20 phon 0 phon 10 2 10 3 10 4 f [Hz] matlab source: matlab/displayequalloudnesscontours.m

dynamics in music score: only several rough dynamic steps,e.g.: pp, p, mf, f, ff comparably vague instructions on volume modifications, e.g.: crescendo, decrescendo, sf dynamics influenced by instrumentation timbre number of voices context and musical tension MIDI: 128 velocity steps no standardized relation to magnitude, power,...

dynamics in music score: only several rough dynamic steps,e.g.: pp, p, mf, f, ff comparably vague instructions on volume modifications, e.g.: crescendo, decrescendo, sf dynamics influenced by instrumentation timbre number of voices context and musical tension MIDI: 128 velocity steps no standardized relation to magnitude, power,...

dynamics in music score: only several rough dynamic steps,e.g.: pp, p, mf, f, ff comparably vague instructions on volume modifications, e.g.: crescendo, decrescendo, sf dynamics influenced by instrumentation timbre number of voices context and musical tension MIDI: 128 velocity steps no standardized relation to magnitude, power,...

dynamics in music score: only several rough dynamic steps,e.g.: pp, p, mf, f, ff comparably vague instructions on volume modifications, e.g.: crescendo, decrescendo, sf dynamics influenced by instrumentation timbre number of voices context and musical tension MIDI: 128 velocity steps no standardized relation to magnitude, power,...

features: root mean square 1/2 v RMS (n) = 1 K i e(n) i=i s(n) x(i) 2

features: root mean square 1/2 v RMS (n) = 1 K i e(n) i=i s(n) x(i) 2 value of this feature for the hypothetical prototype signals silence sinusoidal (Amplitude A)

features: root mean square 1/2 v RMS (n) = 1 K i e(n) i=i s(n) x(i) 2 f [khz] 10 5 0 1 5 10 15 20 25 matlab source: matlab/displayfeatures.m x(i) 0-50 v(n) -1 0 5 10 15 20 25 t [s]

features: root mean square 2/2 common variants (sample processing only): reduce computational complexity vrms(n) 2 = x(ie(n))2 x(i s(n 1)) 2 + vrms(n 2 1) i e(n) i s(n) + 1 v RMS(n) = vrms 2 (n) single pole approximation v tmp(i) = α v tmp(i 1) + (1 α) x(i) 2 v RMS(i) = v tmp(i)

features: root mean square 2/2 common variants (sample processing only): reduce computational complexity vrms(n) 2 = x(ie(n))2 x(i s(n 1)) 2 + vrms(n 2 1) i e(n) i s(n) + 1 v RMS(n) = vrms 2 (n) single pole approximation v tmp(i) = α v tmp(i 1) + (1 α) x(i) 2 v RMS(i) = v tmp(i)

features: weighted root mean square x(i) H(z) RMS v(n) H(z): A, B, C weighting RLB (BS.1770)...

features: weighted root mean square x(i) H(z) RMS v(n) H(z): A, B, C weighting RLB (BS.1770)...

features: weighted root mean square x(i) H(z) RMS v(n) H(z): A, B, C weighting RLB (BS.1770)... H(f) [db] 10 0-10 -20 BS.1770 MC ITU-R BS.468 A Weighting C Weighting Z Weighting matlab source: matlab/displayloudnessweighting.m 10 2 10 3 10 4 f [Hz]

features: peak envelope (max) v Peak (n) = max x(i) i s(n) i i e(n)

features: peak envelope (max) v Peak (n) = max x(i) i s(n) i i e(n) x(i) f [khz] 10 5 0 1 0 5 10 15 20 25 v(n) matlab source: matlab/displayfeatures.m -1 0 5 10 15 20 25 t [s] -50

features: peak envelope (PPM) 1/2 x(i) x(i) α AT λ v PPM(i) z 1

features: peak envelope (PPM) 1/2 x(i) x(i) α AT λ v PPM(i) z 1 release state ( x(i) < v PPM (i 1) λ = α RT )

features: peak envelope (PPM) 1/2 x(i) x(i) α AT λ v PPM(i) z 1 release state ( x(i) < v PPM (i 1) λ = α RT ) v PPM (i) = v PPM (i 1) α RT v PPM (i 1) = (1 α RT ) v PPM (i 1)

features: peak envelope (PPM) 1/2 x(i) x(i) α AT λ v PPM(i) z 1 attack state ( x(i) v PPM (i 1) λ = 0)

features: peak envelope (PPM) 1/2 x(i) x(i) α AT λ v PPM(i) z 1 attack state ( x(i) v PPM (i 1) λ = 0) v PPM (i) = α AT ( x(i) v PPM (i 1) ) + v PPM (i 1) = α AT x(i) + (1 α AT ) v PPM (i 1)

features: peak envelope (PPM) 2/2 10 f [khz] x(i) 5 0 1 0 5 10 15 20 25-1 0 5 10 15 20 25 t [s] discuss differences between peak meter and max per block -50 v(n) matlab source: matlab/displayfeatures.m

features: peak envelope (PPM) 2/2 10 f [khz] x(i) 5 0 1 0 5 10 15 20 25-1 0 5 10 15 20 25 t [s] discuss differences between peak meter and max per block -50 v(n) matlab source: matlab/displayfeatures.m

features: peak envelope (PPM) 2/2 10 f [khz] x(i) 5 0 1 0 5 10 15 20 25-1 0 5 10 15 20 25 t [s] discuss differences between peak meter and max per block -50 v(n) matlab source: matlab/displayfeatures.m

features: zwicker loudness Stimulus Outer Ear Transfer Function Excitation Patterns Specific Loudness Overall Loudness v Loud

features: zwicker loudness Stimulus Outer Ear Transfer Function Excitation Patterns Specific Loudness Overall Loudness v Loud outer ear transfer function 1 1 D. Hammershøi and H. Møller, Methods for Binaural Recording and Reproduction, Acta Acustica united with Acustica, vol. 88, no. 3, pp. 303 311, May 2002.

features: zwicker loudness Stimulus Outer Ear Transfer Function Excitation Patterns Specific Loudness Overall Loudness v Loud excitation patterns 1 1 M. Schleske, Vibrato of the musician, [Online]. Available: http://www.schleske.de/en/our-research/handbook-violinacoustics/vibrato-of-the-musician.html (visited on 07/29/2015).

features: zwicker loudness Stimulus Outer Ear Transfer Function Excitation Patterns Specific Loudness Overall Loudness v Loud specific loudness 1 1 U. of Salford, Customised metrics, [Online]. Available: https://www.salford.ac.uk/computing-science- engineering/research/acoustics/psychoacoustics/sound-quality-making-products-sound-

features: zwicker loudness Stimulus Outer Ear Transfer Function Excitation Patterns Specific Loudness Overall Loudness v Loud overall loudness v loud = i z i

derived features number or ratio of pauses dynamic range statistical features from (RMS) histogram...

summary lecture content 1 why are intensity-related features often in db 2 how does the db-scale relate to loudness 3 what are typical intensity-related features

summary lecture content 1 why are intensity-related features often in db 2 how does the db-scale relate to loudness 3 what are typical intensity-related features

summary lecture content 1 why are intensity-related features often in db 2 how does the db-scale relate to loudness 3 what are typical intensity-related features