The Trinity Church Videos: An Audio Analysis

Similar documents
AMEK SYSTEM 9098 DUAL MIC AMPLIFIER (DMA) by RUPERT NEVE the Designer

THE DIGITAL DELAY ADVANTAGE A guide to using Digital Delays. Synchronize loudspeakers Eliminate comb filter distortion Align acoustic image.

FPFV-285/585 PRODUCTION SOUND Fall 2018 CRITICAL LISTENING Assignment

Mixing in the Box A detailed look at some of the myths and legends surrounding Pro Tools' mix bus.

XB-14 Quick Operation Manual V1 23/10/2013

DTS Neural Mono2Stereo

SREV1 Sampling Guide. An Introduction to Impulse-response Sampling with the SREV1 Sampling Reverberator

DESIGNING OPTIMIZED MICROPHONE BEAMFORMERS

SoundField SurroundZone2. User Guide Version 1.0

Audio Metering Measurements, Standards, and Practice (2 nd Edition) Eddy Bøgh Brixen

The Cocktail Party Effect. Binaural Masking. The Precedence Effect. Music 175: Time and Space

BACHELOR THESIS. Stereo Microphone Techniques in Drum Recording

Voxengo PHA-979 User Guide

A few white papers on various. Digital Signal Processing algorithms. used in the DAC501 / DAC502 units

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF)

Studio Notes. SQN-3 Mono Microphone Mixer

Edison Revisited. by Scott Cannon. Advisors: Dr. Jonathan Berger and Dr. Julius Smith. Stanford Electrical Engineering 2002 Summer REU Program

USER MANUAL FOR DDT 2D. Introduction. Installation. Getting Started. Danley Design Tool 2d. Welcome to the Danley Design Tool 2D program.

Musical Acoustics Lecture 15 Pitch & Frequency (Psycho-Acoustics)

Recording to Tape (Analogue or Digital)...10

White Paper JBL s LSR Principle, RMC (Room Mode Correction) and the Monitoring Environment by John Eargle. Introduction and Background:

CATHODE-RAY OSCILLOSCOPE (CRO)

How Close Can They be Stacked? By K0CQ. How close can yagis for harmonically related bands be stacked? What are the consequences of close stacking?

Overview. A 16 channel frame is shown.

Measurement of overtone frequencies of a toy piano and perception of its pitch

X-Panda User s Guide

Eventide Inc. One Alsan Way Little Ferry, NJ

Active MW Vertical Arrays

Electrical and Electronic Laboratory Faculty of Engineering Chulalongkorn University. Cathode-Ray Oscilloscope (CRO)

How Close Can They be Stacked? By K0CQ. How close can yagis for harmonically related bands be stacked? What are the consequences of close stacking?

spiff manual version 1.0 oeksound spiff adaptive transient processor User Manual

Experiment 9A: Magnetism/The Oscilloscope

The Ideal Videoconferencing Room

CR-6 MIXER USER MANUAL ENGLISH. Order Code: MIXE01

Technical Guide. Installed Sound. Loudspeaker Solutions for Worship Spaces. TA-4 Version 1.2 April, Why loudspeakers at all?

Lab #10 Perception of Rhythm and Timing

COMPOSITE VIDEO LUMINANCE METER MODEL VLM-40 LUMINANCE MODEL VLM-40 NTSC TECHNICAL INSTRUCTION MANUAL

Voxengo Soniformer User Guide

SOFTWARE INSTRUCTIONS REAL-TIME STEERING ARRAY MICROPHONES AM-1B AM-1W

Audio-Based Video Editing with Two-Channel Microphone

LPFM LOW POWER FM EQUIPMENT GUIDE

E X P E R I M E N T 1

Mac OS X Windows AAX/Audio Suite/Audio Units/VST 32/64 bit. redline monitor

FAQ. Mixing Features Phoenixville Pike, Suite 201 West Chester, PA Voice: Fax:

SOUND REINFORCEMENT APPLICATIONS

Directional Antenna MHz K , SITEL Caponago Tel.02 /

SPL Analog Code Plug-ins Manual Classic & Dual-Band De-Essers

The Cathode Ray Tube

CHAPTER 3 AUDIO MIXER DIGITAL AUDIO PRODUCTION [IP3038PA]

Acoustic Measurements Using Common Computer Accessories: Do Try This at Home. Dale H. Litwhiler, Terrance D. Lovell

Introduction 3/5/13 2

Quantitative Assessment of Surround Compatibility

Syrah. Flux All 1rights reserved

CATHODE RAY OSCILLOSCOPE. Basic block diagrams Principle of operation Measurement of voltage, current and frequency

Using Extra Loudspeakers and Sound Reinforcement

The basic concept of the VSC-2 hardware

OSCILLOSCOPE AND DIGITAL MULTIMETER

POSITIONING SUBWOOFERS

NTG-1 Instruction Manual

BeoVision Televisions

Precision DeEsser Users Guide

How to Obtain a Good Stereo Sound Stage in Cars

Lecture 5: Frequency Musicians describe sustained, musical tones in terms of three quantities:

THE MIXING DESK. Colin Owen.

Mixers. The functions of a mixer are simple: 1) Process input signals with amplification and EQ, and 2) Combine those signals in a variety of ways.

Why Movies Are Not Too Loud

Technical Instruction Manual

LDG M-7600 External Meter for Icom IC-7600

RoomMatch RM and RM TECHNICAL DATA SHEET. asymmetrical array modules. Key Features. Product Overview. Technical Specifications

Proceedings of Meetings on Acoustics

MIC CUE MIC 1 MIC 2 TREBLE BASS MIC1/2 Tone Controls GROUND

Difficult acoustic environments? Maintaining voice intelligibility

EXHIBIT BASIS OF DESIGN REPORT. Space Type: Space Usage: Off-center of room to match side screen. Location

Calibrating the timecode signal input

Quick Start for TrueRTA (v3.5) on Windows XP (and earlier)

HARPEXh. version 1.4. manual. Copyright Harpex Ltd. t t p : / / h a r p e x. n e t

Sound Design, Music, and Recording

PLAYMATE PROFESSIONAL STEREO 19 MIXER, USB/SD CARD AND BLUETOOTH PLAYER. User Guide and Reference Manual. page 1

Sound technology. TNGD10 - Moving media

CBT 100LA Constant Beamwidth Technology

Using an oscilloscope - The Hameg 203-6

BTV Tuesday 21 November 2006

Simple Harmonic Motion: What is a Sound Spectrum?

MP212 Principles of Audio Technology II

LIVE SOUND SUBWOOFER DR. ADAM J. HILL COLLEGE OF ENGINEERING & TECHNOLOGY, UNIVERSITY OF DERBY, UK GAND CONCERT SOUND, CHICAGO, USA 20 OCTOBER 2017

Activity 1A: The Power of Sound

Using Extra Loudspeakers and Sound Reinforcement

Effectively Managing Sound in Museum Exhibits. by Steve Haas

CDM10: Channel USB Mixer. Item ref: UK User Manual

DETECTING ENVIRONMENTAL NOISE WITH BASIC TOOLS

Multi-Purpose Auditorium Sound Reinforcement System Design ECE Spring 2017

New recording techniques for solo double bass

S op o e p C on o t n rol o s L arni n n i g n g O bj b e j ctiv i e v s

FC Cincinnati Stadium Environmental Noise Model

REQUIRED MATERIALS Text: Alten, S. (2014). Audio in Media (10 th Ed.). Belmont:Wadsworth.

Aphro-V1 Digital reverb & fx processor..

THE KARLSON REPRODUCER

y AW4416 Audio Workstation Signal Flow Tutorial

Product Number : VZ570 Product Name : As of August Specifications and appearance are subject to change without notice.

Transcription:

The Trinity Church Videos: An Audio Analysis A video clip which is claimed to be evidence of the explosive demolition of World Trade Center 2 is circulating on the Internet. It is hosted at http://www.mediumrecords.com/wtc/audio01.html#trinity and appears to have been obtained from http://www.terrorize.dk/911/wtc2dem1/index2.php although its provenance is otherwise unknown. This is a review of that clip from the point of view of an experienced recording engineer and audio technician. This video shows about four seconds of the collapse of WTC2, shot from a point near Trinity Church. The audio track contains two loud explosion-like sounds which purport to be the sounds of explosive charges detonating within the building. By repeated, careful listening in a high quality studio control room I've identified some properties which strongly suggest that the audio is a forgery cobbled together from sound effects libraries. The evidence on which I base this opinion can be checked by anyone with a decent playback system (if you don't have a speaker system and listening room with known good stereo imaging a pair of high quality headphones will serve the purpose very well)- no high-tech toys are needed. The key is in the stereo imaging of the explosions, but before getting to those details it's necessary first to consider what conditions would have to be fulfilled if this clip were authentic. The amount of motion in the camera's viewpoint indicates a hand-held camcorder, consequently if the audio is authentic it would have to be pickup from a built-in or camera-mounted microphone. A quick listen is all that's necessary to establish that the recording is true stereo, not a monaural recording duplicated on both channels of a two-channel recording. Below is a screenshot of the waveform display of a portion of the second explosion, with the left-channel signal on top and the right-channel signal on the bottom. While the signals in both channels are similar, the existence of significant interchannel differences, and hence a stereo component are obvious. Figure 1: Waveform display of beginning of second explosion

If the recording is in stereo and if it is an authentic pickup of the sound at the camera location, then the recording must have been made by using stereo micing techniques. There are several standard ways of setting up a mic array for stereo pickup, which can be divided into "coincident" (mics mounted as close together as possible) and "spaced" (mics separated by anywhere from 17 cm, as in the semi-coincident ORTF method, to several feet, as in the "AB" technique popular among classical music recordists). Of these, the spaced-mic methods are too unwieldy for a practical camera-mounted mic (imagine schlepping a camera with mic booms sticking out a couple of feet on each side), which leaves the coincident methods. Stereo micing seeks to preserve information about the left-right position of a sound source so that when the recording is played back the listener can perceive the location of the source. In coincident micing this is done by using two directional microphones (a directional microphone is more sensitive to sound arriving from directly in front than to sound arriving from other directions) mounted as close together as possible and aimed to the left and right of center, usually 45 degrees left and 45 degrees right. The result of doing this is that if a sound source is located to the left of center it will be picked up more strongly by the left microphone than the right microphone, since it's closer to the front axis of the left mic than that of the right. When the recording is played back, that source will be reproduced louder in the left speaker than in the right, which will produce the impression that the source is located between the center and the left speaker. The greater the difference in amplitude between the left and right channels, the farther to the side the source will appear to be. While changing the included angle between the microphone front axes and the directional patterns of the mics can change the apparent width of the soundstage, coincident micing will always preserve information about the angular position of the sound sources.. What that means is that if this recording is authentic we can expect it to have preserved the spatial aspects of the acoustic environment in which it was made, including the characteristics of the path between the tower and the mics. The video shows a clear line of sight between the camera and the portion of the tower where the collapse begins. If the estimate of 450 meters from the camera to the part of the tower where the purported explosions occur is correct- and it does appear to be reasonable- then that part of the tower, which is seen from a viewpoint close to its diagonal, would subtend an angle of about 11 degrees as viewed from the camera. We would thus expect the direct sound from the tower to have a narrow stereo image, in fact to be every nearly mono, with any stereo information coming from reflections from nearby buildings. These reflections, because they will travel a longer distance than the direct sound, will arrive later and will also arrive from different directions. We can also expect the direct sound radiating from the tower to arrive unobstructed at the camera. To get a feel for how direct and reflected sounds are localized, try this experiment: the next time you're in a large reflective space, close your eyes and clap your hands- or better, get someone else to stand a little way in front of you and clap their hands. Pay close attention to where you perceive the handclap to come from and where the "ambience", the pattern of reflected sound that gives us an auditory impression of our surroundings appears to come from. If you have the opportunity to hear a loud sound outdoors which produces an echo from some nearby building, take note of where you hear the original sound and the echo radiating from. This spatial information is just what a stereo recording preserves. Listening to the recording, the first explosion sound is in fact very narrow- so narrow that even on headphones, which exaggerate the width of the image, it's impossible to tell from a mono source- and slightly left of the center of the stereo image. There are essentially no lateral reflections to be heard. The second explosion has virtually opposite characteristics. It is spread wide across the stereo image, coming on very strongly from hard left and hard right, while it has no appreciable center component. While the ear is probably the best instrument for gauging the apparent width of the recordings of the two explosions, there are instruments which allow us to check our perceptions by measurement. One is the correlation meter. This is a device which measures the degree of sameness between the left and right channels. For a pure mono signal, which is identical in both channels, a correlation meter will produce an indication of +1. For a signal in which the left and

right channels have opposite polarity, that is, left = -right (commonly and erroneously referred to as "out of phase"), it will produce an indication of -1. For a signal in which the left and right channels have no correlation with each other, or a signal where the left and right are 90 degrees out of phase, it will produce an indication of 0. Correlation meters are used in sound mixing to gauge the width of the stereo image. Generally, the mixer will want to see the indication hovering between the 0 and +1 marks. If the indication dips into the negative half of the scale for any appreciable time, there are likely to be problems with the mono compatibility of the mix, while if it stays too near the top of the scale the mix may not contain enough stereo information to create a good sense of space. To check out the difference in imaging between the first and second explosions I used two different correlation meters, one the analog meter in the Neve 8078 console in the control room where I've done most of my listening to this clip, which unfortunately can't be reproduced on the 'net, and a digital one which can. To do this, I imported the AIFF audio file into a ProTools session and inserted the Digidesign/Bomb Factory correlation meter plug-in into the stereo master fader. I selected a one-second segment of audio starting at the beginning of the first explosion and set ProTools to play it repeatedly while I took a screenshot of the display. I then did the same thing for the first second of the second explosion. These screenshots show the results: Figure 2: Playback of first explosion. Correlation meter indicates very near +1

Figure 3: Playback of second explosion. Correlation meter reading drops to +0.5 The Neve correlation meter, which is faster than the Bomb Factory plug-in, was more revealing. Up to the first explosion, it hovered around the +0.75 area, with a slight uptick at the first explosion, showing that the explosion narrowed the overall image. The second explosion drove it down very nearly to the 0 mark, showing that the second explosion widened the mix considerably. Both meters confirm what I hear. The first explosion is mostly mono information while the second explosion is wide stereo. Observations on a phase scope- another old-school method of visualizing stereo which consists of an oscilloscope set to X-Y display mode and connected so that the left channel deflects the beam vertically while the right channel deflects it horizontally- give the same results. With source and mic in essentially fixed positions, for such changes in stereo perspective to occur in an authentic stereo-miced recording the acoustic path between the tower and the mics would have to change very dramatically in the two seconds between explosions. To produce the effects heard in this recording large reflecting surfaces would have to be present to the left and right of the camera at the time of the second explosion which weren't there two seconds before, and an obstacle would have to be placed to the front of the camera to block the direct sound from the second explosion, an obstacle which is nowhere to be seen in the video and which wasn't there during the first explosion two seconds before. The physical universe works in consistent ways. Sound radiating from a given location will encounter the same reflecting, absorbing, diffusing and diffracting objects on its way to another fixed location on repeated trials. A local acoustic environment made up of multistory buildings does not spontaneously rearrange itself in a two-second interval. But that is exactly what would be required to produce this recording in the field.

For that reason I believe that the likelihood of this recording being authentic is very, very, vanishingly low, and the likelihood of its being the forged product of someone with access to an SFX library and an editing workstation to be very high. If the hoaxer had had the wit to make his final product mono by summing the left and right channels this internal evidence would have disappeared and the fakery would have been much more difficult to detect. Just in case the self-contradictory nature of this recording isn't damning enough, the original video is on the 'net. It can be viewed, although not downloaded and saved, here: http://video.google.com/videoplay?docid=-5405555553528290546&q=trinity+church This video runs for one minute and 54 seconds, showing the collapse of the tower and onlookers, including the cameraman, fleeing the expanding dust cloud. Comparing the video portion with the "explosions" clip, it's clear that the video is identical except for the "10:28 AM" inserted into the latter. Careful listening to the sound portion shows some features which support the hypothesis that it is authentic. The audio is definitely mono. This is not surprising because the majority of built-in camcorder microphones are mono. What's more interesting is that everything in the sound track is clearly and logically related to what is seen in the video. At the beginning, we hear the gabble of voices as bystanders watch the burning tower. After the collapse begins, once the realization sinks in that this enormous building is actually collapsing before their eyes, the talking turns to screams. Individual voices can be distinguished and intelligible speech is heard. Throughout the video, from the reaction to the collapse to the sounds of the crowd fleeing the dust cloud to the cop gesturing and exhorting the people taking refuge in a building lobby to get down and keep away from the windows, the sound track is perfectly coordinated with what we see happening in the picture. There is no trace of the sound of explosions, but the booming rumble of the collapsing building is recorded clearly. That the sound of the collapsing building is bass-heavy and contains little high-frequency energy also argues in favor of the authenticity of the recording. Air isn't a perfect transmission medium for sound; energy is dissipated in viscous and rotational effects as sound waves propagate away from their source so that the falloff in loudness with increasing distance is somewhat greater than the inverse-square law predicts. This effect is called "excess attenuation". It's dependent on temperature, barometric pressure and relative humidity, but it is always greater at higher frequencies. Because this effect is one which the designers of public address systems have to contend with, there are computer programs for calculating the excess attenuation given temperature, pressure, relative humidity and the frequency of interest. One such online calculator can be found at http://www.mcsquared.com/dbframe.htm. Weather records for New York City on September 11, 2001 (see http://www.erh.noaa.gov/box/dailystns.shtml) give a range of minimum and maximum temperatures from 63 to 82 degrees Fahrenheit, a range of relative humidity from 38 to 87 percent and average pressures from 1018.8 to 1019.2 millibars (101880 to 101920 Pascals). Using the most optimistic set of conditions (maximum temperature, pressure and relative humidity), at a distance of 450 meters 10 khz should be attenuated by 32.5 db relative to 1 khz (if it was not yet quite that hot or humid when the tower fell, the high-frequency loss would be even greater). Consequently, we would expect the sound of the building collapsing to be boomy and lacking in treble and that is exactly what we hear in the original video.

By contrast, in the four seconds of the "explosions" video, there is not a single intelligible voice nor any indication that people near the camera are responding in any way to what they see happening. The only approximations of ambient background sounds are so generic, unidentifiable and unintelligible that they are far more likely to have been created synthetically or taken from a sound effects library than to have been recorded at the scene. You could check my observations with the visualization plug-ins available for systems like Digidesign's ProTools or Apple's Logic or Steinberg s Wavelab (Google "correlation meter": or "phasescope"). Electronics geeks who have a dual-trace scope with an X-Y mode can try the phase scope tests. But really all you need are your ears and a decent playback system. Anyone can learn empirically about the ways that sound interacts with the surroundings by paying attention to what they hear in different environments: that s quite enough to detect the tell-tale signs of a hoax like this. 17 th June 2006 Mark De Martini ktesibios@hotmail.com