Evolution of sound reproduction from mechanical solutions to digital techniques optimized for human hearing

Size: px
Start display at page:

Download "Evolution of sound reproduction from mechanical solutions to digital techniques optimized for human hearing"

Transcription

1 Communication acoustics: Paper ICA Evolution of sound reproduction from mechanical solutions to digital techniques optimized for human hearing Ville Pulkki 1 Aalto University, Finland, Ville.Pulkki@aalto.fi Abstract Sound reproduction consists of the processes of recording, processing, storing and recreating sound, typically speech, music, environmental or other sounds. The applications include such fields as public address, telecommunication, music, cinema, virtual reality, and aided hearing. The unifying factor is the common endpoint of the chain, the human listener. Historically, reproduction of sound has come a long way from the first monophonic phonographs. Nowadays audio is available efficiently in various digital formats and immersive 3D spatial sound can be reproduced over different multichannel set-ups or headphones. In general view, the trend in the development of sound reproduction during last decades has been the dedicated design of reproduction systems to match better the resolution of human hearing system. The reproduction methods are designed to reproduce acoustic parameters in time, frequency, and space with only a bit better resolution than the hearing system has. This paper discusses the needs and challenges faced in sound reproduction and it will also present various solutions and technologies. Keywords: audio, sound reproduction, spatial sound

2 Evolution of sound reproduction from mechanical solutions to digital techniques optimized for human hearing 1 Introduction The methods to store audio signals over time have been developed for more than hundred years, starting from mechanical solutions, and ending to masking-based digital audio. The first reproduction methods were monophonic, and the signal was made audible using a single acoustic radiator. The evolution of systems to reproduce also spatial properties of sound has produced a wide variety of different listening set-ups, with different number of loudspeakers in 2D or 3D positioning, or headphone reproduction with or without head tracking. The audio formats for spatial set-ups have been based on delivering single audio signal for each loudspeaker. Only recently techniques to represent audio in generic format that can be listened to with any spatial reproduction technique have emerged. 2 Storing audio signals 2.1 Analog methods The first method to transfer sound over time was the phonograph, which is a device invented in 1877 for the mechanical recording and reproduction of sound. In its later forms it is also called a gramophone. The sound is captured with a horn to a tube, and the waveforms propagating in the tube are recorded as corresponding physical deviations of a spiral groove into the surface of a rotating cylinder or disc. When reproducing the sound, the surface is similarly rotated while a playback stylus traces the groove. The vibrations of the stylus are then in transferred to vibrations of a membrane. The membrane then radiates to a tube having a horn in the open end of it, and the reproduced sound is made audible. In later electric record players or turntables, the motions of the stylus are converted into an analog electrical signal by a transducer called a pickup or cartridge. The signal is further electronically amplified with a power amplifier, then made audible using a loudspeaker. [29] Alternatively to mechanical storing, the magnetic solutions were developed in 1930 s. The most used ones were based on tape recorders. The audio signal is represented in analog form in the strength of magnetization in the magnetic coating on the tape. The playback head of the recorder then reads the signal, and converts it into electric signal. [28] 2.2 Digital methods Analog methods suffered from multiple problems, such as limited signal-to-noise ratio, and various different distortions in signal. The digital methods solve most of the problems in storing audio signals, where the electric audio signal from a microphone is converted to digital (numeric) form using analog-to-digital (A/D) conversion. The most straightforward representation 2

3 audio in Analysis filterbank Quantization & Coding Encoding of bitstream Perceptual model bitstream in Decoding of bitstream Inverse Quantization Synthesis filterbank audio out Figure 1: The block diagram of an audio encoder and decoder based on perceptual masking. Adapted from [3]. is to use PCM coding (Pulse-Code Modulation) in A/D conversion. Each sample is quantized into a binary number where the number of bits implies the precision of the result. Sample values of an analog signal are mapped onto binary numbers so that there are 2 n discrete levels when the number of bits is n. [30] Quantization with finite precision generates an error called quantization noise. Each added bit improves SNR between maximal signal and quantization noise by 6 db for each added bit, so that 16 bits, often used in audio, yields a maximum SNR of about 96 db. Because the dynamic range of the auditory system is about 130 db, even more than 22 bits may be needed. Digital signal processing has many advantages compared to analog techniques. It is predictable, and a single DSP processor may be programmed to compute any DSP programs that does not exceed its processing capacity. However, the amount of data needed to represent audio signals is too high for some applications, and the delay due to coding process may be harmful in communication. 2.3 Masking-based audio coding Considerable development has been conducted in compression of audio signals into lower amounts of data, where the perceptual effects caused by compression are minimized. With audio coding methods such as MPEG-1 Layer-3 (MP3) [10] and MPEG-2 Advanced Audio Coding (AAC) [11, 2], the main means of reducing the bit-rate is to transform the signals into the time-frequency domain and to optimize the quantization of the time-frequency samples using a perceptual masking model, as shown in Fig 1. If quantization noise was equally spread over the entire frequency region, it would be easily audible in those frequency regions where the signal level is low. The spectrum of quantization noise can be shaped so that it follows the masking curve created by the signal, but shifted slightly lower in level. In general, if the level is set to about 13 db lower than that of the signal, quantization noise is no longer audible, although corresponding quantization noise with a flat spectrum is clearly annoying. This effect is known as the "13-dB miracle" from an audio 3

4 demonstration given by J. D. Johnston and K. Brandenburg at AT&T Bell Labs in The audio signals used in the demonstration are described in [4]. 3 Channel-based sound systems 3.1 Monophony Monophonic reproduction was state-of-the-art method for sound reproduction for decades, and for many applications it is still an adequate method. For example, in telephones most commonly the sound captured near the mouth of the talker is monophonically reproduced with a loudspeaker next to the ear. This gives maximal quality in speech intelligibility, which is the key requirement in mobile communication. However, due to coloration effects monophonic reproduction is considered suboptimal in many other applications, such as in music and in cinema. A disadvantage of monophonic reproduction is that the coloration caused by the recording room is exaggerated in the listening when compared to binaural listening of the recording venue [19]. The reason for this is that the reproduced single sound signal is filtered by the room, manifesting as a complex structure in the magnitude frequency response. In recording room sound conditions the sound reaching at the two ears of a listener through the recording room has different magnitude and phase spectra, which results in binaural decoloration. However, in monophonic reproduction the spectrum emanated by the loudspeaker is already filtered by the recording room response, and the binaural decoloration mechanisms can only try to compensate for the listening room acoustics, not the recording room acoustics. This makes the acoustical effect of the recording room overemphasized in listening. When the microphone is at a distance of a few centimeters from the source, the acoustics of the recording room does not have much of an effect on the result, and the timbral quality of the reproduction is good. The disadvantage discussed in the previous paragraph is thus valid only when the captured effect of the room is significant, or in technical terms, the level of the direct sound is comparable to or lower than that of the reverberant field. 3.2 Stereophony The motivation to use more than two loudspeakers in the reproduction is the potentially better spatial quality in a larger listening area. The most common loudspeaker set-up is the twochannel stereophonic set-up. Its use became widespread after the development of the singlegroove 45 /45 two-channel record in the late 1950s. Two loudspeakers are positioned in front of the listener 60 apart. The set-up enables the positioning of virtual sources between the loudspeakers, and it also makes the timbral quality of reproduction better when compared to monophonic reproduction. The reduced coloration issues in stereophonic reproduction can be understood with simple discussion. When recording for stereophonic setup, at least two microphones are used. Since each microphone is located in different position in the recording room, or alternatively is in the same position but has different directivity, the room effect is different in corresponding 4

5 signals. This results in loudspeaker signals, that have different manifestations of the room effect with different phase and magnitude spectra, enabling binaural decoloration effects in hearing. The reproduced sound does thus then have such emphasized room effect as monophonic reproduction has, and the perceived quality is improved a lot. 3.3 Multi-channel setups Different multi-channel loudspeaker set-ups have been specified in the history of multichannel audio [26, 6, 27]. In the 1970s, the quadraphonic set-up was proposed where four loudspeakers are positioned evenly around the listener at azimuth angles ±45 and ±135. This layout was never successful because of problems related to content delivery techniques of that time, and because the layout itself has too few loudspeakers to provide good spatial quality in all directions around the listener [23]. A sound reproduction system was developed for cinema, where the front image stability of the standard stereophonic set-up was enhanced by an extra center channel and two surround channels were added to create atmospheric effects and room perception. This surround sound system for cinemas was first used in 1976 [6], and the ITU made a recommendation for the layout in 1992 [5]. The late 1990s saw some households acquiring also this 5.1 surround system, where the figure before the dot stands for the number of loudspeakers and the figure after the dot is the number of low-frequency channels. In the ITU recommendation, the three frontal loudspeakers are in the directions 0 and ±30, and the two surround channels in directions ±110±10. The system has been criticized for not being able to deliver good directional quality anywhere else other than in the front [23]. So other layouts with 6 12 loudspeakers have been proposed to enhance the directional quality in other directions as well. All the loudspeaker layouts described above have loudspeakers only in the horizontal plane. There are systems in which loudspeakers are placed above and/or below the listener too for use in theaters and virtual environment systems, which enables positioning sound sources above and below the listener as well, thus enhancing the perceived realism especially in situations where the 3-D position of a virtual source is important [24]. Typical examples of such situations are virtual sources for flying vehicles or the sound of raindrops on a roof. Such 3-D set-ups have been proposed for use in domestic listening too and are currently being standardized [12]. For example, Japan Broadcasting Corporation has proposed a 22.2 loudspeaker set-up [8], which has 22 loudspeakers in planes at three heights and two subwoofers, or a common setup has four elevated loudspeakers added to the 7.1 or 5.1 set-up, denoted as or 5.1.4, where the last digit denotes the number of elevated speakers. 3.4 Binaural reproduction The basic binaural recording technique is to reproduce a recorded binaural sound track through headphones. The recording is made by inserting miniature microphones in the ear canals of a real human listener, or by using a manikin with microphones in the ears [31, 1]. Such a recording is reproduced by playing the recorded signals to the ears of the listener. In principle, this is a very simple technique and can provide effective results. A simple implementation 5

6 is to replace the transducers of in-ear headphones with insert miniature microphones, use a portable audio recorder to record the sounds of the surroundings, and play back the sound with headphones. Already without any further processing, a convincing spatial effect is achieved, as the left-right directions of the sound sources and the reverberant sound field are reproduced naturally. Especially, if the person who did the recording is the listener, the effect can be striking. Unfortunately, there are also technical challenges with the technique. The sound may appear colored, the perceived directions move from front to back, and everything may be localized inside head. To partially avoid these problems, the recording and the reproduction should be carefully equalized, because headphone listening typically produces a different magnitude spectrum to the ear drum than natural listening. Careful equalization of headphone listening is, unfortunately, a complicated business, and it requires very careful measurements of the acoustical transmission of sound from the headphone to the ear drum [32]. A further challenge in binaural reproduction is that the auditory system also utilizes dynamic cues to localize sound sources. When listening to a binaural recording with headphones, the movements of the listener do not change the binaural reproduction at all, and the best explanation is that the sources must be inside the listener s head. This is one reason why headphone reproduction easily tends to be localized inside head of the listener [1]. 4 Loudspeaker-setup agnostic formats The drawback of the channel-based spatial audio systems is the requirement of to listen each format with a fixed loudspeaker setup. The consumer has to purchase the audio content specifically for his listening system, and the interchange between different systems is awkward. A generic audio format suitable for whatever standard or arbitrary loudspeaker layout or headphones would be an answer to these needs, and considerable amount of research has been conducted to that direction. 4.1 Generic linear audio The Ambisonics reproduction technique [7] provides a theoretical framework for coincident recording techniques for 2-D and 3-D multichannel loudspeaker set-ups. In theory, Ambisonics is a compact-format, loudspeaker-set-up-agnostic, efficient and comprehensive method for capture, storage and reproduction of spatial sound. All coincident multichannel microphone techniques produce signals that can be transformed to B-format signals. B-format signals have directional patterns that correspond to spherical harmonics. The spherical harmonics can thus be seen as basis functions for the design of arbitrary patterns. The most common microphone device for Ambisonics is the first-order four-capsule B-format microphone, producing signals with directional patterns of spherical harmonics upto the first order. Higher-order microphones with more capsules have also been developed and are commercially available. The higher-order components can then be derived in a specific frequency window. There are some higher-order microphones available, which can extract har- 6

7 monics upto about the 4-6th order in a certain frequency window. The number of good-quality microphone capsules in such devices is relatively high, something like Outside the frequency window the microphones suffer from low-frequency noise and deformation of directional patterns at high frequencies [21, 15]. In principle, first-order Ambisonics could be used for any loudspeaker set-up, but unfortunately it has very limited range of use. The broad first-order directional patterns make the listening area where the desired effect is audible very small, extending to the size of the head of the listener only at frequencies below about 700 Hz [25]. At higher frequencies, the high coherence between the loudspeaker signals leads to undesired effects, such as coloration and loss of spaciousness. The number of loudspeakers in a horizontal set-up should not exceed 2N + 1, where N is the order of the B-format microphone, to avoid high coherence between the loudspeaker signals. Thus, first-order microphones can be used only with three-loudspeaker set-ups, which are far too few to produce the perception of virtual sources between the loudspeakers. This calls for the use of microphone set-ups able to capture signals with higher-order spherical harmonic directional patterns. With higher-order directional signal components, the directional patterns of the loudspeaker signals can be made narrower, which solves these issues, though with known problems with low-frequency noise and high-frequency aliasing. 4.2 Generic non-linear audio formats The problems with the utilization of spherical harmonic signals in reproduction of spatial sound lead to consideration if the human hearing resolution could be used to enhance spatial audio reproduction. The spatial resolution of hearing is limited within the auditory frequency bands [1]. In principle, all sound within one critical band can be only perceived as a single source with broader or narrower extent. The limitations of spatial auditory perception impose the question of whether the spatial accuracy in reproduction of an acoustical wave field can be compromised without a decrease in perceptual quality. When some assumptions on the resolution of human spatial hearing are used to derive reproduction techniques, potentially an enhanced quality of reproduction is obtained [16]. The audio recording and reproduction technology called Directional audio coding (DirAC) [18, 14] was the first non-linear signal-dependent method to record and reproduce spatial audio exploiting the resolution of human hearing. It assumes that the spatial resolution of the auditory system at any one time instant and in one critical band is limited to extracting one cue for direction and another for inter-aural coherence. It further assumes that if the direction and diffuseness of the sound field is measured and reproduced correctly with a suitable time resolution, a human listener will perceive the directional and coherence cues correctly. An example implementation of DirAC is shown in Figure 2. In the analysis, the direction and diffuseness parameters of the sound field is estimated using temporal energy analysis in the auditory frequency bands. The parameters are then used in reproduction. The direction is expressed in azimuth and elevation angles, indicating the most important direction of arrival of sound energy. Diffuseness is a real number between zero and one, that indicates whether a sound field resembles mostly a plane wave or a diffuse field. Virtual microphones are then formed from B-format signals, which are divided into a diffuse stream and a non-diffuse stream 7

8 N frequency channels Directional and diffuseness analysis Direction (azi, ele) Diffuseness (Ψ) Gain factor computation Loud. speaker setup STFT 1 - Ψ Ψ gain factors Sum over frequency channels B-format microphone channels in Virtual cardioid microphones NON-DIFFUSE STREAM Loudspeaker setup information Decorrelation DIFFUSE STREAM Decorrelation Diffusion Loudspeaker signals B-format audio single-channel audio parameter Figure 2: Reproduction of spatial audio with Directional audio coding (DirAC). The system is non-linear and signal-dependent, and it is based on assumptions of human hearing. using the diffuseness parameter. The non-diffuse stream is assumed to contain sound that originates primarily from one source, and in DirAC the corresponding time-frequency part of sound is applied in the direction analyzed for it. The diffuse stream in turn is assumed to contain sound originating from reverberation or from multiple concurrent sources from different directions, which should produce low interaural coherence. The diffuse is thus applied to all directions after some decorrelation process. Both streams thus reduce the coherence between loudspeaker channels, which mitigates the artifacts in corresponding linear systems. According to listening tests, DirAC indeed enhances the reproduction quality when compared to first-order linear decoding with both loudspeakers and headphones [17]. The system performs well if the recorded spatial sound is not in strong violation with the implicit assumption of sound field in DirAC, i.e., at each frequency band only single source is dominant at one time with moderate level of reverberation. In other cases audible distortions, i.e., artifacts, may occur. Typical cases are, e.g., surrounding applause, speech in presence of broadband noise from opposing directions, or strong early reflections within single temporal analysis window. A number of methods to overcome the artifacts have been developed [17], and the utilization of higher-order input in parametric analysis makes the artifacts to vanish in any realistic acoustical condition [20]. 4.3 Object-based audio While the previous two sections discussed the recording of existing spatial sound fields for listening with arbitrary set-ups, there has been also advances in delivery of audio content to consumers in listening set-up agnostic format. In traditional mixing, the outcome of the mixing desk is a set of audio signals, each meant to be played back over a loudspeaker. Recently, several audio formats meant primarily for cinema have been proposed which allow different reproduction set-ups. The audio signals of discrete sources and multi-channel mixed material are sent accompanied by spatial metadata. The metadata then defines how the tracks are 8

9 Figure 3: The delivery of spatial audio using MPEG-H standard [9]. rendered to the loudspeakers. The corresponding methods are called object-based audio techniques [22, 13]. The first movies with sound tracks in such formats were released in For example, the MPEG-H format [9] can be used to deliver a large number of monophonic sound signals (objects) to be positioned to the reproduction setup, which can be either any loudspeaker setup or headphone playback, see Fig. 3. This can be accompanied by transmission of audio in channel-based format, which is then reproduced directly with the loudspeakers, or converted to match the layout in reproduction. Furthermore, higher-order Ambisonics (HOA) signals can also be transmitted with MPEG-H. References [1] J. Blauert. Spatial Hearing Psychophysics of Human Sound Localization. MIT Press, [2] M. Bosi, K. Brandenburg, S. Quackenbush, L. Fielder, K. Akagiri, H. Fuchs, and M. Dietz. ISO/IEC MPEG-2 advanced audio coding. J. Audio Eng. Soc., 45(10): , [3] K. Brandenburg. MP3 and AAC explained. In 17th Int. Conf. Audio Eng. Soc., September [4] K. Brandenburg and T. Sporer. -nmr-and-masking flag-: Evaluation of quality using perceptual criteria. In 11th Int. Audio Eng. Soc. Conf.: test & measurement. AES, [5] I. BS Multichannel stereophonic sound system with and without accompanying picture. Recommendation, International Telecommunication Union, Geneva, Switzerland, [6] M. F. Davis. History of spatial coding. J. Audio Eng. Soc., 51(6):554 69, June [7] M. J. Gerzon. Periphony: With height sound reproduction. J. Audio Eng. Soc., 21(1):2 10, January/February

10 [8] K. Hamasaki, K. Hiyama, and R. Okumura. The 22.2 multichannel sound system and its application. In Audio Eng. Soc. Convention 118. AES, [9] J. Herre, J. Hilpert, A. Kuntz, and J. Plogsties. Mpeg-h audio the new standard for universal spatial/3d audio coding. Journal of the Audio Engineering Society, 62(12): , [10] ISO/IEC. Coding of moving pictures and associated audio for digital storage media at up to about 1,5 mbit/s Part 3: Audio. Standard , [11] ISO/IEC. MPEG-2 advanced audio coding, AAC. Standard JTC1/SC29/WG11 (MPEG), [12] ISO/IEC High efficiency coding and media delivery in heterogeneous environments. Standard, [13] P.-A. S. Lemieux, W. Dressler, and J.-M. Jot. Object-based audio system using vector base amplitude panning, May US Patent App. 13/906,214. [14] J. Merimaa and V. Pulkki. Spatial impulse response rendering. In 7th Intl. Conf. on Digital Audio Effects (DAFX04), [15] S. Moreau, J. Daniel, and S. Bertet. 3d sound field recording with higher order Ambisonics - objective measurements and validation of spherical microphone. In Audio Eng. Soc. Convention 120, May [16] A. Politis and V. Pulkki. Overview to time-frequency-domain parametric spatial audio techniques. In V. Pulkki, S. Delikaris-Manias, and A. Politis, editors, Parametric Time-Frequency-Domain Spatial Audio, chapter 4. Wiley, In press. [17] V. Pulkki, M.-V. L., J. Vilkamo, J. Ahonen, and A. Politis. First-order directional audio coding. In V. Pulkki, S. Delikaris-Manias, and A. Politis, editors, Parametric Time-Frequency-Domain Spatial Audio, chapter 5. Wiley, In press. [18] V. Pulkki. Spatial sound reproduction with directional audio coding. J. Audio Eng. Soc., 55(6):503 16, [19] V. Pulkki and M. Karjalainen. Communication Acoustics: An Introduction to Speech, Audio and Psychoacoustics. John Wiley & Sons, [20] V. Pulkki, A. Politis, G. Del Galdo, and A. Kuntz. Parametric spatial audio reproduction with higher-order B-Format microphone input. In Audio Eng. Soc. Convention 134, May [21] B. Rafaely, B. Weiss, and E. Bachmat. Spatial aliasing in spherical microphone arrays. IEEE Trans. Signal Proc., 55(3): , [22] C. Q. Robinson, S. Mehta, and N. Tsingos. Scalable format and tools to extend the possibilities of cinema audio. SMPTE Motion Imaging Journal, 121(8):63 9, [23] F. Rumsey. Spatial Audio. Taylor & Francis, [24] A. Silzle, S. George, E. Habets, and T. Bachmann. Investigation on the quality of 3D sound reproduction. Proceedings of ICSA, page 334, [25] A. Solvang. Spectral impairment of two-dimensional higher order Ambisonics. J. Audio Eng. Soc., 56(4):267 79, April [26] G. Steinke. Surround sound the new phase. an overview. In Audio Eng. Soc. 100th Convention, Copenhagen, Denmark, [27] E. Torick. Highlights in the history of multichannel sound. J. Audio Eng. Soc., 46(1/2):27 31, [28] Wikipedia. Magnetic tape [29] Wikipedia. Phonograph [30] Wikipedia. Pulse-code modulation [31] A. Wilska. Untersuchungen ueber das richtungshoeren (Studies on Directional Hearing). PhD thesis, Helsinki University, English translation available: publications/wilskathesis/. [32] B. Xie. Head-Related Transfer Function and Virtual Auditory Display, volume 2. J. Ross publishing,

RECORDING AND REPRODUCING CONCERT HALL ACOUSTICS FOR SUBJECTIVE EVALUATION

RECORDING AND REPRODUCING CONCERT HALL ACOUSTICS FOR SUBJECTIVE EVALUATION RECORDING AND REPRODUCING CONCERT HALL ACOUSTICS FOR SUBJECTIVE EVALUATION Reference PACS: 43.55.Mc, 43.55.Gx, 43.38.Md Lokki, Tapio Aalto University School of Science, Dept. of Media Technology P.O.Box

More information

TEPZZ A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (51) Int Cl.: H04S 7/00 ( ) H04R 25/00 (2006.

TEPZZ A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (51) Int Cl.: H04S 7/00 ( ) H04R 25/00 (2006. (19) TEPZZ 94 98 A_T (11) EP 2 942 982 A1 (12) EUROPEAN PATENT APPLICATION (43) Date of publication: 11.11. Bulletin /46 (1) Int Cl.: H04S 7/00 (06.01) H04R /00 (06.01) (21) Application number: 141838.7

More information

TEPZZ 94 98_A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (43) Date of publication: Bulletin 2015/46

TEPZZ 94 98_A_T EP A1 (19) (11) EP A1 (12) EUROPEAN PATENT APPLICATION. (43) Date of publication: Bulletin 2015/46 (19) TEPZZ 94 98_A_T (11) EP 2 942 981 A1 (12) EUROPEAN PATENT APPLICATION (43) Date of publication: 11.11.1 Bulletin 1/46 (1) Int Cl.: H04S 7/00 (06.01) H04R /00 (06.01) (21) Application number: 1418384.0

More information

Concert halls conveyors of musical expressions

Concert halls conveyors of musical expressions Communication Acoustics: Paper ICA216-465 Concert halls conveyors of musical expressions Tapio Lokki (a) (a) Aalto University, Dept. of Computer Science, Finland, tapio.lokki@aalto.fi Abstract: The first

More information

Ch. 1: Audio/Image/Video Fundamentals Multimedia Systems. School of Electrical Engineering and Computer Science Oregon State University

Ch. 1: Audio/Image/Video Fundamentals Multimedia Systems. School of Electrical Engineering and Computer Science Oregon State University Ch. 1: Audio/Image/Video Fundamentals Multimedia Systems Prof. Ben Lee School of Electrical Engineering and Computer Science Oregon State University Outline Computer Representation of Audio Quantization

More information

ATSC Standard: A/342 Part 1, Audio Common Elements

ATSC Standard: A/342 Part 1, Audio Common Elements ATSC Standard: A/342 Part 1, Common Elements Doc. A/342-1:2017 24 January 2017 Advanced Television Systems Committee 1776 K Street, N.W. Washington, DC 20006 202-872-9160 i The Advanced Television Systems

More information

The Cocktail Party Effect. Binaural Masking. The Precedence Effect. Music 175: Time and Space

The Cocktail Party Effect. Binaural Masking. The Precedence Effect. Music 175: Time and Space The Cocktail Party Effect Music 175: Time and Space Tamara Smyth, trsmyth@ucsd.edu Department of Music, University of California, San Diego (UCSD) April 20, 2017 Cocktail Party Effect: ability to follow

More information

METHODS TO ELIMINATE THE BASS CANCELLATION BETWEEN LFE AND MAIN CHANNELS

METHODS TO ELIMINATE THE BASS CANCELLATION BETWEEN LFE AND MAIN CHANNELS METHODS TO ELIMINATE THE BASS CANCELLATION BETWEEN LFE AND MAIN CHANNELS SHINTARO HOSOI 1, MICK M. SAWAGUCHI 2, AND NOBUO KAMEYAMA 3 1 Speaker Engineering Department, Pioneer Corporation, Tokyo, Japan

More information

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur Module 8 VIDEO CODING STANDARDS Lesson 24 MPEG-2 Standards Lesson Objectives At the end of this lesson, the students should be able to: 1. State the basic objectives of MPEG-2 standard. 2. Enlist the profiles

More information

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Mohamed Hassan, Taha Landolsi, Husameldin Mukhtar, and Tamer Shanableh College of Engineering American

More information

Introduction to Data Conversion and Processing

Introduction to Data Conversion and Processing Introduction to Data Conversion and Processing The proliferation of digital computing and signal processing in electronic systems is often described as "the world is becoming more digital every day." Compared

More information

Measurement of overtone frequencies of a toy piano and perception of its pitch

Measurement of overtone frequencies of a toy piano and perception of its pitch Measurement of overtone frequencies of a toy piano and perception of its pitch PACS: 43.75.Mn ABSTRACT Akira Nishimura Department of Media and Cultural Studies, Tokyo University of Information Sciences,

More information

A few white papers on various. Digital Signal Processing algorithms. used in the DAC501 / DAC502 units

A few white papers on various. Digital Signal Processing algorithms. used in the DAC501 / DAC502 units A few white papers on various Digital Signal Processing algorithms used in the DAC501 / DAC502 units Contents: 1) Parametric Equalizer, page 2 2) Room Equalizer, page 5 3) Crosstalk Cancellation (XTC),

More information

INTERNATIONAL TELECOMMUNICATION UNION GENERAL ASPECTS OF DIGITAL TRANSMISSION SYSTEMS PULSE CODE MODULATION (PCM) OF VOICE FREQUENCIES

INTERNATIONAL TELECOMMUNICATION UNION GENERAL ASPECTS OF DIGITAL TRANSMISSION SYSTEMS PULSE CODE MODULATION (PCM) OF VOICE FREQUENCIES INTERNATIONAL TELECOMMUNICATION UNION ITU-T G TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU GENERAL ASPECTS OF DIGITAL TRANSMISSION SYSTEMS TERMINAL EQUIPMENTS PULSE CODE MODULATION (PCM) OF VOICE FREQUENCIES

More information

1 Introduction to PSQM

1 Introduction to PSQM A Technical White Paper on Sage s PSQM Test Renshou Dai August 7, 2000 1 Introduction to PSQM 1.1 What is PSQM test? PSQM stands for Perceptual Speech Quality Measure. It is an ITU-T P.861 [1] recommended

More information

How to Obtain a Good Stereo Sound Stage in Cars

How to Obtain a Good Stereo Sound Stage in Cars Page 1 How to Obtain a Good Stereo Sound Stage in Cars Author: Lars-Johan Brännmark, Chief Scientist, Dirac Research First Published: November 2017 Latest Update: November 2017 Designing a sound system

More information

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE

inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering August 2000, Nice, FRANCE Copyright SFA - InterNoise 2000 1 inter.noise 2000 The 29th International Congress and Exhibition on Noise Control Engineering 27-30 August 2000, Nice, FRANCE I-INCE Classification: 7.9 THE FUTURE OF SOUND

More information

Information Transmission Chapter 3, image and video

Information Transmission Chapter 3, image and video Information Transmission Chapter 3, image and video FREDRIK TUFVESSON ELECTRICAL AND INFORMATION TECHNOLOGY Images An image is a two-dimensional array of light values. Make it 1D by scanning Smallest element

More information

Natural Radio. News, Comments and Letters About Natural Radio January 2003 Copyright 2003 by Mark S. Karney

Natural Radio. News, Comments and Letters About Natural Radio January 2003 Copyright 2003 by Mark S. Karney Natural Radio News, Comments and Letters About Natural Radio January 2003 Copyright 2003 by Mark S. Karney Recorders for Natural Radio Signals There has been considerable discussion on the VLF_Group of

More information

Digital Representation

Digital Representation Chapter three c0003 Digital Representation CHAPTER OUTLINE Antialiasing...12 Sampling...12 Quantization...13 Binary Values...13 A-D... 14 D-A...15 Bit Reduction...15 Lossless Packing...16 Lower f s and

More information

ELEC 691X/498X Broadcast Signal Transmission Fall 2015

ELEC 691X/498X Broadcast Signal Transmission Fall 2015 ELEC 691X/498X Broadcast Signal Transmission Fall 2015 Instructor: Dr. Reza Soleymani, Office: EV 5.125, Telephone: 848 2424 ext.: 4103. Office Hours: Wednesday, Thursday, 14:00 15:00 Time: Tuesday, 2:45

More information

APPLICATION OF A PHYSIOLOGICAL EAR MODEL TO IRRELEVANCE REDUCTION IN AUDIO CODING

APPLICATION OF A PHYSIOLOGICAL EAR MODEL TO IRRELEVANCE REDUCTION IN AUDIO CODING APPLICATION OF A PHYSIOLOGICAL EAR MODEL TO IRRELEVANCE REDUCTION IN AUDIO CODING FRANK BAUMGARTE Institut für Theoretische Nachrichtentechnik und Informationsverarbeitung Universität Hannover, Hannover,

More information

Evaluation of Auralization Results

Evaluation of Auralization Results Evaluation of Auralization Results Tapio Lokki and Lauri Savioja Helsinki University of Technology, Telecommunications Software and Multimedia Laboratory, P.O.Box 5400, FI-02015 HUT, Finland e-mail: {Tapio.Lokki,Lauri.Savioja}@hut.fi

More information

L. Sound Systems. Record Players

L. Sound Systems. Record Players L. Sound Systems We address three more sound sources in this section. These are the record player, tape deck, and CD player. They represent three levels of improvement in sound reproduction. Faraday's

More information

OVE EDFORS ELECTRICAL AND INFORMATION TECHNOLOGY

OVE EDFORS ELECTRICAL AND INFORMATION TECHNOLOGY Information Transmission Chapter 3, image and video OVE EDFORS ELECTRICAL AND INFORMATION TECHNOLOGY Learning outcomes Understanding raster image formats and what determines quality, video formats and

More information

Impact of scan conversion methods on the performance of scalable. video coding. E. Dubois, N. Baaziz and M. Matta. INRS-Telecommunications

Impact of scan conversion methods on the performance of scalable. video coding. E. Dubois, N. Baaziz and M. Matta. INRS-Telecommunications Impact of scan conversion methods on the performance of scalable video coding E. Dubois, N. Baaziz and M. Matta INRS-Telecommunications 16 Place du Commerce, Verdun, Quebec, Canada H3E 1H6 ABSTRACT The

More information

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK

Professor Laurence S. Dooley. School of Computing and Communications Milton Keynes, UK Professor Laurence S. Dooley School of Computing and Communications Milton Keynes, UK The Song of the Talking Wire 1904 Henry Farny painting Communications It s an analogue world Our world is continuous

More information

Investigation of Digital Signal Processing of High-speed DACs Signals for Settling Time Testing

Investigation of Digital Signal Processing of High-speed DACs Signals for Settling Time Testing Universal Journal of Electrical and Electronic Engineering 4(2): 67-72, 2016 DOI: 10.13189/ujeee.2016.040204 http://www.hrpub.org Investigation of Digital Signal Processing of High-speed DACs Signals for

More information

THE MPEG-H TV AUDIO SYSTEM

THE MPEG-H TV AUDIO SYSTEM This whitepaper was produced in collaboration with Fraunhofer IIS. THE MPEG-H TV AUDIO SYSTEM Use Cases and Workflows MEDIA SOLUTIONS FRAUNHOFER ISS THE MPEG-H TV AUDIO SYSTEM INTRODUCTION This document

More information

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur

Module 8 VIDEO CODING STANDARDS. Version 2 ECE IIT, Kharagpur Module 8 VIDEO CODING STANDARDS Lesson 27 H.264 standard Lesson Objectives At the end of this lesson, the students should be able to: 1. State the broad objectives of the H.264 standard. 2. List the improved

More information

ACTIVE SOUND DESIGN: VACUUM CLEANER

ACTIVE SOUND DESIGN: VACUUM CLEANER ACTIVE SOUND DESIGN: VACUUM CLEANER PACS REFERENCE: 43.50 Qp Bodden, Markus (1); Iglseder, Heinrich (2) (1): Ingenieurbüro Dr. Bodden; (2): STMS Ingenieurbüro (1): Ursulastr. 21; (2): im Fasanenkamp 10

More information

Understanding Compression Technologies for HD and Megapixel Surveillance

Understanding Compression Technologies for HD and Megapixel Surveillance When the security industry began the transition from using VHS tapes to hard disks for video surveillance storage, the question of how to compress and store video became a top consideration for video surveillance

More information

A SIMPLE ACOUSTIC ROOM MODEL FOR VIRTUAL PRODUCTION AUDIO. R. Walker. British Broadcasting Corporation, United Kingdom. ABSTRACT

A SIMPLE ACOUSTIC ROOM MODEL FOR VIRTUAL PRODUCTION AUDIO. R. Walker. British Broadcasting Corporation, United Kingdom. ABSTRACT A SIMPLE ACOUSTIC ROOM MODEL FOR VIRTUAL PRODUCTION AUDIO. R. Walker British Broadcasting Corporation, United Kingdom. ABSTRACT The use of television virtual production is becoming commonplace. This paper

More information

An Overview of Video Coding Algorithms

An Overview of Video Coding Algorithms An Overview of Video Coding Algorithms Prof. Ja-Ling Wu Department of Computer Science and Information Engineering National Taiwan University Video coding can be viewed as image compression with a temporal

More information

CURRICULUM VITAE John Usher

CURRICULUM VITAE John Usher CURRICULUM VITAE John Usher John_Usher-AT-me.com Education: Ph.D. Audio upmixing signal processing and sound quality evaluation. 2006. McGill University, Montreal, Canada. Dean s Honours List Recommendation.

More information

Video coding standards

Video coding standards Video coding standards Video signals represent sequences of images or frames which can be transmitted with a rate from 5 to 60 frames per second (fps), that provides the illusion of motion in the displayed

More information

Time smear at unexpected places in the audio chain and the relation to the audibility of high-resolution recording improvements

Time smear at unexpected places in the audio chain and the relation to the audibility of high-resolution recording improvements Time smear at unexpected places in the audio chain and the relation to the audibility of high-resolution recording improvements Dr. Hans R.E. van Maanen Temporal Coherence Date of issue: 22 March 2009

More information

Psychoacoustics. lecturer:

Psychoacoustics. lecturer: Psychoacoustics lecturer: stephan.werner@tu-ilmenau.de Block Diagram of a Perceptual Audio Encoder loudness critical bands masking: frequency domain time domain binaural cues (overview) Source: Brandenburg,

More information

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1 02/18 Using the new psychoacoustic tonality analyses 1 As of ArtemiS SUITE 9.2, a very important new fully psychoacoustic approach to the measurement of tonalities is now available., based on the Hearing

More information

Using the BHM binaural head microphone

Using the BHM binaural head microphone 11/17 Using the binaural head microphone Introduction 1 Recording with a binaural head microphone 2 Equalization of a recording 2 Individual equalization curves 5 Using the equalization curves 5 Post-processing

More information

ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals

ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals Purdue University: ECE438 - Digital Signal Processing with Applications 1 ECE438 - Laboratory 4: Sampling and Reconstruction of Continuous-Time Signals October 6, 2010 1 Introduction It is often desired

More information

Audio and Video II. Video signal +Color systems Motion estimation Video compression standards +H.261 +MPEG-1, MPEG-2, MPEG-4, MPEG- 7, and MPEG-21

Audio and Video II. Video signal +Color systems Motion estimation Video compression standards +H.261 +MPEG-1, MPEG-2, MPEG-4, MPEG- 7, and MPEG-21 Audio and Video II Video signal +Color systems Motion estimation Video compression standards +H.261 +MPEG-1, MPEG-2, MPEG-4, MPEG- 7, and MPEG-21 1 Video signal Video camera scans the image by following

More information

Minimax Disappointment Video Broadcasting

Minimax Disappointment Video Broadcasting Minimax Disappointment Video Broadcasting DSP Seminar Spring 2001 Leiming R. Qian and Douglas L. Jones http://www.ifp.uiuc.edu/ lqian Seminar Outline 1. Motivation and Introduction 2. Background Knowledge

More information

Why do some concert halls render music more expressive and impressive than others?

Why do some concert halls render music more expressive and impressive than others? Evaluation of Concert Halls / Opera Houses : ISMRA216-72 Why do some concert halls render music more expressive and impressive than others? Tapio Lokki Aalto University, Finland, Tapio.Lokki@aalto.fi Abstract

More information

Calibrate, Characterize and Emulate Systems Using RFXpress in AWG Series

Calibrate, Characterize and Emulate Systems Using RFXpress in AWG Series Calibrate, Characterize and Emulate Systems Using RFXpress in AWG Series Introduction System designers and device manufacturers so long have been using one set of instruments for creating digitally modulated

More information

Largeness and shape of sound images captured by sketch-drawing experiments: Effects of bandwidth and center frequency of broadband noise

Largeness and shape of sound images captured by sketch-drawing experiments: Effects of bandwidth and center frequency of broadband noise PAPER #2017 The Acoustical Society of Japan Largeness and shape of sound images captured by sketch-drawing experiments: Effects of bandwidth and center frequency of broadband noise Makoto Otani 1;, Kouhei

More information

Multichannel Audio Technologies

Multichannel Audio Technologies Multichannel Audio Technologies Dr. Gavin Kearney gpkearney@ee.tcd.ie http://www.mee.tcd.ie/~gkearney/mcat Room 23, Top Floor, Printing House What is multichannel audio? 1. A way of expanding and enriching

More information

Theater Sound MAE 5083

Theater Sound MAE 5083 Theater Sound MAE 5083 Charles O Neill November 20, 2001 Contents 1 Introduction 1 2 Sound Recording 1 2.1 MechanicalSchemes... 1 2.2 OpticalSchemes... 1 2.3 MagneticSchemes... 2 2.3.1 TapeNoise... 2 3

More information

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

Research Article. ISSN (Print) *Corresponding author Shireen Fathima Scholars Journal of Engineering and Technology (SJET) Sch. J. Eng. Tech., 2014; 2(4C):613-620 Scholars Academic and Scientific Publisher (An International Publisher for Academic and Scientific Resources)

More information

Chapter 1. Introduction to Digital Signal Processing

Chapter 1. Introduction to Digital Signal Processing Chapter 1 Introduction to Digital Signal Processing 1. Introduction Signal processing is a discipline concerned with the acquisition, representation, manipulation, and transformation of signals required

More information

Final draft ETSI EG V1.1.1 ( )

Final draft ETSI EG V1.1.1 ( ) Final draft EG 202 396-1 V1.1.1 (2005-09) Guide Speech Processing, Transmission and Quality Aspects (STQ); Speech quality performance in the presence of background noise; Part 1: Background noise simulation

More information

Colour Reproduction Performance of JPEG and JPEG2000 Codecs

Colour Reproduction Performance of JPEG and JPEG2000 Codecs Colour Reproduction Performance of JPEG and JPEG000 Codecs A. Punchihewa, D. G. Bailey, and R. M. Hodgson Institute of Information Sciences & Technology, Massey University, Palmerston North, New Zealand

More information

RECOMMENDATION ITU-R BT Studio encoding parameters of digital television for standard 4:3 and wide-screen 16:9 aspect ratios

RECOMMENDATION ITU-R BT Studio encoding parameters of digital television for standard 4:3 and wide-screen 16:9 aspect ratios ec. ITU- T.61-6 1 COMMNATION ITU- T.61-6 Studio encoding parameters of digital television for standard 4:3 and wide-screen 16:9 aspect ratios (Question ITU- 1/6) (1982-1986-199-1992-1994-1995-27) Scope

More information

RECOMMENDATION ITU-R BT (Questions ITU-R 25/11, ITU-R 60/11 and ITU-R 61/11)

RECOMMENDATION ITU-R BT (Questions ITU-R 25/11, ITU-R 60/11 and ITU-R 61/11) Rec. ITU-R BT.61-4 1 SECTION 11B: DIGITAL TELEVISION RECOMMENDATION ITU-R BT.61-4 Rec. ITU-R BT.61-4 ENCODING PARAMETERS OF DIGITAL TELEVISION FOR STUDIOS (Questions ITU-R 25/11, ITU-R 6/11 and ITU-R 61/11)

More information

EarStudio: Analog volume control. The importance of the analog volume control

EarStudio: Analog volume control. The importance of the analog volume control EarStudio: Analog volume control The importance of the analog volume control RADSONE - 8 June 2017 In every digital audio system, DAC is an essential component which converts digital PCM sample to the

More information

DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS

DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS DELTA MODULATION AND DPCM CODING OF COLOR SIGNALS Item Type text; Proceedings Authors Habibi, A. Publisher International Foundation for Telemetering Journal International Telemetering Conference Proceedings

More information

1 Overview of MPEG-2 multi-view profile (MVP)

1 Overview of MPEG-2 multi-view profile (MVP) Rep. ITU-R T.2017 1 REPORT ITU-R T.2017 STEREOSCOPIC TELEVISION MPEG-2 MULTI-VIEW PROFILE Rep. ITU-R T.2017 (1998) 1 Overview of MPEG-2 multi-view profile () The extension of the MPEG-2 video standard

More information

A prototype system for rule-based expressive modifications of audio recordings

A prototype system for rule-based expressive modifications of audio recordings International Symposium on Performance Science ISBN 0-00-000000-0 / 000-0-00-000000-0 The Author 2007, Published by the AEC All rights reserved A prototype system for rule-based expressive modifications

More information

DIGITAL COMMUNICATION

DIGITAL COMMUNICATION 10EC61 DIGITAL COMMUNICATION UNIT 3 OUTLINE Waveform coding techniques (continued), DPCM, DM, applications. Base-Band Shaping for Data Transmission Discrete PAM signals, power spectra of discrete PAM signals.

More information

Keywords Separation of sound, percussive instruments, non-percussive instruments, flexible audio source separation toolbox

Keywords Separation of sound, percussive instruments, non-percussive instruments, flexible audio source separation toolbox Volume 4, Issue 4, April 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Investigation

More information

Multirate Digital Signal Processing

Multirate Digital Signal Processing Multirate Digital Signal Processing Contents 1) What is multirate DSP? 2) Downsampling and Decimation 3) Upsampling and Interpolation 4) FIR filters 5) IIR filters a) Direct form filter b) Cascaded form

More information

Music Source Separation

Music Source Separation Music Source Separation Hao-Wei Tseng Electrical and Engineering System University of Michigan Ann Arbor, Michigan Email: blakesen@umich.edu Abstract In popular music, a cover version or cover song, or

More information

HEVC/H.265 CODEC SYSTEM AND TRANSMISSION EXPERIMENTS AIMED AT 8K BROADCASTING

HEVC/H.265 CODEC SYSTEM AND TRANSMISSION EXPERIMENTS AIMED AT 8K BROADCASTING HEVC/H.265 CODEC SYSTEM AND TRANSMISSION EXPERIMENTS AIMED AT 8K BROADCASTING Y. Sugito 1, K. Iguchi 1, A. Ichigaya 1, K. Chida 1, S. Sakaida 1, H. Sakate 2, Y. Matsuda 2, Y. Kawahata 2 and N. Motoyama

More information

AUDIOVISUAL COMMUNICATION

AUDIOVISUAL COMMUNICATION AUDIOVISUAL COMMUNICATION Laboratory Session: Recommendation ITU-T H.261 Fernando Pereira The objective of this lab session about Recommendation ITU-T H.261 is to get the students familiar with many aspects

More information

Multichannel source directivity recording in an anechoic chamber and in a studio

Multichannel source directivity recording in an anechoic chamber and in a studio Multichannel source directivity recording in an anechoic chamber and in a studio Roland Jacques, Bernhard Albrecht, Hans-Peter Schade Dept. of Audiovisual Technology, Faculty of Electrical Engineering

More information

OBJECT-AUDIO CAPTURE SYSTEM FOR SPORTS BROADCAST

OBJECT-AUDIO CAPTURE SYSTEM FOR SPORTS BROADCAST OBJECT-AUDIO CAPTURE SYSTEM FOR SPORTS BROADCAST Dr.-Ing. Renato S. Pellegrini Dr.- Ing. Alexander Krüger Véronique Larcher Ph. D. ABSTRACT Sennheiser AMBEO, Switzerland Object-audio workflows for traditional

More information

Linear Time Invariant (LTI) Systems

Linear Time Invariant (LTI) Systems Linear Time Invariant (LTI) Systems Superposition Sound waves add in the air without interacting. Multiple paths in a room from source sum at your ear, only changing change phase and magnitude of particular

More information

Hidden melody in music playing motion: Music recording using optical motion tracking system

Hidden melody in music playing motion: Music recording using optical motion tracking system PROCEEDINGS of the 22 nd International Congress on Acoustics General Musical Acoustics: Paper ICA2016-692 Hidden melody in music playing motion: Music recording using optical motion tracking system Min-Ho

More information

360 degrees video and audio recording and broadcasting employing a parabolic mirror camera and a spherical 32-capsules microphone array

360 degrees video and audio recording and broadcasting employing a parabolic mirror camera and a spherical 32-capsules microphone array 36 degrees video and audio recording and broadcasting employing a parabolic mirror camera and a spherical 32-capsules microphone array Leonardo Scopece 1, Angelo Farina 2, Andrea Capra 2 1 RAI CRIT, Turin,

More information

Implementation of an MPEG Codec on the Tilera TM 64 Processor

Implementation of an MPEG Codec on the Tilera TM 64 Processor 1 Implementation of an MPEG Codec on the Tilera TM 64 Processor Whitney Flohr Supervisor: Mark Franklin, Ed Richter Department of Electrical and Systems Engineering Washington University in St. Louis Fall

More information

Field Tests for Immersive and Interactive Broadcast Audio Production using MPEG-H 3D Audio

Field Tests for Immersive and Interactive Broadcast Audio Production using MPEG-H 3D Audio Field Tests for Immersive and Interactive Broadcast Audio Production using MPEG-H 3D Audio Christian Simon Yannik Grewe Nicolas Faecks Ulli Scuda Cite this article: Simon, Christian, Grewe Yannik, Faecks,

More information

Digital Video Engineering Professional Certification Competencies

Digital Video Engineering Professional Certification Competencies Digital Video Engineering Professional Certification Competencies I. Engineering Management and Professionalism A. Demonstrate effective problem solving techniques B. Describe processes for ensuring realistic

More information

FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS

FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS ABSTRACT FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS P J Brightwell, S J Dancer (BBC) and M J Knee (Snell & Wilcox Limited) This paper proposes and compares solutions for switching and editing

More information

Binaural Measurement, Analysis and Playback

Binaural Measurement, Analysis and Playback 11/17 Introduction 1 Locating sound sources 1 Direction-dependent and direction-independent changes of the sound field 2 Recordings with an artificial head measurement system 3 Equalization of an artificial

More information

Chapter 2 Introduction to

Chapter 2 Introduction to Chapter 2 Introduction to H.264/AVC H.264/AVC [1] is the newest video coding standard of the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture Experts Group (MPEG). The main improvements

More information

Data Representation. signals can vary continuously across an infinite range of values e.g., frequencies on an old-fashioned radio with a dial

Data Representation. signals can vary continuously across an infinite range of values e.g., frequencies on an old-fashioned radio with a dial Data Representation 1 Analog vs. Digital there are two ways data can be stored electronically 1. analog signals represent data in a way that is analogous to real life signals can vary continuously across

More information

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng

The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng The Research of Controlling Loudness in the Timbre Subjective Perception Experiment of Sheng S. Zhu, P. Ji, W. Kuang and J. Yang Institute of Acoustics, CAS, O.21, Bei-Si-huan-Xi Road, 100190 Beijing,

More information

Error Resilient Video Coding Using Unequally Protected Key Pictures

Error Resilient Video Coding Using Unequally Protected Key Pictures Error Resilient Video Coding Using Unequally Protected Key Pictures Ye-Kui Wang 1, Miska M. Hannuksela 2, and Moncef Gabbouj 3 1 Nokia Mobile Software, Tampere, Finland 2 Nokia Research Center, Tampere,

More information

Processing. Electrical Engineering, Department. IIT Kanpur. NPTEL Online - IIT Kanpur

Processing. Electrical Engineering, Department. IIT Kanpur. NPTEL Online - IIT Kanpur NPTEL Online - IIT Kanpur Course Name Department Instructor : Digital Video Signal Processing Electrical Engineering, : IIT Kanpur : Prof. Sumana Gupta file:///d /...e%20(ganesh%20rana)/my%20course_ganesh%20rana/prof.%20sumana%20gupta/final%20dvsp/lecture1/main.htm[12/31/2015

More information

The Implementation Of Ambisonics For Restoring Quadraphonic Recordings

The Implementation Of Ambisonics For Restoring Quadraphonic Recordings The Implementation Of Ambisonics For Restoring Quadraphonic Recordings By: Aristotel Digenis BA (Hons) Recording Arts 2001-2002 SAE Technology College Sydney & Middlesex University London Table Of Contents

More information

The interaction between room and musical instruments studied by multi-channel auralization

The interaction between room and musical instruments studied by multi-channel auralization The interaction between room and musical instruments studied by multi-channel auralization Jens Holger Rindel 1, Felipe Otondo 2 1) Oersted-DTU, Building 352, Technical University of Denmark, DK-28 Kgs.

More information

BeoVision Televisions

BeoVision Televisions BeoVision Televisions Technical Sound Guide Bang & Olufsen A/S January 4, 2017 Please note that not all BeoVision models are equipped with all features and functions mentioned in this guide. Contents 1

More information

Multimedia Communications. Video compression

Multimedia Communications. Video compression Multimedia Communications Video compression Video compression Of all the different sources of data, video produces the largest amount of data There are some differences in our perception with regard to

More information

Motion Video Compression

Motion Video Compression 7 Motion Video Compression 7.1 Motion video Motion video contains massive amounts of redundant information. This is because each image has redundant information and also because there are very few changes

More information

Calibration of auralisation presentations through loudspeakers

Calibration of auralisation presentations through loudspeakers Calibration of auralisation presentations through loudspeakers Jens Holger Rindel, Claus Lynge Christensen Odeon A/S, Scion-DTU, DK-2800 Kgs. Lyngby, Denmark. jhr@odeon.dk Abstract The correct level of

More information

Digital Audio and Video Fidelity. Ken Wacks, Ph.D.

Digital Audio and Video Fidelity. Ken Wacks, Ph.D. Digital Audio and Video Fidelity Ken Wacks, Ph.D. www.kenwacks.com Communicating through the noise For most of history, communications was based on face-to-face talking or written messages sent by courier

More information

Exhibits. Open House. NHK STRL Open House Entrance. Smart Production. Open House 2018 Exhibits

Exhibits. Open House. NHK STRL Open House Entrance. Smart Production. Open House 2018 Exhibits 2018 Exhibits NHK STRL 2018 Exhibits Entrance E1 NHK STRL3-Year R&D Plan (FY 2018-2020) The NHK STRL 3-Year R&D Plan for creating new broadcasting technologies and services with goals for 2020, and beyond

More information

Vocoder Reference Test TELECOMMUNICATIONS INDUSTRY ASSOCIATION

Vocoder Reference Test TELECOMMUNICATIONS INDUSTRY ASSOCIATION TIA/EIA STANDARD ANSI/TIA/EIA-102.BABC-1999 Approved: March 16, 1999 TIA/EIA-102.BABC Project 25 Vocoder Reference Test TIA/EIA-102.BABC (Upgrade and Revision of TIA/EIA/IS-102.BABC) APRIL 1999 TELECOMMUNICATIONS

More information

Implementation of MPEG-2 Trick Modes

Implementation of MPEG-2 Trick Modes Implementation of MPEG-2 Trick Modes Matthew Leditschke and Andrew Johnson Multimedia Services Section Telstra Research Laboratories ABSTRACT: If video on demand services delivered over a broadband network

More information

Advance Certificate Course In Audio Mixing & Mastering.

Advance Certificate Course In Audio Mixing & Mastering. Advance Certificate Course In Audio Mixing & Mastering. CODE: SIA-ACMM16 For Whom: Budding Composers/ Music Producers. Assistant Engineers / Producers Working Engineers. Anyone, who has done the basic

More information

Detection and demodulation of non-cooperative burst signal Feng Yue 1, Wu Guangzhi 1, Tao Min 1

Detection and demodulation of non-cooperative burst signal Feng Yue 1, Wu Guangzhi 1, Tao Min 1 International Conference on Applied Science and Engineering Innovation (ASEI 2015) Detection and demodulation of non-cooperative burst signal Feng Yue 1, Wu Guangzhi 1, Tao Min 1 1 China Satellite Maritime

More information

Dual frame motion compensation for a rate switching network

Dual frame motion compensation for a rate switching network Dual frame motion compensation for a rate switching network Vijay Chellappa, Pamela C. Cosman and Geoffrey M. Voelker Dept. of Electrical and Computer Engineering, Dept. of Computer Science and Engineering

More information

How to use the DC Live/Forensics Dynamic Spectral Subtraction (DSS ) Filter

How to use the DC Live/Forensics Dynamic Spectral Subtraction (DSS ) Filter How to use the DC Live/Forensics Dynamic Spectral Subtraction (DSS ) Filter Overview The new DSS feature in the DC Live/Forensics software is a unique and powerful tool capable of recovering speech from

More information

Audio-Based Video Editing with Two-Channel Microphone

Audio-Based Video Editing with Two-Channel Microphone Audio-Based Video Editing with Two-Channel Microphone Tetsuya Takiguchi Organization of Advanced Science and Technology Kobe University, Japan takigu@kobe-u.ac.jp Yasuo Ariki Organization of Advanced Science

More information

h t t p : / / w w w. v i d e o e s s e n t i a l s. c o m E - M a i l : j o e k a n a t t. n e t DVE D-Theater Q & A

h t t p : / / w w w. v i d e o e s s e n t i a l s. c o m E - M a i l : j o e k a n a t t. n e t DVE D-Theater Q & A J O E K A N E P R O D U C T I O N S W e b : h t t p : / / w w w. v i d e o e s s e n t i a l s. c o m E - M a i l : j o e k a n e @ a t t. n e t DVE D-Theater Q & A 15 June 2003 Will the D-Theater tapes

More information

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP- 59 Measurement and Management of Loudness in Soundtracks for Television Broadcasting

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP- 59 Measurement and Management of Loudness in Soundtracks for Television Broadcasting Page 1 of 10 1. SCOPE This Operational Practice is recommended by Free TV Australia and refers to the measurement of audio loudness as distinct from audio level. It sets out guidelines for measuring and

More information

Engineering in Recording

Engineering in Recording Engineering in Recording Jez Wells Audio Lab - Department of Electronics University of York, Heslington, York, UK ISEE, University of Sheffield, July 2012 1 Overview Is Recording Engineering? Some definitions

More information

RECOMMENDATION ITU-R BT.1201 * Extremely high resolution imagery

RECOMMENDATION ITU-R BT.1201 * Extremely high resolution imagery Rec. ITU-R BT.1201 1 RECOMMENDATION ITU-R BT.1201 * Extremely high resolution imagery (Question ITU-R 226/11) (1995) The ITU Radiocommunication Assembly, considering a) that extremely high resolution imagery

More information

THE PSYCHOACOUSTICS OF MULTICHANNEL AUDIO. J. ROBERT STUART Meridian Audio Ltd Stonehill, Huntingdon, PE18 6ED England

THE PSYCHOACOUSTICS OF MULTICHANNEL AUDIO. J. ROBERT STUART Meridian Audio Ltd Stonehill, Huntingdon, PE18 6ED England THE PSYCHOACOUSTICS OF MULTICHANNEL AUDIO J. ROBERT STUART Meridian Audio Ltd Stonehill, Huntingdon, PE18 6ED England ABSTRACT This is a tutorial paper giving an introduction to the perception of multichannel

More information

decodes it along with the normal intensity signal, to determine how to modulate the three colour beams.

decodes it along with the normal intensity signal, to determine how to modulate the three colour beams. Television Television as we know it today has hardly changed much since the 1950 s. Of course there have been improvements in stereo sound and closed captioning and better receivers for example but compared

More information