A Real-time Framework for Video Time and Pitch Scale Modification

Size: px
Start display at page:

Download "A Real-time Framework for Video Time and Pitch Scale Modification"

Transcription

1 Dblin Institte of Technology Conference papers Adio Research Grop A Real-time Framework for Video Time and Pitch Scale Modification Ivan Damnjanovic Qeen Mary University London Dan Barry Dblin Institte of Technology, dan.barry@dit.ie David Dorran Dblin Institte of Technology Josh Reiss Qeen Mary University London Follow this and additional works at: Part of the Signal Processing Commons Recommended Citation Damnjanovic, I. et al. (2008) A Real-Time Framework for Video Time and Pitch Scale Modification. Proc. of the 11th International. Conference on Digital Adio Effects (DAFx-08), Espoo, Finland, September 1-4, This Conference Paper is broght to yo for free and open access by the Adio Research Grop at ARROW@DIT. It has been accepted for inclsion in Conference papers by an athorized administrator of ARROW@DIT. For more information, please contact yvonne.desmond@dit.ie, arrow.admin@dit.ie, brian.widdis@dit.ie. This work is licensed nder a Creative Commons Attribtion- Noncommercial-Share Alike 3.0 License

2 MM R2 1 A Real-time Framework for Video Time and Pitch Scale Modification Ivan Damnjanovic, Dan Barry, David Dorran, and Josha D. Reiss Abstract- A framework is presented which addresses the isses related to the real-time implementation of synchronised video and adio time-scale and pitch-scale modification algorithms. It allows for seamless real-time transition between continally varying, independent time-scale and pitch-scale parameters arising as a reslt of manal or atomatic intervention. We illminate the problems which arise in a real-time context as well as provide novel soltions to prevent artefacts, minimise latency, and improve synchronisation. The time and pitch scaling approach is based on a modified phase vocoder with optional phase locking and an integrated transient detector which enables high qality transient preservation in real-time. A novel method for adio/visal synchronisation was implemented in order to ensre no perceptible latency between adio and video while real-time time scaling and pitch shifting is applied. Evalation reslts are reported which demonstrate both high adio qality and minimal synchronisation error. Index Terms Time scale modification, Adio/visal synchronisation, adaptive video refresh rate I. INTRODUCTION Synchronised adio and video time stretching is often sed in video editing and prodction whenever video content needs to be sped p or slowed down either as a creative effect or to fit certain time slots within a programme schedle, as is the case in television advertisements. Time-scale modification (TSM) is typically sed to change the tempo of msical content or the playback rate of speech withot affecting pitch content. Conversely, pitch-scale modification (PSM) algorithms enable pitch shifting withot affecting the playback rate of the adio content. A significant amont of research has been dedicated to both TSM and PSM yielding a variety of time and freqency domain algorithms. Despite this abndance of literatre and readily available Copyright (c) 2008 IEEE. Personal se of this material is permitted. However, permission to se this material for any other prposes mst be obtained from the IEEE by sending a reqest to pbs-permissions@ieee.org Manscript received Jne 11, This work was spported in part by the Eropean Commnity nder the Information Society Technologies (IST) programme of the 6th FP for RTD - project EASAIER contract IST I. Damnjanovic is with Qeen Mary U. of London, London, E14NS, UK (telephone: , ivan.damnjanovic@elec.qml.ac.k). Dan Barry. Athor is with the Adio Research Grop in the Dblin Institte of Technology, Kevin Street, Dblin 8, Ireland (telephone: , dan.barry@dit.ie). David Dorran. is with the Adio Research Grop in the Dblin Institte of Technology, Kevin Street, Dblin 8, Ireland (telephone: , e- mail: david.dorran@dit.ie). Josh Reiss is with Qeen Mary University of London, London, E14NS, UK (telephone: , josh.reiss@elec.qml.ac.k). commercial applications, there is still a lack of information, nderstanding and consideration for real-time implementations of TSM and PSM algorithms. Here we illminate some of the problems which arise in a real-time context as well as provide novel soltions to these isses. A real-time software based framework is presented, which allows time stretching of adio content within digital video streams whilst maintaining synchronisation with the video content. Time-scale changes can be made in real-time with almost nperceivable latency and no transitional artefacts. In addition, the approach also spports real-time pitch shifting of the adio content independent of time-scale changes. The approach is based on a modified phase vocoder with optional phase locking and an integrated transient detector which enables high qality transient preservation in real-time. Within this article, emphasis is given to adio/visal synchronisation isses which arise in sch a framework. Despite the growth in algorithms for independent adio time or pitch modification, there are relatively few applications which address combined time stretching of video and adio. In [1], a method for adjsting video playback rate to compensate for network delay is presented. Similarly, [2] presents an adaptive method for video playback, intended to address isses concerning packet loss and random delays in streaming applications. Their method ses adio time scaling when the streamed video playback speed is modified, as sggested for packet loss in voice commnication [3]. Synchronised adio and video time scaling is typically sed in video editing and prodction whenever video content needs to be sped p or slowed down either as a creative effect or to fit certain time slots within a programme schedle. For example, TSM can be sed to alter the dration of an advertisement whilst preserving the pitch and timbre of speech and other adio content. Experiments have shown that increasing the information rate in commercials is more engaging and more favorable to viewers. In [4], it was sggested that an increase in the rate of information of p to 130 percent of the typical speech rate can significantly increase the impact of advertisements. The driving force for the work presented here on real-time synchronised adio/video time-stretching comes from ser reqirements and ser feedback in msic edcation research [5, 6], which indicated that time-scaled video wold be desirable in applications related to aral learning, msic transcription and msical techniqe analysis. The effects of adio/video time-compression and expansion on the learning

3 MM R2 2 process have been thoroghly stdied [4-8]. Besides time efficiency benefits, it was shown that learning from accelerated material can be at least eqally as effective as the normal speed of presentation. There were frther findings that stdents watching accelerated material stay more focsed. At normal speech rates they become bored and their attention begins to wander [7], and learning processes benefit from acceleration of presentation as long as intelligibility can be maintained [8]. For entertainment applications, internet video streaming, digital video players and set-top devices can benefit greatly from an adio/video time stretching tool. Stdies of digital video browsing [9] noted that one of the highest rated enhanced featres was watching time compressed video. II. AUDIO TIME-SCALE MODIFICATION Time-scale modification can be achieved in a nmber of ways in both the time and freqency domain. However, time domain approaches are typically not considered ideally sited to mixed adio content, which may inclde speech, polyphonic msic and ambient noise. As sch, the real-time time-scale modification techniqe proposed here is based on a set of modifications to the phase vocoder [10], a poplar freqency domain approach to time-scaling. A comprehensive ttorial otlining the theory of the traditional phase vocoder is presented in [11] and a brief description is provided here. The Forier transform interpretation of the phase vocoder is mathematically eqivalent to a short time Forier transform (STFT) [12] which segments the analysed signal into overlapping frames which are separated by a certain hop size. Within phase vocoder implementations, TSM is achieved by varying the analysis hop size R a with respect to the resynthesis hopsize R s sch that the time scaling factor is calclated as α=r s /R a. It follows then that R a >R s will reslt in timescale compression (speed p), and R a <R s will reslt in timescale expansion (slow down). Within the phase vocoder, analysis frames are remapped along the time axis reslting in newly constrcted synthesis frames, each with a modified phase spectrm, to ensre that the synthesis frames maintain phase coherence throgh time. Since the phase spectrm of each frame mst be modified, the windowing fnction will also be affected. For this reason, a resynthesis window is necessary and a 75% overlap is recommended to avoid modlation at the otpt. This will reslt in the otpt having a constant gain factor of approximately 1.5 which can easily be compensated by mltiplying all samples by the reciprocal of the gain factor. An overlap of 75% corresponds to a fixed synthesis hop size, R s, of N/4 samples. In order for the synthesis frames to overlap synchronosly, the frame phases mst be pdated sch that phase continity is maintained between adjacent otpt frames. The standard method sed to calclate sitable synthesis phases involves calclation of the instantaneos freqency of each bin in radians per sample. Having obtained the instantaneos freqency, it is possible to predict the expected phase of any component for a given synthesis hop size. Given that the freqency content of both msic and speech is stationary only over short periods, phase estimates will decrease in accracy as the hop sizes increase. The most accrate way to estimate phase for each component is by first calclating the principal argment of the heterodyned phase increment between adjacent analysis frames as defined in [10, 11]. The instantaneos freqency is then calclated in radians per sample. In order to calclate the phase spectrm for the new synthesis frame at the time scaled otpt, the instantaneos freqency is mltiplied by the synthesis hop size R s, and added to the resltant synthesis phases from the previos frame. This is known as phase propagation or phase pdating. The newly modified phases along with the original magnitde spectrm are then sed to reconstrct the adio frame. Althogh, the time scaled otpt is horizontally phase coherent at this point, the timbral qality is often described as sonding phasey or distant and is generally not regarded as natral sonding. Particlarly noticeable is how transients are affected by the phase vocoder. These artifacts can be attribted to the fact that the standard phase vocoder only attempts to achieve an optimal phase relationship between adjacent frames, known as horizontal phase coherence. However, the prsit of horizontal phase coherence has a profondly negative effect on vertical phase coherence, which describes the relationship between the phases of freqency components within a single frame. Maintaining vertical phase coherence is an important consideration in order to achieve natral sonding TSM. The improved phase vocoder [13] explicitly attempts to identify sinsoidal freqency bins in FFT frames by a peak picking process within the magnitde spectrm. The phases of these trly sinsoidal peak freqency bins are then pdated in the traditional manner, i.e., by maintaining horizontal phase coherence between corresponding peak freqency bins of sccessive frames. The non-sinsoidal freqency bins are then pdated by maintaining the phase difference that existed between each bin and its closest peak/sinsoidal freqency bin. The process is known as peak locking. III. REAL-TIME CONSIDERATIONS FOR DYNAMIC TIME- SCALING When a fixed time-scale factor is applied to an entire adio signal both R a and R s remain fixed. In which case, the position in time of any analysis or synthesis frames can be defined as t a =R a and t s =R s, respectively, where is an incrementing integer representing a seqence of frames as in [10]. For realtime implementations, where the time-scale factor, α, may be varying dynamically de to ser intervention, this definition will introdce distortions into the time-scaled otpt since the analysis hop is no longer fixed. The soltion is to redefine 1 ta = Ra as ta = ta + α Rs. This ensres that the crrent analysis frame position t a, is always pdated correctly. The position in time of the crrent analysis frame is always related to both the previos analysis frame and the crrent time scaling factor, α. Althogh it is favorable to vary the analysis hop R a and fix the synthesis hop R s to achieve TSM, it can reslt in inaccrate

4 MM R2 3 freqency estimation for time-scaling factors α < 1. When the signal is being sped p, the distance between analysis frames exceeds N/4. It becomes impossible to accrately predict the amont of phase nwrapping to be applied dring the freqency estimation stage of the horizontal phase pdate procedre described in [10, 11], reslting in inaccrate synthesis phase estimates. In addition to this, when α is varied over time, the accracy of the instantaneos freqency estimates also varies. This leads to momentary artefacts whenever the time scale factor, α, is changed. Effectively, the transitions between frames with different TSM factors are not perceptally smooth despite the windowed overlapping scheme. The soltion to both of these problems is to ensre that the instantaneos freqency estimates are always derived sing the phase differences between the crrent analysis frame and a frame one synthesis hop back from the position of the crrent analysis frame, X ( ta Rs, Ω k). Althogh, an extra FFT and an extra bffer is reqired to obtain the phases of this frame, it garantees that phase nwrapping errors will not be present and that the instantaneos freqency estimates will be consistent regardless of variation in α. The phase pdate eqation [10, 11] is now redefined in (1). 1 Yt ( s, Ω k) = Yt ( s, Ω k) + Xt ( a, Ωk) Xt ( a Rs, Ω k) (1) When vertical phase coherence is to be maintained, peak locking can be sed, and only the sinsoidal or peak freqency bins are pdated sing (1), with all other bins pdated as in [10, 11]. This method of phase pdating removes the need to estimate the instantaneos freqency. However, for the case where pitch scale modification is reqired, calclation of instantaneos freqency is still necessary. Nonetheless, the hop-back method described above is sed to avoid phase nwrapping errors and to maintain smooth pitch and time scale transitions. This will be discssed in the next section. A similar phase pdate procedre was proposed in [14] in which time-scale modification is achieved throgh the insertion and deletion of entire frames. Since the approach we propose here ses a variable analysis hop size, it has the advantage of maintaining better estimates of the magnitde spectrm, thereby greatly redcing the possibility of removing or repeating perceptally salient characteristics within the time-scaled signal. IV. REAL-TIME PITCH SHIFTING The simplest method to shift the apparent pitch of a signal is by interpolating or decimating the time domain signal. The reslting signal, althogh pitch shifted, is also shortened or lengthened by the reciprocal of the interpolation/decimation factor β. A common techniqe sed to shift the pitch and maintain dration is to pitch scale the signal sing interpolation/decimation, and apply complimentary time scale modification to restore the original length of the signal. This is easily achieved in the offline context bt becomes difficlt to implement in a real-time context. If both pitch shifting and time scaling are reqired simltaneosly, the problem becomes more difficlt since time scaling is reqired for 2 alternate operations (pitch and time scaling) within the same frame. When the signal is both time scaled and interpolated for any time scaling factor α and pitch shifting factor β, the reqired compensatory time scale factor sch that the resltant signal is both the reqired pitch and length [15], is simply αβ. In a real-time context the pitch and time scaling mst be carried ot within a single frame interval (in this implementation 23ms). Two isses arise. First, the comptational reqirements are directly related to the prodct of α and β, since each frame mst now be time-scaled internally to compensate for pitch shifting. This makes realtime operation nfeasible for large prodcts αβ. Second, the length of the resltant frame is no longer fixed. An additional bffer mst be sed in order to handle the overflow if the resltant frame exceeds N (analysis frame size) samples. If αβ<1, the resltant frame will be smaller than the reqired N samples. In this case, more inpt frames need to be processed ntil there are sfficient samples to generate an otpt frame. These isses can make the otpt npredictable, added to which the soltions are comptationally intensive. Here we present a novel method for real-time pitch shifting which resolves the problematic isses raised above. The comptational reqirements are not dependent on α and β and the method garantees that a fixed frame length can be generated independent of the time and pitch scale factors sed. No inter-frame time scaling and no additional bffers are reqired. The pitch shifting is performed sing linear resampling in the time domain, and phase vocoder theory is then applied sing a modified phase pdate eqation which incorporates the pitch scaling factor β. In order to generate a pitch shifted frame of known length, we interpolate or decimate the inpt time domain signal over the range t to a ta + Nβ, where N is a fixed analysis frame size chosen to ensre adeqate freqency resoltion. This reslts in a time domain frame of length N which has been generated by interpolating or decimating Nβ samples by the pitch scaling factor β. Figre 1 illstrates this procedre. This frame now constittes an analysis frame which can have arbitrary time scaling applied sing the phase pdate eqations presented below. Figre 1. The real-time re-sampling method sed for obtaining fixed length pitch shifted frames. A illstrates no pitch change, B pitching down and C pitching p.

5 MM R2 4 The goal is to estimate the phase propagation reqired to allow sccessive interpolated frames to be pdated sch that the pitch shifted and time scaled otpt is horizontally phase coherent. Recall (1), which was introdced as a preferred method to ensre reliably wrapped phase difference estimates. This was achieved by sing an extra FFT to estimate the phases of the frame exactly one synthesis hop back from the crrent analysis frame, thereby allowing the phase differences to be estimated over a known fixed interval eqal to R s. The apparent analysis hop is now eqal to the synthesis hop, bt the actal vale of R a is still variable. In order to estimate sitable synthesis phases for pitch shifted frames, the instantaneos freqency mst be calclated as follows. A new method to calclate the heterodyned phase increment for pitch shifted frames is given by (2), where the interpolation factor, β, is now inclded in the eqation. ΔΦ p = Xt (, Ωk) Xt ( Rs, Ωk) RsΩ k/ β (2) k a a k where X( t a, Ω ) and X( ta Rs, Ωk) represent the phases of the crrent analysis frame and an analysis frame exactly one synthesis hop back from the crrent vale of t a. The reslting term, ΔΦ p k, is then the principle argment of the heterodyned phase increment of the pitch shifted frame sch that it is in the range -π to π. Since the frames have been interpolated or decimated (reslting in freqency shifts) they will no longer exhibit the expected phase derivatives over a given hop, R s. To calclate the correct phase increment, the hop mst also be mltiplied by the reciprocal of the pitch scaling factor, β. The instantaneos freqency in radians per sample of the pitch shifted frame is given by (3). ˆ ( ωk ta) =Ω k+ βδφ p k / Rs (3) As opposed to the standard method [10, 11], we divide the phase deviation by R s instead of R a, becase the method sed to calclate phase difference in (3) ses two frames separated by a fixed distance, R s. The standard phase pdate eqation [10, 11] can now be sed, and peak locking can be applied as discssed previosly. The advantages of sing (1) for phase pdating have already been incorporated in (2) above. We now have modified phase vocoder eqations which allow realtime pitch shifting and time stretching simltaneosly. A key advantage of sing this method for pitch shifting is that compensatory time scaling is not reqired. Instead, the pitch scaling factor is incorporated in the phase pdate eqations. This garantees that the comptational load remains fixed and predictable for any combination of time and pitch scaling factors. V. REAL-TIME TRANSIENT PRESERVATION Althogh peak locking contribtes to maintaining the timbral qality of transients dring TSM, transients shold not be time-scaled if a natrally sonding otpt is reqired. An off-line soltion was proposed in [16]. The approach taken here is to identify transients atomatically in real-time. Upon detection of a transient, the time scale factor α is retrned to 1 (no scaling), and the analysis phases are mapped directly to the synthesis phases (phase locking) for the dration of the transient. When the transient has passed, the time scale factor is atomatically reset to the α vale prior to the transient. Transients represent an ideal place to lock the phases since any discontinities introdced to the time scaled signal will be masked by the transient itself. In order to identify an analysis frame as a transient [17], the log difference of each freqency component between consective frames is calclated as in (4). This measre effectively tells s how rapidly the spectrogram is flctating. X( ta, k) X ( t, k) = 20log, 1 k N /2 (4) ( R, k) f a 10 X ta s where X f ( ta, k ) is the log energy difference between frames separated by R s, and t a is the crrent analysis frame instant. In order to detect the presence of a transient we define a measre given in (5). N /2 Pt ( a, k) = 1 if X f ( ta, k) > T 1 Pe( ta ) = k = 1 Pt ( a, k) = 0 otherwise (5) where, T 1 is a threshold which signifies the rise in energy, measred in db, which mst be detected within a freqency channel before it is deemed to belong to an onset. In order for the frame to be declared a transient, Pe( t a ) mst exceed a second threshold T 2. In practice we have fond that T 1 =6dB and T 2 =3N/8 give satisfactory reslts for most poplar msic. Ths, a transient is detected at frame t a, if at least 75% of the bins in the log difference spectrogram, eqation (4), exceed a vale of 6dB. Note that sing this measre, the energy present in the signal is not the defining factor of the transient. Instead, we assign the transient probability, Pe( t a ), sing a measre of how broadband or percssive the onset is [17]. This is based on the nmber of bins exhibiting a positive first derivative as described by eqation (5). Figre 2 shows the effectiveness of this approach. Despite the fact that the signal itself has little dynamic range, the featre detector is rarely prone to false detections which makes it ideal for transient detection in time scaling. Frthermore, it can easily be implemented within the crrent framework since the only reqirement is that the crrent and previos frame magnitdes are available. Figre 2. A highly dynamically compressed signal containing rock msic is depicted in the top plot. The bottom plot shows the otpt of the percssive onset detector. Upon detecting a transient, the time scale factor, α, is atomatically retrned to 1, inhibiting TSM momentarily.

6 MM R2 5 We term this method transient hopping. In addition the frame phases are locked and the frame is mapped directly to the otpt. This mechanism preserves the transient and ensres that it is reprodced naffected at the otpt. Since we se 75% overlap, R s = 1024 for analysis frame length 4096, a short transient will exist in 4 consective frames. In order to preserve the transient correctly, the TSM factor, α, mst remain at a vale of 1 ntil all overlapping frames have passed the transient. Since the local time scale factor is redced, a time scale compensation factor is applied after each transient. Eq. (6) describes this action: T 1 if < 4 T αm α T α = if 4 < NF + 4 (6) m α α otherwise T where α is the global time scale factor, α is the TSM factor to be applied dring the frames preceding the transient, and where m is the maximm desired TSM factor and m mst be strictly greater than 1. The nmber of frames, N F, in which the time scale compensation factor mst be applied after the transient, is dependent on the maximm timescale factor, sch that NF = 4m -4. Using a larger nmber of frames to compensate for the transient has the advantage that smaller TSM factors may be distribted over a longer time period, ths redcing signal distortion de to excessive timescale factors. Figre 3 illstrates how the time scaling factor is varied before and after the transient in order to both preserve the transient and to maintain a constant global time scale factor. concern. For reasons discssed in previos sections, a 75% overlap is recommended. This effectively means that at any one time instant, 4 analysis frames are actively contribting to the crrent otpt frame. Figre 4. The relationship between inpt and otpt frames for α=1. In Figre 4, the adio to be processed is divided into overlapping frames of length N. In order to otpt a processed frame, 4 fll frames wold need to be processed and overlapped. This leads to considerable latency from the time a parameter change is affected to the time when its effects are adible at the otpt. However, given that the synthesis hop size is fixed at R s =N/4, we can load and process a single frame of length N, otpt ¼ of the frame, and retain the rest in a bffer to overlap with adio in sccessive otpt frames. To do this, a bffer of length N is reqired in which the crrent processed frame (with synthesis window applied) is placed. Three additional bffers of length 3N/4, N/2 and N/4 will also be reqired to store remaining segments from the 3 previosly processed frames. Each otpt frame of length N/4 is then generated by smming samples from each of these 4 bffers. Figre 5 shows how the bffer scheme works. On each iteration, a fll frame, F, of length N is processed and placed in bffer 1. The remaining samples from the 3 previos frames occpy bffers 2, 3 and 4. The reqired otpt frame of length N/4, S, is generated as defined in (7). Figre 3. Time scale factor as a fnction of transient detection VI. BUFFER SCHEMES One of the key isses in a real-time implementation of TSM is the choice of bffer scheme and for completeness sake we sggest a sitable scheme here. In offline processing, the entire signal is overlapped and concatenated before playback. However, in a real-time environment, a constant stream of processed adio mst be otptted and consective otpt frames mst be continos. In order for seamless concatenation, the bondaries of each otpt frame mst be at the constant gain associated with the overlap factor in order to avoid modlation. The method presented below addresses this Figre 5. Real-time otpt bffer scheme sing a 75% overlap. The gray arrows indicate how each segment of each bffer is shifted after the otpt frame has been generated S ( n) = F ( n) + F ( n+ N /4) + F ( n+ N /2) + F ( n+ 3 N /4) (7) n 1 n N /4 From (7), it can be seen that the otpt frame, S (n), is generated by smming the first N/4 samples form each bffer. Once the otpt frame has been generated and otptted, the first N/4 samples in each bffer can be discarded. The data in all bffers mst now be shifted in order to prepare for the next iteration. The gray arrows in Figre 5 illstrate how each segment of each bffer is shifted in order to accommodate a newly processed frame in the next iteration. The order in which the bffers are shifted is vital. Bffer 4 is filled with the remaining N/4 samples from bffer 3, bffer 3 is then filled

7 MM R2 6 with the remaining N/2 samples from bffer 2, and finally bffer 2 is filled with the remaining 3N/4 samples from bffer 1. Bffer 1 is now empty and ready to receive the next processed frame of length N. The reslt of this scheme, is that ¼ of a processed frame will be otptted at time intervals of R s, which is eqal to N/4 samples. Using the sggested frame size of 4096 samples, the otpt will be pdated every 1024 samples which is approximately eqal to 23.2 milliseconds. The adio will be processed with newly pdated parameters every 23.2 milliseconds, bt the latency will be larger than this and depends on the time reqired to access and write to hardware bffers in the adio interface. In general however, it is possible to achieve latencies < 40ms. VII. SYNCHRONISATION WITH THE HOST APPLICATION The reqirement to synchronise independent time and pitch scaling with video and screen pdates adds additional complexity. To maintain mltimedia synchronisation, the time scaling process shold control the master clock within an application. In this section, we present a real-time media synchronisation framework which has made this possible Previos sections have described in detail the adio processing blocks reqired to achieve real-time time and pitch scaling simltaneosly. Figre 6 shows how the overall system is configred. Figre 6. Overview of clocking between time/pitch scale modification and host application. Firstly, it is important to note that, in order to allow time scale modification to be carried ot in real-time whilst maintaining synchronisation with other media sch as video or screen pdates, e.g., the adio locators, it is necessary to pass fll control of the host clock to the time scaling algorithm. This is becase time scaling by its very natre involves maniplation of the time base of the adio. As described previosly, the time increment between frames is prely dependent on the choice of time scale factor. Frthermore, if we wish to continosly vary the time-scale factor, the time line becomes non linear at transition points. Essentially, the time scale algorithm mst be able to reqest any adio frame, starting at any sample point within the adio stream. With this in mind, the first stage involves loading an adio frame defined by the time scale algorithm itself. Immediately following this, the first stage of pitch shifting is achieved by interpolating or decimating the inpt waveform by the pitch scaling factor. Regardless of time or pitch scale factor, one fll frame is always poplated on every iteration. For example, sing a pitch scale factor of 2, 2N samples will be interpolated to prodce an N sample frame where N is the frame size. If the frame is identified as a transient, no frther processing is applied, and time scaling is sspended for 4 frames (de to 75% overlap). The frames arond a transient are reprodced at the otpt identical to that of the inpt and the adio clock is pdated as normal. If no transient has been detected, the phases are pdated according to the modified phase pdate eqations. Pitch shifting is only completed at this stage since the phase pdate procedre needs to inclde the interpolation factor. Following this, the processed adio frame is reprodced and re-windowed. The adio clock is then pdated, and the frame incremented by a varying factor depending on the ser inpt (i.e., TSM factor). In order to prodce a continos stream of adio, the bffer scheme described above is sed. Regardless of what processing is carried ot by the timescaling algorithm, it is solely responsible for pdating the host clock. The host then ses this information to pdate screen components which depend on adio playback position. Ths, all screen components, processes and visalisations are synchronised with the adio clock which is controlled by the time-scale modification algorithm. VIII. VIDEO SYNCHRONISATION Combined adio/visal artefacts that can be introdced de to loss of synchronisation are often the most perceptally ndesirable. Failre to keep adio and video streams synchronised, known as lip sync errors, reslt in adio events occrring before or after the associated video frames. When adio advances video by 20ms or when adio lags video by 40ms, it becomes detectible. Errors of +40ms and - 160ms are sbjectively annoying as reported by the International Telecommnications Union (ITU) in 1993[18]. Frther research reported in ITU-R BT [19], showed reliable detection of 45ms adio leading and 125ms adio lagging, while the acceptability region is even wider. The ITU recommends that the difference between adio and video shold be no less than -90ms and no more then +185ms. In reality, this range is probably too wide for acceptable performance. For example, in video footage of msical instrments being played, key strokes or string plcks are more precise than lip movement dring speech, so the synchronisation thresholds need to be redced. In addition, when a video has been stretched it can be easier to analyse and therefore synchronisation errors become more perceivable. In this section, three approaches to the preservation of adio/video synchronisation in time scaling applications will be presented. Insertion and deletion of frames is necessary when the frame rate is dictated by the playback device. Television standards sch as PAL/SECAM and NTSC se

8 MM R2 7 standardised refresh rates and hence the otpt of a time stretching modle mst maintain a corresponding frame rate. However, many software implementations of video players, inclding mplayer, VLC player and others, allow for change of the video rate once the compressed video is npacked. Screen refresh rate of modern eqipment is in the range of Hz, so variations in the frame rate can be introdced by choosing when a particlar video frame will be shown on the display device. Hence, less noticeable artefacts and smoother pictre transition can be obtained when variable frame rate, the second method, is applied. The third method, Adaptive Video Refresh Rate (AVRR), relies on the precision of the adio clock. Synchronisation is maintained by ensring that the video time code remains locked to the adio time code within an allowable threshold. Video time stretching for conventional broadcast ses insertion and deletion of frames to maintain synchronisation. When speeding p the video, some frames need to be dropped, whilst when slowing down some need to be dplicated. When frames are dplicated or dropped, maximal synchronisation error is half of a video frame length, since we rond to the closest frame. Hence, if the frame rate is 25 fps, maximal error will be 20 ms. This error range (-20ms to +20ms) meets ITU recommendations for lip sync error to be ndetectable. However it may not be good enogh for more demanding applications sch as time stretching of video, when precise movements are slowed and become easier to analyse. In addition, frame dplication can case jerkiness to be perceived in the video of slow steady movements. Changing the video frame rate by the scaling factor will generally give a smoother image since frames are eqally spaced in time. The additional advantage is that no frames are dropped when speeding p. Ideally, timing for a new frame is easy to calclate by advancing the previos frame time by the new frame rate interval. However, de to the fact that timing precision is inflenced by factors sch as temperatre and hmidity, simply setting-p the next frame to display a given period after the previos frame withot comparing it to a master clock can case long term synchronisation errors. The AVRR method refreshes the display with a new frame when the video time code is eqal to (or within a threshold of) the original time code of the adio frame being otptted. The refresh rate is adaptive since the period between two frames adapts to the adio clock. Ideally, it shold be eqal to the reciprocal of the scaled frame rate, bt will oscillate arond that vale. We define here two time-lines; one is the media player s actal time-line and the other is the original media time-line. It is crcial for this method to calclate precisely the time on the media time-line of the adio sample crrently being played. This time vale is then compared with the original time code associated with non-time scaled video frames and the display is refreshed with this frame when the video frame time code is smaller than or eqal to the time of the adio sample that is crrently being otptted. To minimise loss of synchronisation de to comptationally intensive processing, the decoding algorithm needs to be efficient and implemented in a separate high priority thread. Figre 7. Video time scaling implementation.

9 MM R2 8 The video-synchronised time stretching algorithm described above was implemented as presented in Figre 7, and intended for a demanding application reqiring fast access to adio frames while other intensive processing tasks are performed. Here, the adio stream is first ncompressed and stored locally in an adio inpt bffer. Unlike adio however, ncompressed video wold reqire an nacceptably large local bffer, so video packets are accessed directly from the compressed stream. Since video decoding is done on-line, particlar consideration was given to its implementation. Higher time compression rates will demand that video frames be decoded and scaled mch faster than sal. Hence, the video decoding is carried ot together with video zooming in a separate high priority thread. The video decoding thread receives two control inpts from the ser interface. Video zoom factors, changeable from the interface, are sent directly to the video scaler, which scales a frame according to a zoom factor and sends it to the video display bffer. Change of playback position is sent to the decoder and it instrcts the decoder to seek the stream and also to erase any previosly decoded frames from video display bffer. The time-stretching factor is sent to the adio processing engine in order to change the analysis hop size, and the adio otpt frame timestamp is calclated accordingly. However, this timestamp is not sfficient for proper A/V synchronisation, since it represents the time when the adio frame is sent to the adio hardware bffer. For example, if an adio frame is 1024 samples and the sample rate is Hz, the time resoltion will be 23.2 ms. For the normal playback speed, this may be sfficient, bt in the case of dobling the playback speed the time span between two adio sample points on the media timeline becomes 46.4 ms. Hence, some measre of fllness of the adio hardware bffer needs to be introdced for precise timing of otptted adio samples. The fllness of the hardware adio bffer is hardware dependent and measring it is often a complex task, so we propose to find approximate timing of the adio sample by measring the time difference (Δt) between the moment the adio frame is sent to the hardware bffer and the crrent time. This vale is then added to the timestamp of the adio frame that was sent to the adio bffer (T adio ), and is then compared with the video frame timestamp (T video ). The display is refreshed with this frame when the video frame time code is smaller than or eqal to the calclated adio time: Tvideo Tadio +Δ t (8) Another isse is timer precision for measring Δt. In Windows OS, the maximal precision that can be achieved with the standard timer is 15ms, which is hardly enogh for a synchronisation application. Hence, Δt is determined by measring CPU conts from the moment the frame is sent to the hardware bffer and then dividing by the CPU cont freqency. Since Δt gives a vale related to the real playback time-line, it is transposed to the media time line by dividing it by the time-stretching factor α: 1 CNTcp Δ t = (9) α fcp However, both variable frame rate and adaptive video refresh rate have the potential disadvantage that at higher time scale factors, since more frames are displayed per second, frames need to be decoded mch faster. Synchronisation can be lost if a frame is not decoded within a frame interval, so a preferred soltion is to combine AVRR with frame dropping when loss of synchronisation occrs. In or implementation, whenever the video lag exceeds 20 ms, the application instrcts the decoder not to decode the following frame, and retrns to fll decoding when the lag retrns to nder 10 ms. IX. AUDIO QUALITY EVALUATION Since the focs of this research is concerned with the realtime implementation of a synchronised video/adio and mltimedia time and pitch scale modification algorithm, the evalation of the adio time-scale algorithm presented here is not intended to be comprehensive. Instead, to ensre that this real-time implementation has not reslted in a compromise to the adio qality of the algorithm, a series of sbjective listening tests were carried ot in order to ensre that the TSM algorithm is as least as good as that described in [13]. The transient detection has not been sed in these comparison tests since [13] does not employ transient detection. In total, 10 sbjects ndertook a series of 20 tests 1 each, totalling 200 individal tests. The tests sed inclded slowing and speeding of adio as well as pitch shifting in both directions by a range of factors. Both time and pitch scale factors ranged from 0.75 to 1.5. A range of signals inclding solo and ensemble msic from a range of genres and male and female speech segments sampled at 16 bit, 44.1 khz comprise the test site. Each listener was presented with an nprocessed reference signal and two alternative processed signals. The same processing parameters and frame sizes are sed in each algorithm. The order in which the algorithms are presented was randomised. Figre 8. Sbjective listening test reslts for 10 sbjects. Along the horizontal axis, 1 indicates a predominant preference for real-time TSM whereas 5 indicates predominant preference for the improved phase vocoder [13]. 1

10 MM R2 9 The reslts are presented in Figre 8, where reslts for each sbject are given from 1 to 5, where 1 indicates predominant preference for real-time TSM, 3 indicates no preference, and 5 indicates predominant preference for the improved phase vocoder. The sbjective listening tests indicate that the overall trend is sch that the algorithms are perceived to perform eqally well. The average vale over all 200 tests was 2.985, very close to no preference, with a relatively low standard deviation of Sbjects who were predisposed to distinctly choosing 1 algorithm over the other tended to choose each algorithm a similar nmber of times indicating eqivalence of the algorithms. Many sbjects reported that the algorithms sonded very similar bt felt compelled to make explicit decisions regardless. The data is skewed slightly in favor of the real-time TSM algorithm, bt it is likely that a greater nmber of test sbjects wold introdce greater balance in the data. Some differences between the algorithms which may accont for this inclde the fact that the real-time TSM algorithm does not perform peak locking above 10 KHz de to the fact that peak locking is intended to maintain the phase relationship between the peak and lobes of sinsoidal components. Significant acostic energy above 10 KHz is often stochastic and attribted to transients, noise and ambience. Peak locking above 10 KHz forces non-sinsoidal components into a state of nnatral phase coherence which can sond objectionable to sbjects with acte hearing in the pper freqency range. Theoretically, the pitch shifting qality in [13] shold otperform that of the real-time algorithm bt sbjective tests have shown that the differences are largely imperceptible for moderate time scaling factors (in the region of 0.75 to 1.5) althogh the real-time algorithm can become noticeably more objectionable when opposing time and pitch scale factors are sed simltaneosly (i.e. slow down and pitch p simltaneosly). This is de to the efficient pitch shifting techniqe sed to achieve frame synchronos pitch shifting. X. A/V SYNCHRONISATION EVALUATION To measre the qality of the A/V synchronisation algorithm, we compared it with integration of or time-stretching into the FFmpeg (v0.4, ffmpeg.org/ffplay-doc.html) platform and with the MPlayer implementation (v1.0rc2, in LinxOS. FFplay is a well known efficient open sorce application for video encoding, and MPlayer is a robst, open sorce video player based on ffmpeg libraries. One of the many featres of MPlayer is the possibility to change playback speed, bt withot independent pitch-shifting. Nevertheless, this featre, robst implementation and the possibility to extract A/V synchronisation information make MPlayer sefl for evalation and comparison with or algorithm. For A/V synchronisation, FFplay ses dplicating and dropping video frames whereas MPlayer ses a variable frame rate. We compared video players on the Casino Royale trailer seqence coded in MPEG1 format with video frame dimension 640x352 at frames per second and an adio sample rate of Hz. The video frame lag with respect to adio is presented for 100 video frames from the middle of the seqence in the case of playing the video at half of the original speed (Figre 9) and with doble the original speed (Figre 10). It can be seen that or adaptive video refresh rate algorithm clearly otperforms the other two, becase of the precise matching of the video timestamp to the adio clock. The video lag of the AVRR time-stretching algorithm is also well below the ITU lip sync error recommendation with maximal video lag being 14 ms and maximal video advance being 13 ms in the case of dobled playback speed. Moreover, the standard deviation of video lag is ms, showing stability of this soltion. Video Lag (ms) AVRR Mplayer FFplay modified Frame Nmber Figre 9. Comparison of video lag for three video player implementations when playback speed is half of original. Video Lag (ms) AVRR Mplayer FFplay modified Frame Nmber Figre 10. Video lag when playback speed is dobled. Average Video Lag (ms) Time Scaling Factor AVRR FFplay Mplayer Figre 11. Average video lag as a fnction of time scaling factor. Figre 11 depicts the average video lag as a fnction of the 4.0

11 MM R2 10 time scaling factor for the three video synchronisation techniqes. FFplay was modified to ensre that it wold not decode dropped frames, otherwise its performance wold be significantly worse. However, it still shows notable degradation in performance as the time scaling factor increases beyond 2 and video frame decoding becomes significantly slower than the time to process a time scaled adio frame. MPlayer maintains sitable performance as time scale increases, thogh it does not adapt the variable refresh rate to the precise adio time codes. The AVRR method maintains strong synchronisation over the entire range of time scaling factors. Only at time scaling factors beyond 3.5 does the AVRR occasionally lose synchronisation, and opts not to decode a frame. XI. CONCLUSIONS A framework for real-time independent video time scaling and pitch shifting was presented. Carefl consideration was given to the problems which arise in a real-time context and novel soltions to these isses have been provided. It was shown how time-scale changes can be achieved in real-time with almost imperceptible latency and no transitional artefacts. The approach is based on a modified phase vocoder with optional phase locking and an integrated transient detector which enables high qality transient preservation in real-time. The framework presented is the basis for the developments of applications which allow for a seamless real-time transition between continally varying, independent video time-scale and pitch-scale parameters. A novel soltion for adio/visal synchronisation called adaptive video refresh rate has also been developed. De to the fact that synchronisation errors in the foreseen applications will be easier to detect, special focs was given to minimizing video lags and advances, reslting in an algorithm that significantly otperforms existing algorithms. [9] F. C. Li, A. Gpta, E. Sanocki, L. He, and Y. Ri, "Browsing digital video," presented at ACM CHI, Hage, Netherlands, [10] J. L. Flanagan, D. I. S. Meinhart, R. M. Golden, and M. M. Sondhi, "Phase vocoder," The Jornal of the Acostical Society of America, vol. 38, pp. 939, [11] M. Dolson, "The phase vocoder: A ttorial " Compter Msic Jornal, vol. 10, pp , [12] M. Portnoff, "Implementation of the digital phase vocoder sing the fast Forier transform," IEEE Transactions on Acostics, Speech, and Signal Processing, vol. 24, pp , [13] J. Laroche and M. Dolson, "Improved phase vocoder timescale modification of adio," IEEE Trans. Speech and Adio Processing, vol. 7, pp , [14] J. Bonada, "Atomatic techniqe in freqency domain for near-lossless time-scale modification of adio," presented at International Compter Msic Conference, pp , Berlin, Germany, [15] J. Laroche, "Atocorrelation method for high qality time/pitch scaling," presented at IEEE WASPAA, pp , Mohonk, NY, [16] C. Dxbry, M. Davies, and M. Sandler, "Improved time-scaling of msical adio sing phase locking at transients," presented at 112th AES Convention, pp. 1-5, Mnich, Germany, May 10-13, [17] D. Barry, D. FitzGerald, and E. Coyle, "Drm Sorce Separation sing Percssive Featre Detection and Spectral Modlation," presented at IEE Irish Signals and Systems Conference, pp , Dblin, Ireland, [18] International Telecommnication Union Docment 11A/47-E, 13 October [19] "Relative Timing of Sond and Vision for Broadcasting. Recommendation," International Telecommnication Union ITU-R BT , REFERENCES [1] M. C. Yang, S. T. Liang, and Y. G. Chen, "Dynamic video playot smoothing method for mltimedia applications," Mltimedia Tools and Applications, vol. 6, pp , [2] M. Kalman, E. Steinbach, and B. Girod, "Adaptive Media Playot for Low Delay Video Streaming over Error-Prone Channels," IEEE Transactions on Circits and Systems for Video Technology, vol. 14, pp , [3] Y. J. Liang, N. Färber, and B. Girod, "Adaptive playot schedling sing time-scale modification in packet voice commnication," presented at International Conference on Acostics, Speech, and Signal Processing (ICASSP), pp , Salt Lake City, [4] P. LaBarbera and J. MacLachlan, "Time-Compressed Speech in Radio Advertising," Jornal of Marketing, vol. 43, pp , [5] C. Landone, J. Harrop, and J. D. Reiss, "Enabling Access to Sond Archives throgh Integration, Enrichment and Retrieval: The EASAIER Project," presented at 8th International Conference on Msic Information Retrieval (ISMIR), Vienna, [6] C. Dffy, "A case stdy of networked sond resorces for edcation in traditional msic: the HOTBED project," presented at Integration of Msic in Mltimedia Applications, Barcelona, Spain, [7] J. S. Olson, "A Stdy of the relative effectiveness of verbal and visal agmentation of rate-modified speech in the presentation of technical material," in Annal Convention of the Association for Edcational Commnications and Technology (AECT). Anaheim, CA, [8] K. Harrigan, "The SPECIAL system: Searching time-compressed digital video lectres," Jornal of Research on Compting in Edcation, vol. 33, pp , 2000.

Analog Signal Input. ! Note: B.1 Analog Connections. Programming for Analog Channels

Analog Signal Input. ! Note: B.1 Analog Connections. Programming for Analog Channels B Analog Signal Inpt B.1 Analog Connections Refer to the diagram (page B-10) showing the VAN analog boards for connection of analog inpts. Be sre yo follow the indicated positive and negative polarity

More information

Speech Recognition Combining MFCCs and Image Features

Speech Recognition Combining MFCCs and Image Features Speech Recognition Combining MFCCs and Image Featres S. Karlos from Department of Mathematics N. Fazakis from Department of Electrical and Compter Engineering K. Karanikola from Department of Mathematics

More information

A Parallel Multilevel-Huffman Decompression Scheme for IP Cores with Multiple Scan Chains

A Parallel Multilevel-Huffman Decompression Scheme for IP Cores with Multiple Scan Chains A Parallel Mltilevel-Hffman Decompression Scheme for IP Cores with Mltiple Scan Chains X Kavosianos, E Kalligeros 2 and D Nikolos 2 Compter Science Dept, University of Ioannina, 45 Ioannina, Greece 2 Compter

More information

c:: Frequency response characteristics for sinusoidal movement in the fovea and periphery* ==> 0.' SOO O.S 2.0

c:: Frequency response characteristics for sinusoidal movement in the fovea and periphery* ==> 0.' SOO O.S 2.0 Freqency response characteristics for sinsoidal movement in the fovea and periphery* C. WILLIAM TYLER and JEAN TORRES Northeastern University, Boston, Massachsetts 211 Threshold sensitivity was measred

More information

MINIMED 640G SYSTEM^ Getting Started. WITH THE MiniMed 640G INSULIN PUMP

MINIMED 640G SYSTEM^ Getting Started. WITH THE MiniMed 640G INSULIN PUMP MINIMED 640G SYSTEM^ Getting Started WITH THE MiniMed 640G INSULIN PUMP let s get started! Table of Contents Section 1: Getting Started... 3 Getting Started with the MiniMed 640G Inslin Pmp...3 1.1 Pmp

More information

Easy Estimation of Spectral Purity of Test Signals for ADC Testing. David Slepička

Easy Estimation of Spectral Purity of Test Signals for ADC Testing. David Slepička Sep. -4, 008, lorence, Italy Easy Estimation of Spectral Prity of Test Signals for ADC Testing David Slepička Czech Technical University in Prage, aclty of Electrical Engineering, Dept. of Measrement Technická,

More information

A Buyers Guide to Laser Projection

A Buyers Guide to Laser Projection The Eropean Digital Cinema Form A Byers Gide to Laser Projection AUTUMN 2018 Table of Contents Slides 2-5 Introdctory notes Slides 6-22 1: Technical Considerations Slides 23-31 2. Financial and lifetime

More information

With Ease. BETTY WAGNER Associate Trinity College London, Associate Music Australia READING LEDGER LINE NOTES

With Ease. BETTY WAGNER Associate Trinity College London, Associate Music Australia READING LEDGER LINE NOTES READING LEDGER LINE NOTES With Ease f G f o o BETTY WAGNER Associate Trinity College London, Associate Msic Astralia READING LEDGER LINE NOTES A Nova WITH EASE Book Company Page Pblication http://www.msic-with-ease.com

More information

E-Vision Laser 4K Series High Brightness Digital Video Projector

E-Vision Laser 4K Series High Brightness Digital Video Projector E-Vision Laser 4K Series High Brightness Digital Video Projector 4INSTALLATION AND QUICK-START GUIDE 4CONNECTION GUIDE 4OPERATING GUIDE 4REFERENCE GUIDE 118-157A Abot This Docment Follow the instrctions

More information

Using Device-Specific Data Acquisition for Automated Laboratory Testing

Using Device-Specific Data Acquisition for Automated Laboratory Testing TRANSPOR'IATION RESEARCH RECORD 1432 9 Using Device-Specific Data Acqisition for Atomated Laboratory Testing THOMAS C. SHEAHAN, DON J. DEGROOT, AND JOHN T. GERMAINE Compter-based data acqisition systems

More information

1. Basic safety information 4 2. Proper use 4

1. Basic safety information 4 2. Proper use 4 307041 01 EN Digital twilight switch LUNA 120 top2 1200100/ 1200200 1. Basic safety information 4 2. Proper se 4 Disposal 4 3. Installation and connection 5 Monting the time switch 5 Connecting the cable

More information

A Model for Scale-Degree Reinterpretation: Melodic Structure, Modulation, and Cadence Choice in the Chorale Harmonizations of J. S.

A Model for Scale-Degree Reinterpretation: Melodic Structure, Modulation, and Cadence Choice in the Chorale Harmonizations of J. S. Empirical Msicology Review Vol. 10, No. 3, 2015 A Model for Scale-Degree Reinterpretation: Melodic Strctre, Modlation, and Cadence Choice in the Chorale Harmonizations of J. S. Bach TREVOR de CLERCQ[1]

More information

LB3-PCx50 Premium Cabinet Loudspeakers

LB3-PCx50 Premium Cabinet Loudspeakers Conications Systems LB3-PCx Premim Cabinet Lodspeakers LB3-PCx Premim Cabinet Lodspeakers www.boschsecrity.com High-qality speech and msic reprodction Weatherized, sited for sheltered otside se Prepared

More information

DQMx Series. Digital QAM Multiplexer INSTRUCTION MANUAL. Model Stock No. Description

DQMx Series. Digital QAM Multiplexer INSTRUCTION MANUAL. Model Stock No. Description One Jake Brown Road Old Bridge, NJ 08857-1000 USA (800) 523-6049 (732) 679-4000 FAX: (732) 679-4353 www.blondertonge.com INSTRUCTION MANUAL DQMx Series Digital QAM Mltiplexer Model Stock No. Description

More information

Chapter 4 (Part I) The Processor. Baback Izadi Division of Engineering Programs

Chapter 4 (Part I) The Processor. Baback Izadi Division of Engineering Programs EGC442 Introdction to Compter Architectre Chapter 4 (Part I) The Processor Baback Izadi Division of Engineering Programs bai@engr.newpaltz.ed Introdction CPU performance factors Instrction cont Determined

More information

DINION 5000 AN. Video DINION 5000 AN. Ultra high resolution 960H sensor

DINION 5000 AN. Video DINION 5000 AN.   Ultra high resolution 960H sensor Video TVL 960H Ultra high resoltion 960H sensor High Dynamic Range to see bright and dark details simltaneosly Detail enhancement Bilinx commnication for remote set-p and control Easy to install The DINION

More information

DINION AN H. Video DINION AN Ultra high resolution 960H sensor

DINION AN H. Video DINION AN Ultra high resolution 960H sensor Video www.boschsecrity.com 960H Ultra high resoltion 960H sensor High Dynamic Range to see bright and dark details simltaneosly Detail enhancement Bilinx commnication for remote set-p and control Easy

More information

Review: What is it? What does it do? slti $4, $5, 6

Review: What is it? What does it do? slti $4, $5, 6 Review: What is it? What does it do? Reg Src Instrction Instrction [3-] I [25-2] I [2-6] I [5 - ] 2 Src Op Reslt em em emtoreg I [5 - ] etend slti $, $5, 6 Reg Src Instrction Instrction [3-] I [25-2] I

More information

Drum Source Separation using Percussive Feature Detection and Spectral Modulation

Drum Source Separation using Percussive Feature Detection and Spectral Modulation ISSC 25, Dublin, September 1-2 Drum Source Separation using Percussive Feature Detection and Spectral Modulation Dan Barry φ, Derry Fitzgerald^, Eugene Coyle φ and Bob Lawlor* φ Digital Audio Research

More information

General Specifications

General Specifications General Specifications Model PR300 Power and Energy Meter GS 77C01E01-01E Oeriew This panel-monted power and energy meter with a large, three-row ED display integrates all the measring fnctions reqired

More information

DINION 5000 AN. Video DINION 5000 AN. Ultra high resolution 960H sensor

DINION 5000 AN. Video DINION 5000 AN.  Ultra high resolution 960H sensor Video www.boschsecrity.com TVL 960H Ultra high resoltion 960H sensor High Dynamic Range to see bright and dark details simltaneosly Detail enhancement Bilinx commnication for remote set-p and control Easy

More information

A prototype system for rule-based expressive modifications of audio recordings

A prototype system for rule-based expressive modifications of audio recordings International Symposium on Performance Science ISBN 0-00-000000-0 / 000-0-00-000000-0 The Author 2007, Published by the AEC All rights reserved A prototype system for rule-based expressive modifications

More information

Cast Away on the Letter A

Cast Away on the Letter A Cast Away on the Letter A TEACHER S GUIDE ELA COMMON CORE STANDARDS 4TH GRADE: For 4th Grade: Key Ideas and Details CCSS.ELA-LITERACY.RL.4.2 Determine a theme of a story, drama, or poem from details in

More information

¾Strip cable to 8 mm (max. 9) ¾Insert cable in the open DuoFix plug-in terminal at 45. LL2 cables per terminal position possible

¾Strip cable to 8 mm (max. 9) ¾Insert cable in the open DuoFix plug-in terminal at 45. LL2 cables per terminal position possible 307253 45 cable E Time switch TR 610 top3 6100130 TR 612 top3 6120130 DoFix plg-in terminal Test tap Plg-in screwless terminal C contact 1. Basic safety information WARIG Danger of death throgh electric

More information

Novel Blind Recognition Algorithm of Frame Synchronization Words Based on Soft- Decision in Digital Communication Systems

Novel Blind Recognition Algorithm of Frame Synchronization Words Based on Soft- Decision in Digital Communication Systems RESEARCH ARTICLE Novel Blind Recognition Algorithm of Frame Synchronization Words Based on Soft- Decision in Digital Commnication Systems Jiangyi Qin*, Zhiping Hang, Chnw Li, Shaojing S, Jing Zho College

More information

Vadim V. Romanuke * (Professor, Polish Naval Academy, Gdynia, Poland)

Vadim V. Romanuke * (Professor, Polish Naval Academy, Gdynia, Poland) Electrical, Control and Commnication Engineering ISSN 2255-959 (online) ISSN 2255-940 (print) 20, vol. 4, no., pp. 5 57 doi: 0.247/ecce-20-0006 https://www.degryter.com/view/j/ecce An Attempt of Finding

More information

In 2007, Pew Research conducted a survey to assess Americans knowledge of

In 2007, Pew Research conducted a survey to assess Americans knowledge of CHAPTER 12 Sample Srveys In 2007, Pew Research condcted a srvey to assess Americans knowledge of crrent events. They asked a random sample of 1,502 U.S. adlts 23 factal qestions abot topics crrently in

More information

Montgomery Modular Exponentiation on Reconfigurable Hardware æ

Montgomery Modular Exponentiation on Reconfigurable Hardware æ Montgomery Modlar Exponentiation on Reconfigrable Hardware æ Thomas Blm Worcester Polytechnic Institte ECE Department Worcester, MA 0609-2280, USA tblm@ece.wpi.ed Christof Paar christof@ece.wpi.ed Abstract

More information

Field Communication FXA 675 Rackbus RS-485 Interface monorack II RS-485

Field Communication FXA 675 Rackbus RS-485 Interface monorack II RS-485 Technical Information TI 221F/00/en Field Commnication RS-485 Interface monorack II RS-485 For distribted control of Commtec transmitters and field transmitters with RS-485 interface 19" Racksyst plg-in

More information

LTC 8800 Series Allegiant Matrix/Control Systems - Modular

LTC 8800 Series Allegiant Matrix/Control Systems - Modular Video LTC 88 Series Allegiant Matrix/Control Systems - Modlar LTC 88 Series Allegiant Matrix/Control Systems - Modlar www.boschsecrity.com 5 Camera by 4 monitor switching Expandable to larger matrix sizes

More information

Department of Electrical & Electronic Engineering Imperial College of Science, Technology and Medicine. Project: Real-Time Speech Enhancement

Department of Electrical & Electronic Engineering Imperial College of Science, Technology and Medicine. Project: Real-Time Speech Enhancement Department of Electrical & Electronic Engineering Imperial College of Science, Technology and Medicine Project: Real-Time Speech Enhancement Introduction Telephones are increasingly being used in noisy

More information

EX65 Explosion Protected Camera

EX65 Explosion Protected Camera Video EX65 Explosion Protected EX65 Explosion Protected www.boschsecrity.com Electropolished 316L stainless steel or alminm constrction High-resoltion, high-sensitivity Dinion 2X imager with WDR Integrated

More information

HIGHlite 4K Series High Brightness Digital Video Projector

HIGHlite 4K Series High Brightness Digital Video Projector HIGHlite 4K Series High Brightness Digital Video Projector 4INSTALLATION AND QUICK-START GUIDE 4CONNECTION GUIDE 4OPERATING GUIDE 4REFERENCE GUIDE Rev A Febrary 2018 118-083A Abot This Docment Follow the

More information

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video

Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Skip Length and Inter-Starvation Distance as a Combined Metric to Assess the Quality of Transmitted Video Mohamed Hassan, Taha Landolsi, Husameldin Mukhtar, and Tamer Shanableh College of Engineering American

More information

HELMUT T. ZWAHLEN AND UMA DEVI VEL

HELMUT T. ZWAHLEN AND UMA DEVI VEL TRANSPORTATION RESEARCH RECORD 1456 125 Conspicity in Terms of Peripheral Visal Detection and Recognition of Florescent Color Targets Verss N onflorescent Color Targets Against Different Backgronds in

More information

1. Basic safety information. 2. Proper use. 3. Installation and connection. Time switch installation. Disposal. click. Time switch.

1. Basic safety information. 2. Proper use. 3. Installation and connection. Time switch installation. Disposal. click. Time switch. 307029 Time switch installation EN Time switch SELEKTA 175 top2 1750100 1. Basic safety information!! WARNING Danger of death from electric shock or fire! ¾Installation shold only be carried ot by a qalified

More information

The nature of the social experience at popular music festivals: Bestival a case study. Millie Devereux Caroline Jackson Bournemouth University

The nature of the social experience at popular music festivals: Bestival a case study. Millie Devereux Caroline Jackson Bournemouth University The natre of the social experience at poplar msic festivals: Bestival a case stdy Millie Deverex Caroline Jackson Bornemoth University Content Rationale, aim and objectives Literatre review Methodology

More information

FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS

FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS ABSTRACT FLEXIBLE SWITCHING AND EDITING OF MPEG-2 VIDEO BITSTREAMS P J Brightwell, S J Dancer (BBC) and M J Knee (Snell & Wilcox Limited) This paper proposes and compares solutions for switching and editing

More information

DESIGN O'F A HIGH SPEED DDA

DESIGN O'F A HIGH SPEED DDA DESIGN O'F A HIGH SPEED DDA Mark W. Goldman Gidance and Control Department Martin Company Baltimore, Maryland INTRODUCTION The objective of the company-fnded task which spported this work was to develop

More information

Product Overview 2009

Product Overview 2009 Prodct Overview 2009 Living high tech 1 Contents Editorial...3 The new ECoS 4 The new ECoS - Jst Play...5 Fnctions detailed...7 Expandibility...9 ECoS 10 ECoS...10 Expandibility...11 Navigator 12 Eqipment

More information

DIVAR network 3000 recorder

DIVAR network 3000 recorder Video DIVAR network 3000 recorder DIVAR network 3000 recorder www.boschsecrity.com APP H.265 32 IP channels with incoming 12 MP IP camera spport for view and playback Real time live display for 16 channels

More information

6.UAP Project. FunPlayer: A Real-Time Speed-Adjusting Music Accompaniment System. Daryl Neubieser. May 12, 2016

6.UAP Project. FunPlayer: A Real-Time Speed-Adjusting Music Accompaniment System. Daryl Neubieser. May 12, 2016 6.UAP Project FunPlayer: A Real-Time Speed-Adjusting Music Accompaniment System Daryl Neubieser May 12, 2016 Abstract: This paper describes my implementation of a variable-speed accompaniment system that

More information

1. Basic safety information 4 2. Proper use 4

1. Basic safety information 4 2. Proper use 4 307304 EN Astro Time Switch SELEKTA 170 top3 1700130 SELEKTA 174 top3 1740130 1. Basic safety information 4 2. Proper se 4 Disposal 4 3. Installation and connection 5 Time switch installation 5 Connecting

More information

UML Series 42- and 55-inch High Performance HD LED Monitors

UML Series 42- and 55-inch High Performance HD LED Monitors Video UML Series 42- and 55-inch High Performance HD LED Monitors UML Series 42- and 55-inch High Performance HD LED Monitors www.boschsecrity.com Fll HD 1920 x 1080 resoltion Precise color reprodction

More information

LB2 Premium-sound Cabinet Loudspeaker Range

LB2 Premium-sound Cabinet Loudspeaker Range Commnications Systems LB2 Premim-sond Cabinet Lodspeaker Range LB2 Premim-sond Cabinet Lodspeaker Range www.boschsecrity.com High-fidelity msic and speech reprodction Selectable 8 ohm, 70 V and 100 V inpts

More information

SPECTRA RESEARCH Institute

SPECTRA RESEARCH Institute SPECTRA RESEARCH Institte Final Report Neroelectric Activity and Analysis in Spport of Direct Brainwave to Compter Interface Development Richard H. Dickhat prepared for the Office of Naval Research nder

More information

E-Vision Laser 7500 Series E-Vision Laser 8500 Series E-Vision Laser 10K Series High Brightness Digital Video Projector

E-Vision Laser 7500 Series E-Vision Laser 8500 Series E-Vision Laser 10K Series High Brightness Digital Video Projector E-Vision Laser 7500 Series E-Vision Laser 8500 Series E-Vision Laser 10K Series High Brightness Digital Video Projector 4INSTALLATION AND QUICK-START GUIDE 4CONNECTION GUIDE 4OPERATING GUIDE 4REFERENCE

More information

EXHIBITOR S PROSPECTUS

EXHIBITOR S PROSPECTUS EXHIBITOR S PROSPECTUS Annal Conference & Trade Show TORCH Annal Conference & Trade Show April 18-20, 2017 Hyatt Regency Dallas DEADLINE FOR APPLICATION March 27, 2017 President s Message It is my pleasre

More information

DIVAR AN H RT APP. Video DIVAR AN H RT high-resolution on HDMI output. Mobile device support (ios, Android)

DIVAR AN H RT APP. Video DIVAR AN H RT high-resolution on HDMI output. Mobile device support (ios, Android) Video www.boschsecrity.com 960H RT APP 960H RT high-resoltion on HDMI otpt Mobile device spport (ios, Android) Remote alarm notification Network fnction for remote viewing, playback and control Simltaneos

More information

Music Theory Level 2. Name. Period

Music Theory Level 2. Name. Period Msic Theory evel 2 Name Period Table of Contents edger ines Grand Staff Page 3 Page 4 edger ine and Grand Staff Review Page 5 Grand Staff Piano Visal Page 6 Time Signatres Page 79 Theory Review Page Dotted

More information

BRAND GUIDELINES 2017

BRAND GUIDELINES 2017 BRAND GUIDELINES 2017 01 CONTENTS Introdction 02 Or Brand 04 Brand Positioning Statement 06 Reasons to Believe 07 Tone of Voice 09 Visal Gidelines 10 Typography: Print & Web 11 Color Palette 13 Using the

More information

An Efficient Spurious Power Suppression Technique (SPST) and its Applications on MPEG-4 AVC/H.264 Transform Coding Design

An Efficient Spurious Power Suppression Technique (SPST) and its Applications on MPEG-4 AVC/H.264 Transform Coding Design An Efficient Sprios Sppression echniqe (SPS) and s Applications on PEG-4 AVC/H64 ransform Coding De Kan-Hng Chen, Ko-Chan Chao, Jinn-Shyan Wang, Yan-Sn Ch Department of Electrical Engineering, National

More information

Experimental Study on Two-Phase Flow Instability in System Including Downcomers

Experimental Study on Two-Phase Flow Instability in System Including Downcomers Jornal of Nclear Science and Technology SSN: 0022-3131 (Print) 1881-1248 (Online) Jornal homepage: https://www.tandfonline.com/loi/tnst Experimental Stdy on Two-Phase Flow nstability in System nclding

More information

Understanding Compression Technologies for HD and Megapixel Surveillance

Understanding Compression Technologies for HD and Megapixel Surveillance When the security industry began the transition from using VHS tapes to hard disks for video surveillance storage, the question of how to compress and store video became a top consideration for video surveillance

More information

DIVAR AN H RT APP. Video DIVAR AN H RT high-resolution on HDMI output. Mobile device support (ios, Android)

DIVAR AN H RT APP. Video DIVAR AN H RT high-resolution on HDMI output. Mobile device support (ios, Android) Video www.boschsecrity.com 960H RT APP 960H RT high-resoltion on HDMI otpt Mobile device spport (ios, Android) Network fnction for remote viewing, playback, control and alarm notification Simltaneos monitoring,

More information

Chapter 10 Basic Video Compression Techniques

Chapter 10 Basic Video Compression Techniques Chapter 10 Basic Video Compression Techniques 10.1 Introduction to Video compression 10.2 Video Compression with Motion Compensation 10.3 Video compression standard H.261 10.4 Video compression standard

More information

Experimental. E-Gun. E-Gun Modulator Arrangement AI VI MONITORS TRIODE ELECTRON BEA~ CATHODE TRIGGER

Experimental. E-Gun. E-Gun Modulator Arrangement AI VI MONITORS TRIODE ELECTRON BEA~ CATHODE TRIGGER TEMPORAL WAVESHAPNG OF A TRODE COLD CATHODE ELECTRON BEAM GUN Gary Loda Beta Development Corporation, 557 Sierra Lane Dblin, CA 945 Sol Schneider* Little Silver, NJ 7739 William F. Otto and George J. Deenberg

More information

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS

POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS POST-PROCESSING FIDDLE : A REAL-TIME MULTI-PITCH TRACKING TECHNIQUE USING HARMONIC PARTIAL SUBTRACTION FOR USE WITHIN LIVE PERFORMANCE SYSTEMS Andrew N. Robertson, Mark D. Plumbley Centre for Digital Music

More information

Analysis, Synthesis, and Perception of Musical Sounds

Analysis, Synthesis, and Perception of Musical Sounds Analysis, Synthesis, and Perception of Musical Sounds The Sound of Music James W. Beauchamp Editor University of Illinois at Urbana, USA 4y Springer Contents Preface Acknowledgments vii xv 1. Analysis

More information

VIP X16 XF E Video Encoder

VIP X16 XF E Video Encoder Video VIP X16 XF E Video Encoder VIP X16 XF E Video Encoder www.boschsecrity.com 16 video inpt channels with Dal Streaming Deinterlacing at video inpt and progressive encoding Network-attached iscsi recording

More information

DIVAR network 2000 recorder

DIVAR network 2000 recorder Video DIVAR network 2000 recorder DIVAR network 2000 recorder www.boschsecrity.com APP H.265 16 IP channels with 256 Mbps incoming bandwidth 8 MP (UHD) IP camera spport for view and playback Real time

More information

770pp. THEORIA 64 (2009)

770pp. THEORIA 64 (2009) DOV M. GABBAY AND JOHN WOODS: The Rise of Modern Logic: From Leibniz to Frege. [Handbook of the History of Logic, vol. 3]. Elsevier North Holland, Amsterdam, 2004, 770pp. This volme contains essays on

More information

DIVAR network 2000 recorder

DIVAR network 2000 recorder Video DIVAR network 2000 recorder DIVAR network 2000 recorder www.boschsecrity.com APP H.265 16 IP channels with 256 Mbps incoming bandwidth 8 MP (UHD) IP camera spport for view and playback Real time

More information

Motion Video Compression

Motion Video Compression 7 Motion Video Compression 7.1 Motion video Motion video contains massive amounts of redundant information. This is because each image has redundant information and also because there are very few changes

More information

MetroLED. Linear LED Lighting System for Display Illumination

MetroLED. Linear LED Lighting System for Display Illumination MetroLED Linear LED Lighting System for Display Illmination The MetroLED lighting system. Click, twist, play. LED lighting systems have traditionally been the poor relation to fibre optics when it comes

More information

Understanding PQR, DMOS, and PSNR Measurements

Understanding PQR, DMOS, and PSNR Measurements Understanding PQR, DMOS, and PSNR Measurements Introduction Compression systems and other video processing devices impact picture quality in various ways. Consumers quality expectations continue to rise

More information

NWD 455 FlexiDome IP Cameras

NWD 455 FlexiDome IP Cameras Video NWD 455 FlexiDome IP Cameras NWD 455 FlexiDome IP Cameras www.boschsecrity.com High-impact, vandal-resistant enclosre Advanced color CCD network dome camera NightSense for low light sitations High-performance

More information

FLEXIDOME 5000 AN. Video FLEXIDOME 5000 AN. Ultra high resolution 960H sensor

FLEXIDOME 5000 AN. Video FLEXIDOME 5000 AN.   Ultra high resolution 960H sensor Video www.boschsecrity.com TVL 960H Ultra high resoltion 960H sensor High Dynamic Range to see bright and dark details simltaneosly High impact, vandal-resistant, weatherproof hosing Bilinx commnication

More information

LBC 347x/00 Horn and Driver Loudspeaker Range

LBC 347x/00 Horn and Driver Loudspeaker Range Conications Systems LBC 347x/ Horn and Driver Lodspeaker Range LBC 347x/ Horn and Driver Lodspeaker Range High efficiency drivers Excellent speech reprodction Easy assembly Water- and dst protected to

More information

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora

MULTI-STATE VIDEO CODING WITH SIDE INFORMATION. Sila Ekmekci Flierl, Thomas Sikora MULTI-STATE VIDEO CODING WITH SIDE INFORMATION Sila Ekmekci Flierl, Thomas Sikora Technical University Berlin Institute for Telecommunications D-10587 Berlin / Germany ABSTRACT Multi-State Video Coding

More information

Case Study: Can Video Quality Testing be Scripted?

Case Study: Can Video Quality Testing be Scripted? 1566 La Pradera Dr Campbell, CA 95008 www.videoclarity.com 408-379-6952 Case Study: Can Video Quality Testing be Scripted? Bill Reckwerdt, CTO Video Clarity, Inc. Version 1.0 A Video Clarity Case Study

More information

Video coding standards

Video coding standards Video coding standards Video signals represent sequences of images or frames which can be transmitted with a rate from 5 to 60 frames per second (fps), that provides the illusion of motion in the displayed

More information

MAutoPitch. Presets button. Left arrow button. Right arrow button. Randomize button. Save button. Panic button. Settings button

MAutoPitch. Presets button. Left arrow button. Right arrow button. Randomize button. Save button. Panic button. Settings button MAutoPitch Presets button Presets button shows a window with all available presets. A preset can be loaded from the preset window by double-clicking on it, using the arrow buttons or by using a combination

More information

Pipelining. Improve performance by increasing instruction throughput Program execution order. Data access. Instruction. fetch. Data access.

Pipelining. Improve performance by increasing instruction throughput Program execution order. Data access. Instruction. fetch. Data access. Chapter 6 Pipelining Improve performance by increasing instrction throghpt Program eection order Time (in instrctions) lw $, ($) Instrction fetch 2 4 6 8 2 4 6 8 ALU Data access lw $2, 2($) 8 ns Instrction

More information

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB

Laboratory Assignment 3. Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB Laboratory Assignment 3 Digital Music Synthesis: Beethoven s Fifth Symphony Using MATLAB PURPOSE In this laboratory assignment, you will use MATLAB to synthesize the audio tones that make up a well-known

More information

DIVAR network 5000 recorder

DIVAR network 5000 recorder Video DIVAR network 5000 recorder DIVAR network 5000 recorder www.boschsecrity.com APP H.265 32 IP channels with 320 Mbps incoming bandwidth 12 MP IP camera spport for view and playback Real time live

More information

A P D C G Middle C u B

A P D C G Middle C u B READING MUSIC NOTES With Ease E For the Adlt Beginner A P D C G Middle C B f F BETTY WAGNER Associate Trinity College London, Associate Msic Astralia READING MUSIC NOTES WITH EASE Page 1 http://www.msic-with-ease.com

More information

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1

Using the new psychoacoustic tonality analyses Tonality (Hearing Model) 1 02/18 Using the new psychoacoustic tonality analyses 1 As of ArtemiS SUITE 9.2, a very important new fully psychoacoustic approach to the measurement of tonalities is now available., based on the Hearing

More information

MIC Series IP Power Supply

MIC Series IP Power Supply Video MIC Series IP Power Spply MIC Series IP Power Spply www.boschsecrity.com MIC power spply with IVA-enabled, integrated Bosch IP technology provides video and control over IP for MIC550, MIC550IR,

More information

100 Hz Chassis 28 CTV

100 Hz Chassis 28 CTV SERVICE MANUAL 100 Hz Chassis 28 CTV Contents Technical Data..............................................4 Recommendation for service repairs............................5 Component description.......................................7

More information

8-1. Advanced Features About TV Watching TV... TV Antenna TV Windows Initial Setup Channel Settings...

8-1. Advanced Features About TV Watching TV... TV Antenna TV Windows Initial Setup Channel Settings... Abot... -2 Antenna... -3 Winows... -4 Initial Setp... -5 Watching... -6 Time Shift Playback While Watching... From Qick Lancher... Data Broacasts (Japanese)... Program Gie... -7-7 - -9 Recoring/Playing

More information

An Overview of Video Coding Algorithms

An Overview of Video Coding Algorithms An Overview of Video Coding Algorithms Prof. Ja-Ling Wu Department of Computer Science and Information Engineering National Taiwan University Video coding can be viewed as image compression with a temporal

More information

VIDEOJET decoder 8000

VIDEOJET decoder 8000 Video www.boschsecrity.com HD, 4K UHD and MP decoding Flexible monitor layots Directly drives p to two 4K UHD monitors Displays VCA metadata Compact size and VESA mont displays video from Standard Definition

More information

Features 1 Harris and other corners

Features 1 Harris and other corners CS 4495 Compter Vision A. Bobick Featres 1: Harris CS 4495 Compter Vision Featres 1 Harris Aaron Bobick School of nteractie Compting CS 4495 Compter Vision A. Bobick Featres 1: Harris Administriia PS 4:

More information

Image Acquisition Technology

Image Acquisition Technology Image Choosing the Right Image Acquisition Technology A Machine Vision White Paper 1 Today, machine vision is used to ensure the quality of everything from tiny computer chips to massive space vehicles.

More information

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP- 59 Measurement and Management of Loudness in Soundtracks for Television Broadcasting

FREE TV AUSTRALIA OPERATIONAL PRACTICE OP- 59 Measurement and Management of Loudness in Soundtracks for Television Broadcasting Page 1 of 10 1. SCOPE This Operational Practice is recommended by Free TV Australia and refers to the measurement of audio loudness as distinct from audio level. It sets out guidelines for measuring and

More information

However, in studies of expressive timing, the aim is to investigate production rather than perception of timing, that is, independently of the listene

However, in studies of expressive timing, the aim is to investigate production rather than perception of timing, that is, independently of the listene Beat Extraction from Expressive Musical Performances Simon Dixon, Werner Goebl and Emilios Cambouropoulos Austrian Research Institute for Artificial Intelligence, Schottengasse 3, A-1010 Vienna, Austria.

More information

DIVAR network 5000 recorder

DIVAR network 5000 recorder Video DIVAR network 5000 recorder DIVAR network 5000 recorder www.boschsecrity.com APP H.265 32 IP channels with 320 Mbps incoming bandwidth 12 MP IP camera spport for view and playback Real time live

More information

MIC Series IP Power Supply

MIC Series IP Power Supply Video MIC Series IP Power Spply MIC Series IP Power Spply www.boschsecrity.com MIC power spply with IVA-enabled, integrated Bosch IP technology provides video and control over IP for MIC550, MIC550IR,

More information

mag TBC SYNC+ Digital Video Time Base Corrector and Synchronizer Operations Manual with or without Digital Effects

mag TBC SYNC+ Digital Video Time Base Corrector and Synchronizer Operations Manual with or without Digital Effects Pl'JJ me e. mag Operations Manal TBC SYNC+ Digital Video Time Base Corrector and Synchronizer /'', f ' with or withot Digital Effects The information in this manal is frnished solely for the prpose of

More information

1 Introduction to PSQM

1 Introduction to PSQM A Technical White Paper on Sage s PSQM Test Renshou Dai August 7, 2000 1 Introduction to PSQM 1.1 What is PSQM test? PSQM stands for Perceptual Speech Quality Measure. It is an ITU-T P.861 [1] recommended

More information

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS

AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS AN IMPROVED ERROR CONCEALMENT STRATEGY DRIVEN BY SCENE MOTION PROPERTIES FOR H.264/AVC DECODERS Susanna Spinsante, Ennio Gambi, Franco Chiaraluce Dipartimento di Elettronica, Intelligenza artificiale e

More information

FULL-AUTOMATIC DJ MIXING SYSTEM WITH OPTIMAL TEMPO ADJUSTMENT BASED ON MEASUREMENT FUNCTION OF USER DISCOMFORT

FULL-AUTOMATIC DJ MIXING SYSTEM WITH OPTIMAL TEMPO ADJUSTMENT BASED ON MEASUREMENT FUNCTION OF USER DISCOMFORT 10th International Society for Music Information Retrieval Conference (ISMIR 2009) FULL-AUTOMATIC DJ MIXING SYSTEM WITH OPTIMAL TEMPO ADJUSTMENT BASED ON MEASUREMENT FUNCTION OF USER DISCOMFORT Hiromi

More information

Controlling adaptive resampling

Controlling adaptive resampling Controlling adaptive resampling Fons ADRIAENSEN, Casa della Musica, Pzle. San Francesco 1, 43000 Parma (PR), Italy, fons@linuxaudio.org Abstract Combining audio components that use incoherent sample clocks

More information

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes

DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring Week 6 Class Notes DAT335 Music Perception and Cognition Cogswell Polytechnical College Spring 2009 Week 6 Class Notes Pitch Perception Introduction Pitch may be described as that attribute of auditory sensation in terms

More information

Dual frame motion compensation for a rate switching network

Dual frame motion compensation for a rate switching network Dual frame motion compensation for a rate switching network Vijay Chellappa, Pamela C. Cosman and Geoffrey M. Voelker Dept. of Electrical and Computer Engineering, Dept. of Computer Science and Engineering

More information

Andrián Pertout. Sonus dulcis. for Clarinet and Pianoforte. No. 375g

Andrián Pertout. Sonus dulcis. for Clarinet and Pianoforte. No. 375g Andrián Pertot Sons dlcis for Clarinet and Pianoforte No 75g Andrián Pertot Sons dlcis for Clarinet and Pianoforte No 75g Composed in Agst, 000 (Revised in anary, 00) Arranged for Do DICTO (Marco Antonio

More information

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES

OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES OBJECTIVE EVALUATION OF A MELODY EXTRACTOR FOR NORTH INDIAN CLASSICAL VOCAL PERFORMANCES Vishweshwara Rao and Preeti Rao Digital Audio Processing Lab, Electrical Engineering Department, IIT-Bombay, Powai,

More information

ELEC 484 Project Pitch Synchronous Overlap-Add

ELEC 484 Project Pitch Synchronous Overlap-Add ELEC 484 Project Pitch Synchronous Overlap-Add Joshua Patton University of Victoria, BC, Canada This report will discuss steps towards implementing a real-time audio system based on the Pitch Synchronous

More information

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY Eugene Mikyung Kim Department of Music Technology, Korea National University of Arts eugene@u.northwestern.edu ABSTRACT

More information