ADX TRAX. Insights on Audio Source Separation. Audio Praxis. Rick Silva

Similar documents
WAVES Cobalt Saphira. User Guide

S I N E V I B E S FRACTION AUDIO SLICING WORKSTATION

Simple Harmonic Motion: What is a Sound Spectrum?

Fraction by Sinevibes audio slicing workstation

Eventide Inc. One Alsan Way Little Ferry, NJ

Eventide Inc. One Alsan Way Little Ferry, NJ

500 SERIES DE-ESSER OWNER S MANUAL

Why We Measure Loudness

Version 1.10 CRANE SONG LTD East 5th Street Superior, WI USA tel: fax:

SPL Analog Code Plug-ins Manual Classic & Dual-Band De-Essers

Rechnergestützte Methoden für die Musikethnologie: Tool time!

Students will be able to properly manage classroom equipment and follow safety procedures.

y POWER USER MUSIC PRODUCTION and PERFORMANCE With the MOTIF ES Mastering the Sample SLICE function

Contents. Welcome to LCAST. System Requirements. Compatibility. Installation and Authorization. Loudness Metering. True-Peak Metering

APPLICATIONS OF A SEMI-AUTOMATIC MELODY EXTRACTION INTERFACE FOR INDIAN MUSIC

Audio Source Separation: "De-mixing" for Production

Chapter 40: MIDI Tool

Dynamic Spectrum Mapper V2 (DSM V2) Plugin Manual

INSTALLATION AND OPERATION INSTRUCTIONS EVOLUTION VIDEO DISTRIBUTION SYSTEM

Voice & Music Pattern Extraction: A Review

timing Correction Chapter 2 IntroductIon to timing correction

Precision DeEsser Users Guide

I, Kent Gibson, state the following, of which I have personal. knowledge: I am the founder of Forensic Audio (ForensicAudio.

Cisco Spectrum Expert Software Overview

Tempo and Beat Analysis

This project will work with two different areas in digital signal processing: Image Processing Sound Processing

Advanced Techniques for Spurious Measurements with R&S FSW-K50 White Paper

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

1. SAFETY & WARRANTY 2 2. WHAT IS CGM 3 3. SYSTEM SETUP (LINKING) Master to Group(s) Group to Channel(s) 3 4. CONNECT THE POWER 4

Introduction To LabVIEW and the DSP Board

SOUND REINFORCEMENT APPLICATIONS

Symphonic Wind Ensemble

Illuminating the home theater experience.

M-16DX 16-Channel Digital Mixer

USER S GUIDE DSR-1 DE-ESSER. Plug-in for Mackie Digital Mixers

PHYSICS OF MUSIC. 1.) Charles Taylor, Exploring Music (Music Library ML3805 T )

PSYCHOACOUSTICS & THE GRAMMAR OF AUDIO (By Steve Donofrio NATF)

SingMai Electronics SM06. Advanced Composite Video Interface: HD-SDI to acvi converter module. User Manual. Revision 0.

Topic 11. Score-Informed Source Separation. (chroma slides adapted from Meinard Mueller)

Recording to Tape (Analogue or Digital)...10

A Must-Have for the Studio

CLA MixHub. User Guide

Quick Setup Guide. DCX-500i / DCX-1000i / DCX-1500i Projectors with WC-Pro Scaler. Preliminary

WAVES H-EQ HYBRID EQUALIZER USER GUIDE

SingMai Electronics SM06. Advanced Composite Video Interface: DVI/HD-SDI to acvi converter module. User Manual. Revision th December 2016

Reference Guide Version 1.0

The MPC X & MPC Live Bible 1

AnalyseThat. AnalyseThat. Blue Cat Audio. by Paul Evans

M-16DX 16-Channel Digital Mixer

Further Topics in MIR

SOUND LABORATORY LING123: SOUND AND COMMUNICATION

I. LISTENING. For most people, sound is background only. To the sound designer/producer, sound is everything.!tc 243 2

AN ARTISTIC TECHNIQUE FOR AUDIO-TO-VIDEO TRANSLATION ON A MUSIC PERCEPTION STUDY

Music Radar: A Web-based Query by Humming System

Pitch-Synchronous Spectrogram: Principles and Applications

Quick Start for TrueRTA (v3.5) on Windows XP (and earlier)

Melody Sauce is an AU / VST / MIDI FX device that creates melodies as MIDI.

DTS Neural Mono2Stereo

Cablecast SX. Setup Guide. c Tightrope Media Systems For Cablecast version Build 206

MAutoPitch. Presets button. Left arrow button. Right arrow button. Randomize button. Save button. Panic button. Settings button

Pitch. The perceptual correlate of frequency: the perceptual dimension along which sounds can be ordered from low to high.

Quick Start manual for Nova

Stafford Township School District Manahawkin, NJ

AudioLava User Guide

Original Marketing Material circa 1976

DOD OWNER'S MANUAL 866 SERIES II GATED COMPRESSOR/LIMITER SIGNAL PROCESSORS

MCM-8 II PRELIMINARY USER GUIDE v1.0

2D/3D Multi-Projector Stacking Processor. User Manual AF5D-21

DA IN 1-OUT LINE DRIVER WITH EQUALIZATION + AUDIO USER S GUIDE

Chameleon Labs Model 7720

Sound Quality. Crossovers. High-Precision Stereo 2-Way/Mono 3-Way Crossover with Subwoofer Output

Collection of Setups for Measurements with the R&S UPV and R&S UPP Audio Analyzers. Application Note. Products:

560A 500 SERIES COMPRESSOR/LIMITER OWNER S MANUAL

DMP3. Users Manuual. Ver. # DMP

Doubletalk Detection

The Dangerous 2-Bus Manual

A Matlab toolbox for. Characterisation Of Recorded Underwater Sound (CHORUS) USER S GUIDE

About your Kobo ereader...6

L+R: When engaged the side-chain signals are summed to mono before hitting the threshold detectors meaning that the compressor will be 6dB more sensit

RF Explorer RackPRO. User Manual. Introduction. Greetings fellow traveler on the RF spectrum.

Abbey Road TG Mastering Chain User Guide

DH400. Digital Phone Hybrid. The most advanced Digital Hybrid with DSP echo canceller and VQR technology.

Overview. A 16 channel frame is shown.

SYNTHESIS FROM MUSICAL INSTRUMENT CHARACTER MAPS

Robert Alexandru Dobre, Cristian Negrescu

Topic 10. Multi-pitch Analysis

SAWM60 AUDIO/VIDEO MODULATOR

DA1909 COMPUTER VIDEO LINE DRIVER WITH EQUALIZATION USER S GUIDE

The Complete Guide to Music Technology using Cubase Sample Chapter

BRTC-M2 COMPRESSOR. CDSoundMaster BIG ROUND TUBE COMPRESSOR BY MX2 MICHAEL HEILER AND MICHAEL ANGEL

2. AN INTROSPECTION OF THE MORPHING PROCESS

Music Perception with Combined Stimulation

Calibrating the timecode signal input

HMA-860H AGILE MODULATOR

Build Applications Tailored for Remote Signal Monitoring with the Signal Hound BB60C

Topics in Computer Music Instrument Identification. Ioanna Karydi

A prototype system for rule-based expressive modifications of audio recordings

PYROPTIX TM IMAGE PROCESSING SOFTWARE

FS1-X. Quick Start Guide. Overview. Frame Rate Conversion Option. Two Video Processors. Two Operating Modes

Period #: 2. Make sure that you re computer s volume is set at a reasonable level. Test using the keys at the top of the keyboard

Transcription:

ADX TRAX Insights on Audio Source Separation Audionamix has served the professional audio community for years, lending expertise and creative solutions to individuals and companies at the forefront of the music, film, and broadcast industries. Delivering pristine voice isolations for everything from blockbuster movies and 3-D animated shorts to virtual duets by legendary artists, and exciting new remixes by top DJs and music producers, the company is positioned as the global leader in audio source separation. Audionamix now provides some insight into the world of audio source separation through the lens of its new software release, ADX TRAX. By Rick Silva (United States) ADX TRAX is designed to help producers, DJs, mash-up artists, and creative audio engineers isolate vocal or melodic motifs to create new hooks, innovative remixes, or virtual duets. The software also enables users to raise or lower the vocal track or a solo instrument in a mix during the mastering process without requiring access to the multi-track master sessions. ADX TRAX enables users to duck vocals or melodic instruments within a mixed track to keep the audience focused on the dialogue, voiceover talent in a commercial, or to meet profanity requirements for radio mixes. ADX TRAX is used by the production specialists at Audionamix to deliver its world renowned vocal isolations. Melodic Audio Source Separation Automatic voice activity detection (AVAD) and melodic audio source separation are the core components of TRAX software. Upon importing an audio file, TRAX quickly analyzes the content and sends all relevant information to Audionamix s cloud-based servers where the AVAD process begins. Once the process is complete, the servers return an estimated pitch guide. What appears in the interface is a zoomed in, spectral view of the melodic content the AVAD process was tracking to perform the automatic separation process. Audionamix calls this view its pitchogram overview. The TRAX pitchogram shows the main melodic content s fundamental frequencies along with the estimated pitch guide used to separate the melody from the music (see Figure 1). Following the AVAD process, the estimated pitch guide is sent through Audionamix s Vocal Extraction algorithm (VEX), then TRAX generates a separation. As a result, you get the first separation almost instantly (see Figure 2). It s important to note that the AVAD process only creates an estimation of the main melodic content. Although it saves time by calculating the pitch guide estimation, AVAD cannot define the melodic content s nuances nearly as accurately as you can. This is where the manual refinement process begins. Once the initial pitch guide appears in the pitchogram, you can see where it missed some of the desired melodic content. To correct these imperfections, choose from the standard editing tools at the bottom of the interface to create a customized pitch guide to zero in on the desired melodic content for separation. In places where the melodic content s amplitude was too low for the AVAD algorithm to track, you can use the piano 28 October 2014 audioxpress.com

roll editor to audition the melody s pitch that you are trying to separate. Then, apply a marquee tool selection to force VEX to separate melodic content within a specifically defined frequency range and time duration. Unlike editing within a full range spectrogram, you are only required to make your pitch guide align with the fundamental frequency of the melodic content you are trying to separate. Then TRAX automatically extrapolates the upper and lower harmonic partials during the separation process. Once all your refinements are complete and you ve created a custom pitch guide, click the Separate button located at the bottom of the Separate screen and a refined separation is created and listed in your separation bin (see Figure 3). Nonlinear Editing, Spectral Editing, and Audio Source Separation Since the release of TRAX, there have been numerous requests for spectral editing features. However, TRAX is source separation software, not a spectral editor. Although they may appear to be very similar, they are indeed very different. A waveform overview display of audio as shown in Figure 2 represents the amplitude of the audio over time. This is a great view for basic cut, copy and pasting techniques to remove or rearrange audio. By zooming in to the sample level, you can even make minor corrections and fix clicks or pops with a pencil tool. Regardless, any edits made to waveform overviews are going to affect the entire frequency range of the audio, and there is no way to edit individual frequencies or even visualize the melody with just this waveform overview. On the other hand, with a spectral editing software s spectrogram view, you can visualize and edit frequency specific content over the full frequency range of audio because it displays time on the X-axis and frequency on the Y-axis. Amplitude is usually displayed by brightness, with louder sounds being brighter and softer sounds being darker. Having this view of the audio is incredibly useful for carefully chipping away from Figure 1: The melodic content is shown within an 80-to-800-Hz frequency range. You can see the automatic estimation of the pitch guide traced over the melodic content. the content you are trying to isolate (e.g., isolating specific melodic content or vocal from a mastered recording). With a spectral view of the audio, you can almost immediately identify melodic content, including its fundamental frequency and most of its harmonic partials, as well as all of the other content a spectral view of the audio reveals (see Figure 4). Through the use of spectral editing tools, you can directly edit by time and frequency, applying gain to specific sounds, attenuate others, or remove them entirely from the frequency spectrum. You can also apply spectral repair, decrackling, and many other useful audio restoration techniques. Using a common spectral editor and applying a lot of time, effort, and tedious audio manipulation, you can achieve a fairly decent level of isolation. The process of Melodic Audio Source Separation in digital signal processing aims to identify a specific Figure 2: The automatic separation is displayed as typical waveform overviews a music file and a vocal file. audioxpress.com October 2014 29

Figure 3: The Separations Window shows the editing tools, Guide Tone controls, Transport, Separate Button, Separation Bins, and separated music and vocal files. Figure 4: A full mix of music in a standard spectogram shows the fundamental frequencies of melodic content, all its harmonic partials, and a visual representation of everything else in the audio file. Figure 5: The spectogram view of an isolated vocal is shown after an ADX TRAX automatic separation is complete and before it is refined. signal from a blend of many different signals that have been mixed together and then separate it from the mixture. For this reason, pure audio source separation is often referred to as the holy grail to almost every professional audio engineer. Audio engineers have known for more than 100 years that once signals are mixed together and combined into one master recording, you cannot go back and un-mix that master recording. That is why audio engineers strive to capture a great, live performance of a band in one take. Otherwise multitrack recorders are needed to access the separate audio stems and re-record any undesirable performances or re-mix unbalanced levels among the instruments or the voices. Un-mixing audio is what Audionamix is known for and ADX TRAX clearly demonstrates the ultimate first step of the un-mixing process. With an understanding that TRAX is a melodic audio source separation program and not a spectral editor, you ll realize that once the TRAX separation and refinement process is complete it is much easier to go through the final steps of audio clean up using spectral editing tools to achieve a stunning vocal isolation. Figure 4 showed a significant amount of information in the spectrogram when importing a full mix. Now compare the same audio clip after it is run through ADX TRAX (see Figure 5). Notice how the melody is much more isolated from the rest of the instrumentation in the audio file. The results shown in Figure 5 occurred just after the ADX TRAX automatic separation process, prior to refinements, and only took a matter of minutes. Simply starting from the results of TRAX in a spectral editor is an incredible time saver when trying to isolate a vocal. To add some perspective, if your goal was to reach this amount of isolation with a typical spectral editor it could take anywhere from two to 10 times the amount of manual effort than using TRAX and more often than not, the result would be completely different from one user to the next. Remember, an incredibly high skill set is required to not damage the file in a typical spectral editor. Additionally, if any of the vocal audio is deleted from the spectral work you ve done, there is no way to retrieve it the only thing that remains is the isolated audio. The rest of the audio has been deleted from your project. Although fundamental frequencies may be easy to identify using a spectrogram view, the target vocal contains many harmonic partials. These harmonic partials are crucial for maintaining the timbre, clarity, and integrity of the voice. When using a spectral editor, most of the upper partials 30 October 2014 audioxpress.com

are difficult to see and isolate. By simply refining the main pitch guide-line using Audionamix s pitchogram, and its user-friendly editing tools, TRAX automatically calculates the harmonic partials of the melody and separates as much of the original voice as possible. When TRAX isolates a vocal, it transfers everything else to a second, complementary file (the music file). These files, when combined, will always equal the sum of the original track. This is one of the key differentiators with the ADX approach. This unique process has immediate every-day use for digital audio enthusiasts. You can raise or lower vocals from a master recording 6 to 9 db and adjust pan positioning in the stereo field without requiring the multi-track master sessions. This is great for touch-ups in the mastering or re-mastering process, creating profanity law remixes, or lowering melodies in expensive sync music so it doesn t pull focus of the center channel dialog in a television or movie mix. Vocalists, guitarists, brass, and woodwind players can even use this software to play along with an actual band. To take melodic isolation and instrumental creation to the next level, Audionamix unveils its unique approach to spectral editing with the release of ADX TRAX 2.0 Spectral. ADX TRAX 2.0 Spectral While the TRAX audio source separation process does most of the heavy lifting when trying to isolate vocals or melodic content, spectral editing tools are perfect for removing unwanted content in your TRAX separations. But remember, with traditional spectral editors, once the information is removed, it s gone for good. So what if you want to put the remaining music from your vocal track, back into your music track to get a better instrumental backing track? And what if there is a bit of missing vocal still in the music track? How can you get that back? ADX TRAX 2.0 Spectral offers a unique approach to spectral editing software that is seamlessly integrated into the familiar interface and workflow of the original TRAX software. Now, you can separate the vocal from the music track, refine and re-process your results, create a comp track to get the best of separation, and use spectral editing tools. Explore pickups and electrical environments that enhance the sound of your Electric Guitar. cc-webshop.com 32 October 2014 audioxpress.com

ADX TRAX 2.0 Spectral includes the most important standard spectral editing tools, but the result is not standard at all. ADX TRAX 2.0 Spectral provides a workflow that remains nondestructive and continues to work between two files (vocal and music). This means any spectral changes you make to one file are automatically reflected in the other. For example, after processing with TRAX you might notice that part of a word has been left out of the vocal file. You can enter the spectral editing view, look at the music file, and identify the missing vocal part. Use a lasso tool to select it, and then use the cut function. You ll see it disappear from the music track, and automatically reappear in the vocal track. You can instantly switch back and forth between tracks to hear the results. This straightforward, non-destructive approach to spectral editing is the type of innovation that has kept high-profile clients requesting professional services from Audionamix for years. With the release of ADX TRAX 2.0 Spectral, Audionamix is finally starting to share their advanced technology and creative workflow with the world. ADX TRAX 2.0 Spectral will be showcased at the 137 th Audio Engineering Society (AES) Convention from October 9 12, 2014 at the Los Angeles Convention Center in Los Angeles, CA. During the convention, ADX TRAX experts will demonstrate ADX TRAX 2.0 Spectral, the all-in-one audio software for creating vocal isolations, instrument samples and instrumental tracks for a multitude of creative uses. a x Author s Note: As a result of the continued support from audioxpress, Audionamix is offering a special 25% off discount on all ADX TRAX plans to all audioxpress subscribers through October 31, 2014. Visit www.audionamix.com/shop and enter the promo code AXpress25 at checkout through the end of October to receive this special offer. We hope you enjoy using ADX TRAX and look forward to providing new and exciting technology to the pro audio community. About the Author R i c k S i l v a i s t h e V i c e P r e s i d e n t o f Production and Product M a n a g e m e n t f o r Audionamix, which is a global leader in audio separation. Based on years of audio signal processing research, the company developed i t s p a t e n t e d A D X Technology. Rick is also a studio owner, published a u t h o r, a n d v i d e o artist for Hal Leonard and Alfred Publishing. Rick works as an audio engineering instructor at the Musicians Institute in Hollywood, CA. He is also a musician/guitar player and a recording, mixing, and mastering engineer. Power Connectors and Cord Sets Visit us at AES booth 1635 offering an extensive range of ready-to-go toroidal transformers to please the ear, but won t take you for a ride. Avel Lindberg Inc. 47 South End Plaza, New Milford, CT 06776 p: 860.355.4711 / f: 860.354.8597 SCHURTER's V-Lock cord retaining system is easy, safe and cost effective. - wide range of international plugs - hundreds of V-Lock compatible inlets, outlets and power entry modules with or without filters (shown with KEA power entry module and extra-safe fusedrawer with integrated 2-pole circuit protection) - ideal for IT, audio and medical equipment where cord retention is required - cul and ENEC approved schurterinc.com/new_pems v-lock.schurter.com audioxpress.com October 2014 33