Speaker Recognition: Building the Mixer 4 and 5 Corpora

Linda Brandschain, Christopher Cieri, David Graff, Abby Neely, Kevin Walker
{brndschn, ccieri, graff, aneely, walkerk}@ldc.upenn.edu
University of Pennsylvania, Linguistic Data Consortium

Motivation
- Mixer supports R&D of speaker recognition systems robust to variation in:
  - language: Arabic, Mandarin, Russian, Spanish
  - channel: telephone + 8 to 14 microphones
  - conversational situation: telephone conversation, interviews, reading words, phrases, sentences, transcripts, written texts
- Mixer 4: channel variation
- Mixer 5: channel and conversational situation

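Comparison of Phases
Phases compared: SB, M1, M2, M3, M4, M5
Features compared:
- Core Calls (8+)
- Variable Environments
- Unique Handset (4+)
- Extended Data (20+)
- Multilingual (4+)
- Cross Channel (2 or 4)
- Transcript Reading (2+)
- Interviews (6)
[Table: the per-phase marks are not preserved in this transcript]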

Mixer Platform Design
- Mixer platform designed to address changing telephony
- Issues encountered
  - increased cell phone use
  - inexpensive domestic and international calling rates
  - rise in use of call forwarding and call screening
- Solutions
  - reduce hours of the study
  - exploit all lines available to robot operator
  - reduce impediments to matching subjects: allow any pairing, including duplicates
  - over-recruit: set goals 20-25% higher than required by project sponsors
  - lower per-call payment; large completion bonuses
  - encourage subjects to give true, narrow availability schedule
  - increase robot activity to combat increased miss ratio
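The over-recruitment rule (goals 20 to 25% above the sponsor requirement) is simple arithmetic, sketched below. The function name `over_recruit_range` is hypothetical, and using the subject goals stated later for Mixer 4 (400) and Mixer 5 (300) as inputs is an assumption made only for illustration; the slides do not say whether those figures are the sponsor requirements or the already padded targets.

```python
def over_recruit_range(required, low_pct=20, high_pct=25):
    """Return (low, high) recruitment targets, rounded up to whole subjects.

    Integer arithmetic is used so the result does not depend on
    floating-point rounding.
    """
    def target(pct):
        # Ceiling of required * (100 + pct) / 100 using floor division.
        return -(-required * (100 + pct) // 100)

    return target(low_pct), target(high_pct)


# Illustrative inputs only (assumption): the stated subject goals.
for study, goal in {"Mixer 4": 400, "Mixer 5": 300}.items():
    low, high = over_recruit_range(goal)
    print(f"{study}: goal {goal}, recruit {low} to {high}")
```

Under these assumed inputs the targets come out to 480 to 500 subjects for a goal of 400, and 360 to 375 for a goal of 300.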

Protocol

Diagram of Platform Protocol

Mixer Call Platform
- Mixer 4 & 5 conducted simultaneously
- Studies began when participant pool >= 200
- 40 topics cycled
  - current political and social issues, religion, hobbies, sports, etc.
  - no penalty for speaking off topic so long as conversation is topical
  - participants could refuse call after hearing the topic of the day
- Auditing
  - calls audited for length, sound quality, quantity/suitability of speech
  - participants who reached their goal were deactivated

Cross Channel Interview Room
[Diagram: positions of recording channels 01 to 14 relative to the subject and the interviewer]

Cross Channel Recording Room

Multi-Channel Set-Up
Ch  Microphone                 Placement (Subject/Reference)
1   Shure MX185 Lavalier       Interviewer
2   Shure MX185 Lavalier       Subject
3   Etymotic Micro-array       Interviewer
4   Shure MX418X Podium        Desk Front Center
5   Crown PZM-6D               Desk Top Center
6   Audio Technica AT3035      Desk Front Right
7   Audio Technica Pro45       Hanging Center
8   Panasonic Camcorder        Desk Top Right
9   RODE NT6                   Desk Front Far Left
10  RODE NT6                   Desk Front Center Left
11  RODE NT6                   Desk Front Center Right
12  RODE NT6                   Desk Front Center Far Right
13  AcoustiMagic Array         Wall Mounted Center
14  Lightspeed Headset         Subject

Mixer 4
- Mixer 4 was designed to support speaker recognition research and technology evaluations
- Demographics of subject pool
  - native speakers of American English
  - 25% from Philadelphia
  - 25% from Berkeley
  - 50% from the entire US, although we recruited heavily in Georgia, Texas, Illinois, and New York
- Original goals for Mixer 4
  - 400 subjects, each making ten 10-minute phone calls
  - 200 subjects visiting one of our two sites, where they completed 2 cross-channel calls
  - 100 participants asked to complete extended data calls (20 x 10-minute phone calls)

Mixer 4 Call Yields
Total Calls                233
Total Minutes              17,200
Total Hours                287
Subjects with 10+ Calls    233
Subjects with 20+ Calls    52
[Chart: number of speakers (0 to 140) by calls made (1 to 22)]

Mixer 5
- Mixer 5 focused on cross-channel recordings of face-to-face interviews, where the goal is to elicit speech within a variety of situations.
- Demographics of subject pool
  - native language undefined, but participants had to be fluent in English
  - approximately 50% recruited from Philadelphia, PA
  - approximately 50% recruited from Berkeley, CA
- Goals for Mixer 5
  - 300 participants
  - each participant must complete 6 half-hour sessions over no fewer than 6 days
  - a 30-minute break between sessions was mandatory
  - each of the 300 participants must also complete 10 ten-minute phone calls
  - foreign language calls were encouraged but not required
  - bonuses were issued for the completion of 4 unique phone calls
- High/Low Vocal Effort phone calls
  - ~1/3 of Mixer 5 participants completed these calls
  - Lightspeed XLC-20 headphones provide 40 dB passive acoustic isolation
  - High Vocal Effort: input audio is 65 dB and relative levels of the mix components are 30% side-tone, 40% remote speaker, and 30% white noise
  - Low Vocal Effort: input audio is 65 dB with no white noise
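As a rough illustration of the high-vocal-effort headphone mix, the sketch below combines three signals with the 30/40/30 proportions given above. It assumes the percentages are linear amplitude weights, which the slides do not state. The names `white_noise`, `mix`, and `HIGH_EFFORT_WEIGHTS` are hypothetical and do not describe the actual collection software, and no low-effort weights are defined because the slides give none.

```python
import random

# Relative levels stated for the high-vocal-effort condition.
# Assumption: these are linear amplitude weights.
HIGH_EFFORT_WEIGHTS = {"side_tone": 0.30, "remote": 0.40, "noise": 0.30}


def white_noise(n, seed=0):
    """Return n independent uniform samples in [-1, 1] as a noise stand-in."""
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]


def mix(side_tone, remote, noise, weights):
    """Weighted sample-by-sample sum of three equal-length signals."""
    if not (len(side_tone) == len(remote) == len(noise)):
        raise ValueError("signals must have equal length")
    return [
        weights["side_tone"] * s + weights["remote"] * r + weights["noise"] * z
        for s, r, z in zip(side_tone, remote, noise)
    ]


# Toy signals standing in for the subject's own voice and the remote talker.
side = [1.0, 0.0, -1.0, 0.5]
remote = [0.0, 1.0, 0.0, -0.5]
feed = mix(side, remote, white_noise(len(side)), HIGH_EFFORT_WEIGHTS)
```

Because the weights sum to 1, three full-scale inputs cannot push the mixed feed beyond full scale.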

Mixer 5 Interview Protocol
Activity                    Minutes per session       Min
Repeating Questions         1, 1, 1, 1, 1, 1          6
Warm-up                     4                         4
Family Personal             5                         5
Informal Conversation       20, 9, 14, 9, 9, 9        70
Transcript Reading          20, 15, 10, 15, 10        70
Story Reading               5                         5
Sentence Reading            5                         5
Phrase/Word List Reading    5                         5
Low Vocal Effort            5                         5
High Vocal Effort           4                         4
Total Session               30, 30, 30, 30, 30, 30    180
Rows with six values list sessions 1 to 6 in order; for the other rows, the session columns are not preserved in this transcript.
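The protocol figures can be cross-checked with a few sums. The sketch below uses only the numbers as transcribed here, and the variable names are hypothetical. The rows that list all six sessions agree with their totals, but the per-activity totals add up to 179 minutes, one short of the stated 180. The sketch surfaces that gap and does not try to resolve it.

```python
# Rows that list a value for every session, with their stated totals.
full_rows = {
    "Repeating Questions": ([1, 1, 1, 1, 1, 1], 6),
    "Informal Conversation": ([20, 9, 14, 9, 9, 9], 70),
    "Total Session": ([30, 30, 30, 30, 30, 30], 180),
}

# The "Min" column for every activity, as transcribed.
activity_totals = {
    "Repeating Questions": 6,
    "Warm-up": 4,
    "Family Personal": 5,
    "Informal Conversation": 70,
    "Transcript Reading": 70,
    "Story Reading": 5,
    "Sentence Reading": 5,
    "Phrase/Word List Reading": 5,
    "Low Vocal Effort": 5,
    "High Vocal Effort": 4,
}

# Check each fully listed row against its stated total.
row_checks = {
    name: sum(values) == total for name, (values, total) in full_rows.items()
}

# Transcript Reading lists five values; they still sum to the stated 70.
transcript_reading_ok = sum([20, 15, 10, 15, 10]) == 70

# Compare the sum of activity totals with the stated grand total.
stated_total = 180
activity_sum = sum(activity_totals.values())
gap = stated_total - activity_sum

print(row_checks)
print(f"activity totals sum to {activity_sum}; stated total {stated_total}; gap {gap}")
```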

Mixer 5 Prompter

Mixer 5 Call Yields
Total Calls                2919
Total Minutes              14595
Total Hours                243
Subjects with 10+ Calls    245
[Chart: number of speakers (0 to 300) by calls made (1 to 10+)]

Mixer 5 Interview Yields
Total Interviews              1874
Total Minutes                 56220
Total Hours                   937
Subjects with 6+ Interviews   276
[Chart: number of speakers (0 to 300) by interviews completed (1 to 6+)]
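The reported hour totals follow from the minute totals, and the interview minutes follow from the 30-minute session length. The sketch below reproduces that arithmetic from the figures on the yield slides. It assumes hours were rounded to the nearest whole hour, and the helper `minutes_to_hours` is a hypothetical name.

```python
def minutes_to_hours(minutes):
    """Convert minutes to hours, rounded to the nearest whole hour."""
    return round(minutes / 60)


# Minute and hour totals as reported on the yield slides.
reported = {
    "Mixer 4 calls": {"minutes": 17200, "hours": 287},
    "Mixer 5 calls": {"minutes": 14595, "hours": 243},
    "Mixer 5 interviews": {"minutes": 56220, "hours": 937},
}

for name, row in reported.items():
    computed = minutes_to_hours(row["minutes"])
    print(f"{name}: {row['minutes']} min = {computed} h (reported {row['hours']} h)")

# Each Mixer 5 interview session lasts 30 minutes.
SESSION_MINUTES = 30
interview_minutes = 1874 * SESSION_MINUTES
print(f"1874 interviews x {SESSION_MINUTES} min = {interview_minutes} min")
```

Under that rounding assumption, all three reported hour totals match the computed ones, and 1874 half-hour interviews give exactly the reported 56220 minutes.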

Future Work
- Mixer 1 & 2 in LDC publication pipeline
- Mixer 3 used in SRE06 & LRE07; remainder reserved for future evaluation
- Mixer 4
  - collection underway
  - part used in SRE08; remainder reserved for future evaluation
- Mixer 5
  - interview collection ahead of schedule
  - phone call collection also well underway
  - part used in SRE08; remainder reserved for future evaluation
- Mixer 6 (Graybeard)
  - subjects from previous CTS collection return to join
- Potential new studies
  - conduct Mixer 5 style interviews in other languages
  - conduct studies like Mixer 1 & 2 but involving other languages
- All Mixer data will be published after its use in technology evaluations.