Machine Translation and Advanced Topics on LSTMs

Similar documents
Machine Translation: Examples. Statistical NLP Spring MT: Evaluation. Phrasal / Syntactic MT: Examples. Lecture 7: Phrase-Based MT

Machine Translation Part 2, and the EM Algorithm

Statistical NLP Spring Machine Translation: Examples

Machine Translation: Examples. Statistical NLP Spring Levels of Transfer. Corpus-Based MT. World-Level MT: Examples

CSE 517 Natural Language Processing Winter 2013


The decoder in statistical machine translation: how does it work?

Copyright TeachMe.com a3dd2 1

Copyright TeachMe.com 72db6 1

Vigil at Goldfish Pond Lynn MA June 19, 2016 Poem by Rosemie Leyre. I grieve We grieve

Statistical Machine Translation Lecture 5. Decoding with Phrase-Based Models

Learning to translate with source and target syntax. David Chiang, USC Information Sciences Institute

Pre-Translation for Neural Machine Translation

Intonational meaning in Spanish conversation: low-rising vs. circumflex questions. Francisco Torreira & Simeon Floyd

CLASS NOTES and TEST REVIEW. SPANISH 1 UNIDAD 1 ETAPA 1 S. DePastino

Less is More: Picking Informative Frames for Video Captioning

How to write a first draft novel >>>CLICK HERE<<<

O APÓSTOLO. LEARNING RESOURCE 3rd and 4th Level suitable for S1 S4

Carlos Cabana Lesson Transcript - Part 11

CHAPTER 8 REVIEW. Spanish I

2012 Molly Martin, MD. All rights reserved. docmolly.com

DRAFT. Proposal to modify International Standard IEC

MITOCW watch?v=rkvem5y3n60

The Music of Poetry in the Caribbean. Penny Riffe and Nelson Salazar Eau Gallie High School. Brevard County, Florida

Slide Set 9. for ENCM 501 in Winter Steve Norman, PhD, PEng

EMC Publishing s Deutsch Aktuell 1, 6E Correlated to IDAHO CONTENT STANDARDS GRADE 7-12 HUMANITIES: WORLD LANGUAGES - LEVEL 1

Good afternoon! My name is Swetha Mettala Gilla you can call me Swetha.

ก ก ก ก ก ก ก ก. An Analysis of Translation Techniques Used in Subtitles of Comedy Films

High Value Applications and High Growth Markets for Printed Electronics

Discriminative and Generative Models for Image-Language Understanding. Svetlana Lazebnik

Gerunds and Infinitives

US_Math 4 1. Operations and Algebraic Thinking 4.OA 2. Number and Operations in Base Ten 4.NBT 3. Number and Operations - Fractions 4.

Lecture 2 Video Formation and Representation

GRE Task Force on Substitutes / Retrofits (TF S/R)

Generating Chinese Classical Poems Based on Images

Ari Muhonen 1. Invisible Library

Western Statistics Teachers Conference 2000

HCC class lecture 8. John Canny 2/23/09

Today is Monday. Yesterday was. Tomorrow will be. Today is Friday. Yesterday was. Tomorrow will be. Today is Wednesday.

Coimisiún na Scrúduithe Stáit State Examinations Commission. Leaving Certificate Marking Scheme. Spanish. Ordinary Level

Course Organizer. This Course. Performance Tasks. Course Questions: The. First Year Spanish

Corazón sin control (Los Hermanos de Buckhorn) (Spanish Edition)

Nombre Hora Repaso del otoño 2015

Español I. Introduccion. Nombres Colores y Formas Frases útiles Números 1-20 Interrogativas

Measurement of automatic brightness control in televisions critical for effective policy-making

LED driver architectures determine SSL Flicker,

Sick at school. (Enferma en la escuela) Spanish. List of characters. (Personajes) Girl with bike rider (Niña con ciclista) Student name.

The Relationship Between Movie Theatre Attendance and Streaming Behavior. Survey insights. April 24, 2018

Liberacion Sobrenatural: Libertad para tu Alma, Mente y Emociones (Supernatural Deliverance: Freedom for Your Soul Mind And Emotions Spanish Edition)

Audio: Generation & Extraction. Charu Jaiswal

1. Take today s notes 2. En silencio, sientate 3. Vamonos! In English, escribe about why time is important. Use the questions to prompt you.

arxiv: v1 [cs.cl] 11 Aug 2017

I. The Preterit: used for past actions that are seen as. It is used for completed actions that can be viewed as single events.

A Fast Alignment Scheme for Automatic OCR Evaluation of Books

North Carolina Standard Course of Study - Mathematics

A Need for Universal Audio Terminologies and Improved Knowledge Transfer to the Consumer

Introduction to NLP. Ruihong Huang Texas A&M University. Some slides adapted from slides by Dan Jurafsky, Luke Zettlemoyer, Ellen Riloff

LIST OF CONNECTORS AND TRANSITIONAL WORDS

HEVC Subjective Video Quality Test Results

Correlation to the Common Core State Standards

Learning fun with.

HEBS: Histogram Equalization for Backlight Scaling

06 1 MIPS Implementation Pipelined DLX and MIPS Implementations: Hardware, notation, hazards.

First Step Towards Enhancing Word Embeddings with Pitch Accents for DNN-based Slot Filling on Recognized Text

Basic Natural Language Processing

PHABLABS 4.0 Gobo Batman/Slide projector

Research & Development of Surface-Discharge Color Plasma Display Technologies. Tsutae Shinoda

I Messages. 3 Cs F. Preparation. Vocabulary. Lesson at a Glance

Maths-Whizz Investigations Paper-Back Book

The ACL Anthology Network Corpus. University of Michigan

The employment intensity of film and television production in Canada

Course Schedule from Oct. 2, 2017 to Dec. 31, 2018

You must hand in a summary of the reading book The Bird of Happiness and Other Wise Tales and this worksheet the day of the exam (20% of the mark)

SWITCHED INFINITY: SUPPORTING AN INFINITE HD LINEUP WITH SDV

Marc I. Johnson, Texture Technologies Corp. 6 Patton Drive, Hamilton, MA Tel

IV: INTEGRATION OF GOODS MARKETS

Regression Model for Politeness Estimation Trained on Examples

ALA 2012 Concordia University, Montreal

Departments of Real Estate National University of Singapore GUIDELINES FOR THE PREPARATION OF UNDERGRADUATE DISSERTATIONS

Contents. Introduction. Skyworks Solutions (SWKS) Cypress Semiconductor (CY) Sierra Wireless (SWIR) Silicon Labs (SLAB) Rockwell Automation (ROK)

Announcements. HW2 directory structure penalty to be removed due to grading inconsistencies.

Towards Using Hybrid Word and Fragment Units for Vocabulary Independent LVCSR Systems

Persistence Of Memory By Winona Kent READ ONLINE

Expressive performance in music: Mapping acoustic cues onto facial expressions

Automated sound generation based on image colour spectrum with using the recurrent neural network

El Principito (Spanish Edition) By Antoine de Saint-Exupery

Music Composition with RNN

Avoiding False Pass or False Fail

SPONSORSHIP OPPORTUNITIES

ECEN 667 Power System Stability Lecture 5: Transient Stability Intro

Vámonos. Conjugate the following verbs in the PRESENT tense! Think about Spanish 1. Do NOT use the preterite tense.

Research Article. ISSN (Print) *Corresponding author Shireen Fathima

PIPELINING: BRANCH AND MULTICYCLE INSTRUCTIONS

Achieve Accurate Critical Display Performance With Professional and Consumer Level Displays

Improving the Level on English Translation Strategies for Chinese Cultural Classics Fenghua Li

Iva Petrinska, Dilyan Ivanov, Valchan Georgiev TU Sofia, Bulgaria. September 25, 2017

ProSpanish. Vocabulary Course. made easy by ProSpanish. ProSpanish

Slide Set 8. for ENCM 501 in Winter Term, Steve Norman, PhD, PEng

Mini-dictionary. Verbs to Describe Research

Sony vegas pro 11 rendering settings

Transcription:

Machine Translation and Advanced Topics on LSTMs COSC 7336: Advanced Natural Language Processing Fall 2017 Some content on these slides was borrowed from Riloff, Money, and Socher and Manning.

Announcements Reminder: Paper presentation sign up coming up Presentation slides due Nov. 9th 11:59pm Link: https://www.dropbox.com/request/2cwaeglmimqo5dqgymwp Final Project Proposals due Nov. 10th! What is the problem What kind of data do you have available What approach you plan to use Link: https://www.dropbox.com/request/yfkwhgs0c22iqzletjea

Today s lecture Short intro to Machine Translation (MT) Challenges in MT Pre-Deep Learning Era Sequence to Sequence models with RNN Attention Translation using seq2seq models

Machine Translation (MT)

MT Definition Transform input text s, in source language a, into an equivalent text t in target language b. Good translation: Faithful Natural Many practical reasons for MT

Example Translations from Google Translate There is a lot at night. The oil lamps, which hang from a nail in front of the door, but the light floats like a bright almond tree, it is difficult to shake, it is terrible, unstable, to keep the dark deposit around it and the house up and down. until the last corners, where the darkness is so thick that it seems solid. The night has much to last. The oil lamp, hanging from a nail next to the door, is lit, but the flame, like a luminous almond tree floating, barely manages, tremulous, unstable, to hold the dark mass that surrounds it and fills the house from top to bottom, until the last corners, where the darkness, so thick, seems to have become solid.

Example Translations from Google Translate La noche tiene aún mucho que durar. El candil de aceite, colgado de un clavo al lado de la puerta, está encendido, pero la llama, como una almendrilla luminosa flotante, apenas consigue, trémula, inestable, sostener la masa oscura que la rodea y llena de arriba abajo la casa, hasta los últimos rincones, allí donde las tinieblas, de tan espesas, parecen haberse vuelto sólidas.

What makes MT difficult?

What makes MT difficult?

What makes MT difficult? Differences between languages (2) Syntactic divergences Subject-Verb-Object (SVO) like English SOV like Hindi and Japanese VSO languages like Irish and Arabic

What makes MT difficult? Differences between languages (2) Allowable omissions Pro-drop languages regularly omit subjects that must be inferred [Tu madre]i llamó en la tarde. qi Dijo que te esperaba a comer mañana. Your mother] called this afternoon. [She] said she will see you tomorrow for lunch.

What makes MT difficult? Differences between languages (3) Lexical divergences that require specification John plays the guitar. John toca la guitarra. John plays tennis. John juega tennis. The singer wore a purple attire La cantante usó un traje morado El cantante usó un traje morado.

What makes MT difficult? Differences between languages (4)

MT Approaches

Statistical MT (SMT) Before DL, best methods were SMT Trained on large amounts of parallel data But: Canadian Hansard European parliament corpora

SMT A good translation should be faithful and fluent, Final objective:

Noisy Channel Model for SMT

SMT Formulation following Bayes rule:

SMT

Phrase-Based SMT A good way to compute P(F E) is by considering the behavior of phrases

Phrase-Based SMT Base P(F E) on translating phrases in E to phrases in F. First segment E into a sequence of phrases ē1, ē1,,ēi Then translate each phrase ēi, into fi, based on translation probability Φ(fi ēi) Then reorder translated phrases based on distortion probability d(i) for the ith phrase.

Translation Probabilities Assume a phrase aligned parallel corpus is available or constructed that shows matching between phrases in E and F. Then compute (MLE) estimate of f based on simple frequency counts.

Alignment To train the translation model we need to know which words belong to which other words in the target language It s a really hard problem!

Alignment (2)

Alignment (3)

Decoding Assuming we have solved the alignment problem we can then estimate phrase translation probabilities What s next?

After Alignment There s a Lot More!

After Alignment There s a Lot More!

Evaluation of MT Systems

Evaluation of MT Systems Human subjective evaluation is the best but is time-consuming and expensive. Automated evaluation comparing the output to multiple human reference translations is cheaper and correlates with human judgments.

Automatic Evaluation of MT Collect one or more human reference translations of the source. Compare MT output to these reference translations. Score result based on similarity to the reference translations. BLEU NIST TER METEOR

BLEU Determine number of n-grams of various sizes that the MT output shares with the reference translations. Compute a modified precision measure of the n-grams in MT result.

BLUE Example

BLUE Example

Modified N-gram Precision Average n-gram precision over all n-grams up to size N (typically 4) using geometric mean.

Brevity Penalty c = total length of the candidate translation corpus r = effective reference length

BLEU Score Final BLEU Score: BLEU = BP x p Cand 1: Mary no slap the witch green. Best Ref: Mary did not slap the green witch.

Discussion Points SMT was state-of-the-art before Deep NLP Evaluation metrics can be improved SMT relies heavily on parallel corpora