The Visual Denotations of Sentences. Julia Hockenmaier with Peter Young and Micah Hodosh University of Illinois


The Visual Denotations of Sentences Julia Hockenmaier with Peter Young and Micah Hodosh juliahmr@illinois.edu University of Illinois

Sentence-Based Image Description and Search (Hodosh, Young, and Hockenmaier, JAIR 2013)

Task: Image Description
- Two boys are playing football.
- People in a line holding lit roman
- A little girl is enjoying the swings
- A motorbike is racing around
- A boy in a yellow uniform c
- An elephant is being wa

Conceptual image descriptions...
- describe the depicted entities, events, scenes
- only describe what can be seen from the image
- may differ in the amount of detail

Why not caption generation?
- For image and language understanding, the semantic question of whether a sentence describes an image or not is fundamental.
- Natural Language Generation has additional syntactic and pragmatic aspects that detract from the semantic question.
- Natural Language Generation is much harder to evaluate.

Task: Image Search
- Two boys are playing football.
- People in a line holding lit roman
- A little girl is enjoying the swings
- A motorbike is racing around
- A boy in a yellow uniform c
- An elephant is being wa

Tags: Discovery Cove Férias Orlando Florida USA EUA Vacations
Description: Vacation at Discovery Cove. My experience at Discovery Cove in Orlando, FL

Our data: 8,000 Flickr images (~32K in the extended collection), each annotated with 5 crowdsourced captions.

Our captions
- Four basketball players in action.
- Young men playing basketball in a competition.
- Four men playing basketball, two from each team.
- Two boys in green and white uniforms play basketball with two boys in blue and white uniforms.
- A player from the white and green highschool team dribbles down court defended by a player from the other team.

Our model: Kernel CCA
[Diagram: images (kernel Ki(Di, ·), projection Wi) and sentences (kernel Ks(Ds, ·), projection Ws) are mapped into a shared space. Example sentences: "Sooners football player wears the number 28 and black armbands...", "A boy jumps from one bed to another..."]
KCCA for image description:
1. Project (unseen) images and sentences into the shared space.
2. Rank sentences by their distance to the query image.
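The ranking step can be illustrated with a minimal sketch. It assumes images and sentences have already been projected into the shared space (the KCCA projection itself is not shown). The slide only says "distance", so the use of cosine distance here is an assumption, as are the function names and the hand-made toy vectors.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def rank_sentences(
    query_image: Sequence[float],
    sentences: Dict[str, Sequence[float]],
) -> List[Tuple[str, float]]:
    """Rank candidate sentences by distance to the projected query image."""
    scored = [
        (text, cosine_distance(query_image, vec))
        for text, vec in sentences.items()
    ]
    return sorted(scored, key=lambda pair: pair[1])


# Hypothetical coordinates in a 3-dimensional shared space (made up).
candidates = {
    "A boy jumps from one bed to another.": [0.9, 0.1, 0.0],
    "Two boys are playing football.": [0.1, 0.9, 0.1],
    "A man climbs up a sheer wall of ice.": [0.0, 0.2, 0.9],
}
query = [0.8, 0.2, 0.1]  # projected query image (made up)
ranking = rank_sentences(query, candidates)
```

Image search would work the same way with the roles swapped: the query is a projected sentence and the candidates are projected images.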

Experimental Results

Rate of success (S@k)

           Image annotation                 Image search
Model      S@1       S@5       S@10         S@1       S@5       S@10
NN          5.8 ***  15.3 ***  20.1 ***      4.9 ***  12.9 ***  18.1 ***
BoW1       12.2 ***  30.3 ***  39.7 ***     11.4 ***  30.5 ***  40.2 ***
BoW5       15.0 *    34.1 **   42.7 ***     12.1 ***  31.5 ***  40.8 ***
TagRank    16.2      34.2 **   42.9 ***     12.4 ***  31.5 ***  41.6 ***
Tri5       16.4      32.9 ***  43.4 ***     13.1 **   33.1 **   43.8 ***
Tri5Sem    16.6      37.7      49.1         15.7      36.9      48.5

S@k: Percentage of test items for which the top k results contain a relevant item.
For almost half of all unseen images (or captions), the first ten results include a good caption (image).
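The S@k definition can be made concrete with a short sketch. The function name, the data layout (a ranked result list and a set of relevant items per test item), and the toy data are assumptions for illustration only.

```python
from typing import Dict, List, Set


def success_at_k(
    rankings: Dict[str, List[str]],
    relevant: Dict[str, Set[str]],
    k: int,
) -> float:
    """Percentage of test items whose top-k results contain a relevant item."""
    hits = sum(
        1
        for item, ranked in rankings.items()
        if any(result in relevant[item] for result in ranked[:k])
    )
    return 100.0 * hits / len(rankings)


# Hypothetical toy data: three query images, each with a ranked caption list.
rankings = {
    "img1": ["c3", "c1", "c7"],
    "img2": ["c2", "c9", "c4"],
    "img3": ["c8", "c6", "c5"],
}
relevant = {"img1": {"c1"}, "img2": {"c2"}, "img3": {"c0"}}

s_at_1 = success_at_k(rankings, relevant, k=1)  # only img2 succeeds
s_at_2 = success_at_k(rankings, relevant, k=2)  # img1 and img2 succeed
```

By construction S@k can only grow with k, which matches the pattern in every row of the table.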

Score: 4 (No errors) A girl wearing a yellow shirt and sunglasses smiles. A man climbs up a sheer wall of ice.

Score: 3 (Minor errors) A boy jumps into the blue pool water. A child jumping on a tennis court.

Score: 2 (Major errors) A dog in a grassy field, looking up. A boy in a blue life jacket jumps into the water.

Score: 1 (Unrelated) Basketball players in action. A black dog with a purple collar running.

Back to semantics...

Implied Semantics Language L Images I

Denotational Semantics Language L Universe U

Denotational Semantics
The denotation of a (declarative) sentence is the set of all possible worlds in which it is true:
$\llbracket s \rrbracket = \{\, w \in U : s \text{ is true in } w \,\}$

Visual denotations
The visual denotation of a (descriptive) sentence is the set of all images for which it is a correct description:
$\llbracket s \rrbracket = \{\, i \in I : s \text{ describes (part of) } i \,\}$
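As a minimal sketch of this definition, a visual denotation can be computed as a plain set of image ids. The corpus layout (image id mapped to the descriptions known to hold for it), the function name, and the toy data are assumptions for illustration.

```python
from typing import Dict, Set


def visual_denotation(sentence: str, describes: Dict[str, Set[str]]) -> Set[str]:
    """Return the set of image ids that `sentence` correctly describes."""
    return {
        image for image, captions in describes.items() if sentence in captions
    }


# Hypothetical toy corpus: image id -> descriptions known to hold for it.
describes = {
    "img1": {"a child plays", "a girl plays", "a girl plays on the beach"},
    "img2": {"a child plays", "a child plays guitar"},
    "img3": {"a dog runs"},
}

den_child = visual_denotation("a child plays", describes)
den_girl = visual_denotation("a girl plays", describes)
```

In this toy corpus the denotation of the more specific sentence is a subset of the denotation of the more generic one.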

Denotation Graph
1. Normalize captions:
- Spelling; capitalization
- Lemmatization
- Normalize determiners
2. Make captions more generic:
- Replace nouns by hypernyms
- Drop modifiers (adjectives, adverbs, PPs)
3. Extract VPs and NPs
This yields a large subsumption hierarchy of (partial) image descriptions.
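Step 2 (making captions more generic) can be sketched on a toy example. This is not the authors' pipeline: it assumes captions are already lemmatized and tagged by hand, uses a made-up hypernym lexicon, and omits PP removal and VP/NP extraction. All names are hypothetical.

```python
from typing import Dict, List, Set, Tuple

Token = Tuple[str, str]  # (lemma, coarse tag); tags are hand-assigned here

# Hypothetical one-step hypernym lexicon (a real system needs a lexical resource).
HYPERNYMS: Dict[str, str] = {"girl": "child", "boy": "child", "child": "person"}


def drop_modifiers(tokens: List[Token]) -> List[Token]:
    """Remove adjectives and adverbs (PP removal is omitted in this sketch)."""
    return [(w, t) for w, t in tokens if t not in {"ADJ", "ADV"}]


def replace_nouns(tokens: List[Token]) -> List[List[Token]]:
    """Return one variant per noun that has a hypernym, replacing that noun."""
    variants = []
    for i, (word, tag) in enumerate(tokens):
        if tag == "NOUN" and word in HYPERNYMS:
            variants.append(tokens[:i] + [(HYPERNYMS[word], tag)] + tokens[i + 1:])
    return variants


def generalize(tokens: List[Token]) -> Set[str]:
    """Collect the caption and all more generic captions reachable from it."""
    seen: Set[str] = set()
    frontier = [tokens]
    while frontier:
        current = frontier.pop()
        text = " ".join(word for word, _ in current)
        if text in seen:
            continue
        seen.add(text)
        frontier.append(drop_modifiers(current))
        frontier.extend(replace_nouns(current))
    return seen


caption = [("a", "DET"), ("little", "ADJ"), ("girl", "NOUN"), ("play", "VERB")]
generic = generalize(caption)
```

Every string in `generic` would become a node that the original image belongs to, which is how a single caption can contribute to many denotations.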

Subsumption hierarchy (example nodes)
- A child plays
- A child plays guitar
- A girl plays
- A child plays on the beach
- A child plays soccer
- A girl plays on the playground
- A girl plays on the beach
- A child in red plays on the beach

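Statistics
Original data (~32,000 images): ~160K distinct captions
Denotation graph: ~1500K distinct captions:
- ~280K captions with $|\llbracket s \rrbracket| \geq 2$
- ~40K captions with $|\llbracket s \rrbracket| \geq 5$
- ~19K captions with $|\llbracket s \rrbracket| \geq 10$
- ~1.7K captions with $|\llbracket s \rrbracket| \geq 100$
- 142 captions with $|\llbracket s \rrbracket| \geq 1000$, e.g. person play instrument, woman standing, ...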

Applications
- Better models for image description/search?
- Better models of natural language semantics?

Denotational similarities: p(VP1 | VP2)
- p(talk | engage in conversation) = 0.79
- p(play tennis | swing racket) = 0.82
- p(stand | wait for subway) = 0.58
- p(sit | ride subway) = 0.56
- p(stand | lean against building) = 0.53
- p(shave | look in mirror) = 0.41
- p(dig hole | use shovel) = 0.38
- p(make face | stick out tongue) = 0.38
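One natural reading of these values is a conditional probability estimated from overlapping visual denotations: the fraction of images described by VP2 that are also described by VP1. The slide does not spell out the estimator, so the sketch below is an assumption, as are the function name and the toy image ids.

```python
from typing import Dict, Set


def conditional_probability(
    vp1: str, vp2: str, denotations: Dict[str, Set[str]]
) -> float:
    """Estimate p(vp1 | vp2) as |den(vp1) & den(vp2)| / |den(vp2)|."""
    den1, den2 = denotations[vp1], denotations[vp2]
    if not den2:
        raise ValueError(f"empty denotation for {vp2!r}")
    return len(den1 & den2) / len(den2)


# Hypothetical denotations: phrase -> ids of images it describes (made up).
denotations = {
    "play tennis": {"i1", "i2", "i3", "i4", "i5"},
    "swing racket": {"i2", "i3", "i4", "i5", "i6"},
}

p = conditional_probability("play tennis", "swing racket", denotations)
```

Under this reading the measure is asymmetric in general, since the denominator depends on which phrase is conditioned on.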

Future/Ongoing work
- Using denotational similarities: e.g. for Textual Similarity, Entailment Recognition
- Capturing compositionality in our models: integrate with (syntactic) grammar induction for Combinatory Categorial Grammar (Bisk and Hockenmaier 2012, 2013)
- Improving coverage of the denotation graph: reduce sparsity of existing captions; add more images (using other resources?)