Semantic Analysis in Language Technology

Similar documents
Word Meaning and Similarity

Word Senses. Slides adapted from Dan Jurafsky and James Mar6n

Lecture: Lexical Semantics

Lexical Semantics. Thesaurus-based. ree years apart, we can see a clear shift in popularity

Introduction to Semantics

CS114 Lecture 15 Lexical Seman3cs

Lecture 13: Chapter 10: Semantics

Chapter 9: Semantics. LANE 321 Content adapted from Yule (2010) Copyright 2014 Haifa Alroqi

What are meanings? What do linguistic expressions stand for or denote?

Introduction to Semantics and Pragmatics Class 3 Semantic Relations

Lexical Categories: Semantics

Ontology and Taxonomy. Computational Linguistics Emory University Jinho D. Choi

Regular Polysemy in WordNet and Pattern based Approach

Introduction to Semantics and Pragmatics Class 3 Semantic Relations

On the Ontological Basis for Logical Metonymy:

Lexical Semantics: Sense, Referent, Prototype. Sentential Semantics (phrasal, clausal meaning)

Introduction to Semantics and Pragmatics Class 4 Semantic Relations and Semantic Features

Semantics: The meaning of words

Language and Inference

Introduction to WordNet, HowNet, FrameNet and ConceptNet

LESSON TWELVE VAGUITY AND AMBIGUITY

Introduction to semantic networks and conceptual graphs

WordFinder. Verginica Barbu Mititelu RACAI / 13 Calea 13 Septembrie, Bucharest, Romania

Instrument and experiencer. Location, source and goal. Lexical relations

A picture of the grammar. Sense and Reference. A picture of the grammar. A revised picture. Foundations of Semantics LING 130 James Pustejovsky

Word Sense Disambiguation in Queries. Shaung Liu, Clement Yu, Weiyi Meng

Compound Noun Polysemy and Sense Enumeration in WordNet

TABLE OF CONTENTS. #3996 Daily Warm-Ups: Language Skills 2 Teacher Created Resources, Inc.

Lire Journal: Journal of Linguistics and Literature Volume 3 Nomor 2 October 2018

Metonymy in Grammar: Word-formation. Laura A. Janda Universitetet i Tromsø

Meaning 1. Semantics is concerned with the literal meaning of sentences of a language.

Semantics. Philipp Koehn. 16 November 2017

TABLE OF CONTENTS. Free resource from Commercial redistribution prohibited. Language Smarts TM Level D.

Lecture (04) CHALLENGING THE LITERAL

Key - Worksheet 3 Linguistics Eng B

A Dictionary Of Synonyms And Antonyms By Joseph Devlin

Motif Definition and Classification to Structure Non-linear Plots and to Control the Narrative Flow in Interactive Dramas

Language Arts Study Guide Week 1, 8, 15, 22, 29

TJHSST Computer Systems Lab Senior Research Project Word Play Generation


Creating Mindmaps of Documents

The Cognitive Nature of Metonymy and Its Implications for English Vocabulary Teaching

Affect-based Features for Humour Recognition

BIO + OLOGY = PHILEIN + ANTHROPOS = BENE + VOLENS = GOOD WILL MAL + VOLENS =? ANTHROPOS + OLOGIST = English - Language Arts Step 6

Clusters and Correspondences. A comparison of two exploratory statistical techniques for semantic description

Georgia Performance Standards for Second Grade

arxiv: v1 [cs.cl] 24 Oct 2017

Table of Contents TABLE OF CONTENTS

Helping Metonymy Recognition and Treatment through Named Entity Recognition

organise (dis- is a prefix and ed is a suffix.) What is the root word in disorganised?

Contents. sample. Unit Page Enrichment. 1 Conditional Sentences (1): If will Noun Suffixes... 4 * 3 Infinitives (1): to-infinitive...

English Language Arts 600 Unit Lesson Title Lesson Objectives

Taxonomy Displays Bridging UX & Taxonomy Design. Content Strategy Seattle Meetup April 28, 2015 Heather Hedden

UNIVERSITY OF SWAZILAND FACULTY OF HUMANITIES DEPARTMENT OF ENGLISH LANGUAGE AND LITERATURE SECOND SEMESTER FINAL EXAMINATION PAPER MAY 2017

Foundations in Data Semantics. Chapter 4

A Cognitive Account of the Lexical Polysemy of Chinese Kai Flora Yu-Fang Wang Graduate Institute of English, National Taiwan Normal University

Sentiment Analysis of English Literature using Rasa-Oriented Semantic Ontology

Alice in Wonderland. Great Illustrated Classics Reading Comprehension Worksheets. Sample file

Fry Instant Phrases. First 100 Words/Phrases

By Mrs. Paula McMullen Library Teacher Norwood Public Schools

Useful Definitions. a e i o u. Vowels. Verbs (doing words) run jump

Corpus evidence for a lexical account of the English conative construction

Sentences and prediction Jonathan R. Brennan. Introduction to Neurolinguistics, LSA2017 1

The Ontological Character of Classes in the Dewey Decimal Classification. Rebecca Green Michael Panzer OCLC Online Computer Library Center, Inc.

Ontology-based Distinction between Polysemy and Homonymy

Quiz. Dictionaries and indexes. Level A Circle the right answer for each question. 1) You use a dictionary to find the meanings of words.

Metonymy Research in Cognitive Linguistics. LUO Rui-feng

LANGUAGE ARTS GRADE 3

Dictionary Of Synonyms And Antonyms With Discriminations By Albert C. & Kitchen, Paul C. Baugh

Homographic Puns Recognition Based on Latent Semantic Structures

English Skills Practice and Apply: Grade 5

Rhetorical Questions and Scales

Contents. Teaching Guidelines...4. Lessons. Appendix. Contents 3

Lauderdale County School District Pacing Guide Sixth Grade Language Arts / Reading First Nine Weeks

Grammar Reteaching Prepositional Phrases

Contents. Section 1. Section 2. Section 3

Subject: English Grade: V Year: Year Planner Text book Used: The English Connection Month & No. of Teaching Periods March/ April (19)

CHAPTER II LITERATURE REVIEW. the problems of the study to support this thesis. They are :

Commas - 1. Name: The comma will put a PAUSE in your sentence. The comma allows you to combine 2 IDEAS into one sentence.

EMPOWERING TEACHERS. Instructional Example LA We are going identify synonyms for words. TEACHER EXPLAINS TASK TEACHER MODELS TASK

Automatically Extracting Word Relationships as Templates for Pun Generation

Power Words come. she. here. * these words account for up to 50% of all words in school texts

"Ways Verbal Play such as Storytelling and Word-games Can Be Used for Teaching-and-learning Languages"

Tuesday January 15th, In your comp books on a new sheet of paper on your bellwork side--label the page Parts of Speech Notes

Song Lessons Understanding and Using English Grammar, 3rd Edition. A lesson about adjective, adverb, and noun clauses (Chapters 12, 13, 17)

Johnny Appleseed by Steven Kellogg Vocabulary Practice and Craftivity. Created by Gay Miller

The First Hundred Instant Sight Words. Words 1-25 Words Words Words

Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures

Word: multipurpose. Who s or Whose? : Fifth Grade Weekly Spiral LA homework sheet week 2. Name Date. Monday Tuesday Wednesday Thursday

1 Family and friends. 1 Play the game with a partner. Throw a dice. Say. How to play

Intersemiotic Complementarity: A Framework for Multimodal Discourse Analysis

BOL 1 - BASICS OF LANGUAGE - ENGLISH DRAFT FOR PUBLICATION MAY 9, 2007

WPRD FORMATION IN THE NOWADAYS WRITTEN MEDIA DERIVATION WITHIN THE PERIOD SUMMARY

Relational Logic in a Nutshell Planting the Seed for Panosophy The Theory of Everything

Basic Natural Language Processing

LESSON 26: DEPENDENT CLAUSES (ADVERB)

Add note: A note instructing the classifier to append digits found elsewhere in the DDC to a given base number. See also Base number.

Witty, Affective, Persuasive (and possibly Deceptive) Natural Language Processing

Support Activities. Annotated Teacher s Edition. Level 4. Columbus, OH

Lyricon: A Visual Music Selection Interface Featuring Multiple Icons

Transcription:

Spring 2017 Semantic Analysis in Language Technology Word Senses Gintare Grigonyte gintare@ling.su.se Department of Linguistics Stockholm University, Sweden

Acknowledgements Most slides borrowed from: Dan Jurafsky and James H. Martin Some slides borrowed from D. Jurafsky and C. Manning and D. Radev (Coursera) J&M(2015, draft): https://web.stanford.edu/~jurafsky/slp3/ (Slides material based on SAIS 2014-16 by Santini)

Outline Word Meaning WordNet 3

Definitions Lexical semantics is the study of the meaning of words and the systematic meaning-related connections between words. A word sense is the locus of word meaning; definitions and meaning relations are defined at the level of the word sense rather than wordforms. Homonymy is the relation between unrelated senses that share a form. Polysemy is the relation between related senses that share a form. Synonymy holds between different words with the same meaning. Hyponymy and hypernymy relations hold between words that are in a class inclusion relationship. Meronymy type of hierarchy that deals with part whole relationships. WordNet is a large database of lexical relations for English 4

Word Meaning and Similarity Word Senses and Word Relations

Reminder: lemma and wordform A lemma or citation form Same stem, part of speech, rough semantics A wordform 6 The inflected word as it appears in text Wordform banks sung duermes Lemma bank sing dormir Cf. token/type ratio: crude measure of lexical densitiy: If a text is 1,000 words long, it is said to have 1,000 "tokens". But a lot of these words will be repeated, and there may be only say 400 different words in the text. "Types", therefore, are the different words. The ratio between types and tokens in this example would be 40%. (source: wordsmith tools)

Lemmas have senses One lemma bank can have many meanings: Sense 1: Sense 2: a bank 1 can hold the investments in a custodial account as agriculture burgeons on the east bank the river 2 will shrink even more Sense (or word sense) A discrete representation of an aspect of a word s meaning. The lemma bank here has two senses 7

8 Other examples?

Homonymy Homonyms: words that share a form but have unrelated, distinct meanings: bank 1 : financial institution, bank 2 : sloping land bat 1 : club for hitting a ball, 1. Homographs (bank/bank, bat/bat) 2. Homophones: 1. Write and right 2. Piece and peace bat 2 : nocturnal flying mammal 9

Homonymy causes problems for NLP applications Information retrieval bat care Machine Translation bat: šikšnosparnis (animal) or beisbolo lazda (baseball) Text-to-Speech bass (stringed instrument) vs. bass (fish) There would be no ambiguity for Speech to Text: why? 10

1. The bank was constructed in 1875 out of local red brick. 2. I withdrew the money from the bank Are those the same sense? Sense 2: A financial institution Sense 1: The building belonging to a financial institution A polysemous word has related meanings Most non-rare words have multiple meanings 11

12 Polysemy

Lots of types of polysemy are systematic School, university, hospital All can mean the institution or the building. A systematic relationship: Building Organization Other such kinds of systematic polysemy: Author (Jane Austen wrote Emma) 13 Metonymy or Systematic Polysemy: A systematic relationship between senses Works of Author (I love Jane Austen) Tree (Plums have beautiful blossoms) Fruit (I ate a preserved plum)

How do we know when a word has more than one sense? The zeugma test: Two senses of serve? Which flights serve breakfast? Does Lufthansa serve Philadelphia??Does Lufthansa serve breakfast and San Jose? Since this conjunction sounds weird, we say that these are two different senses of serve 14

Synonyms Word that have the same meaning in some or all contexts. filbert / hazelnut couch / sofa big / large automobile / car vomit / throw up Water / H 2 0 Two lexemes are synonyms if they can be substituted for each other in all situations If so they have the same propositional meaning 15

Synonyms But there are few (or no) examples of perfect synonymy. Even if many aspects of meaning are identical Still may not preserve the acceptability based on notions of politeness, slang, register, genre, etc. Example: Water/H 2 0 Big/large Brave/courageous high brow: latinate words 16

Synonymy is a relation between senses rather than words Consider the words big and large Are they synonyms? How big is that plane? Would I be flying on a large or small plane? How about here: Miss Nelson became a kind of big sister to Benjamin.?Miss Nelson became a kind of large sister to Benjamin. Why? big has a sense that means being older, or grown up large lacks this sense 17

18 Synonymy: Summary

19 Other semantic relations

Antonyms Senses that are opposites with respect to one feature of meaning Otherwise, they are very similar! dark/light short/long fast/slow rise/fall hot/cold up/down in/out More formally: antonyms can define a binary opposition or be at opposite ends of a scale long/short, fast/slow Be reversives: 20 rise/fall, up/down

Hyponymy and Hypernymy One sense is a hyponym of another if the first sense is more specific, denoting a subclass of the other car is a hyponym of vehicle mango is a hyponym of fruit Conversely hypernym/superordinate ( hyper is super ) vehicle is a hypernym of car fruit is a hypernym of mango Superordinate/hyper vehicle fruit furniture Subordinate/hyponym car mango chair 21

Hyponymy more formally Extensional: The class denoted by the superordinate extensionally includes the class denoted by the hyponym Entailment: A sense A is a hyponym of sense B if being an A entails being a B Hyponymy is usually transitive (A hypo B and B hypo C entails A hypo C) Another name: the IS-A hierarchy A IS-A B (or A ISA B) B subsumes A 22

Hyponyms and Instances WordNet has both classes and instances. An instance is an individual, a proper noun that is a unique entity San Francisco is an instance of city But city is a class city is a hyponym of municipality...location... 23

WordNet

25 WordNet

26 Synsets

How is sense defined in WordNet? The synset (synonym set), the set of near-synonyms, instantiates a sense or concept, with a gloss Example: chump as a noun with the gloss: a person who is gullible and easy to take advantage of This sense of chump is shared by 9 words: chump 1, fool 2, gull 1, mark 9, patsy 1, fall guy 1, sucker 1, soft touch 1, mug 2 Each of these senses have this same gloss (Not every sense; sense 2 of gull is the aquatic bird) 27 gullible=naive

28 Tree-like Structure

29 WordNet: bar 1/6

30 2/6

31 3/6

32 4/6

33 5/6

34 6/6

35 Polysemy

WordNet 3.0 A hierarchically organized lexical database On-line thesaurus + aspects of a dictionary Some other languages available or under development (Arabic, Finnish, German, Portuguese ) 36 Category Unique Strings Noun 117,798 Verb 11,529 Adjective 22,479 Adverb 4,481

37 Senses of bass in Wordnet

38 WordNet Hypernym Hierarchy for bass

39 WordNet Noun Relations

WordNet 3.0 Where it is: http://wordnetweb.princeton.edu/perl/webwn Libraries Python: WordNet from NLTK http://www.nltk.org/home Java: JWNL, extjwnl on sourceforge

The end