Clusters and Correspondences. A comparison of two exploratory statistical techniques for semantic description

Similar documents
Introduction to Semantics and Pragmatics Class 3 Semantic Relations

winter but it rained often during the summer

Metonymy in Grammar: Word-formation. Laura A. Janda Universitetet i Tromsø

Re-appraising the role of alternations in construction grammar: the case of the conative construction

ANALYTICAL GRAMMAR (UNIT #12) NOTES-PAGE 25 GERUND PHRASES. DEFINITION: A GERUND is a verb ending in ing which is used as a noun.

Beware of Dog: Verbs, cont.

Introduction It is now widely recognised that metonymy plays a crucial role in language, and may even be more fundamental to human speech and cognitio

Metonymic Patterns for WOMEN across Time: A Usage-based Approach to Visualizations of Language Change

The structure of this ppt

LESSON 30: REVIEW & QUIZ (DEPENDENT CLAUSES)

The structure of this ppt

63 In QetQ example, heart is classified as noun: singular, common, abstract Homophones: sea/sea 68 Homophones: sea/see

Rhythm and Melody Aspects of Language and Music

Two Styles of Construction Grammar Do Ditransitives

Metonymy Research in Cognitive Linguistics. LUO Rui-feng

Lauderdale County School District Pacing Guide Sixth Grade Language Arts / Reading First Nine Weeks

Cluster Analysis of Internet Users Based on Hourly Traffic Utilization

2. Second Person for Third Person: [ You = Someone - does not exist in Greek!] (... = you, the Christians I am writing to)

tech-up with Focused Poetry

Keeping an eye on the data: metonymies and their patterns *

The ACL Anthology Network Corpus. University of Michigan

Tuesday January 15th, In your comp books on a new sheet of paper on your bellwork side--label the page Parts of Speech Notes

Information processing in high- and low-risk parents: What can we learn from EEG?

Week Objective Suggested Resources 06/06/09-06/12/09

Paper 1 Question 2. L.O. To build our knowledge of language techniques and to practise our ability to analyse writer s language choices.

The Cognitive Nature of Metonymy and Its Implications for English Vocabulary Teaching

Semantic Analysis in Language Technology

Corpus evidence for a lexical account of the English conative construction

Contents. Section 1 VERBS...57

DIRECT AND REPORTED SPEECH

Scope and Sequence for NorthStar Listening & Speaking Intermediate

Randolph High School English Department Vertical Articulation of Writing Skills

Centre for Economic Policy Research

READY-TO-GO REPRODUCIBLES

Grammar is a way of thinking about language. Grammar is a way of thinking about language.

PARTICIPIAL PHRASES: EXERCISE #1

Sentence Processing III. LIGN 170, Lecture 8

Today we are going to look at techniques to revise and polish technical manuscripts.

MIDTERM EXAMINATION Spring 2010

Paper 1 Question 2. L.O. To build our knowledge of language techniques and to practise our ability to analyse writer s language choices.

(Week 13) A05. Data Analysis Methods for CRM. Electronic Commerce Marketing

7. The English Caused-Motion Construction. Presenter: 林岱瑩

Unit Topic and Functions Language Skills Text types 1 Found Describing photos and

Articulating Medieval Logic, by Terence Parsons. Oxford: Oxford University Press,

CHAPTER 2 REVIEW OF RELATED LITERATURE. advantages the related studies is to provide insight into the statistical methods

Longman Academic Writing Series 4

Grade eight exit benchmarks TEST Form A Section one: Literature terms: matching

Summit 1. Test Unit 1

Metonymy Determining the Type of the Direct Object

Adisa Imamović University of Tuzla

Design for Information

Spanish Language Programme

A Cognitive Account of the Lexical Polysemy of Chinese Kai Flora Yu-Fang Wang Graduate Institute of English, National Taiwan Normal University

On Meaning. language to establish several definitions. We then examine the theories of meaning

Similarities in Amy Tans Two Kinds

Tropes and the Semantics of Adjectives

Standard 2: Listening The student shall demonstrate effective listening skills in formal and informal situations to facilitate communication

Expressing Space in Estonian. Synonymous Locative Constructions in Estonian. Grammatical Synonymy 14/11/2010

ก ก ก ก ก ก ก ก. An Analysis of Translation Techniques Used in Subtitles of Comedy Films

Language and Mind Prof. Rajesh Kumar Department of Humanities and Social Sciences Indian Institute of Technology, Madras

Descriptive adjectives: - ed vs -ing. LEVEL NUMBER LANGUAGE Intermediate B1_2055G_EN English

Reviewed by Charles Forceville. University of Amsterdam, Dept. of Media and Culture

The structure of this ppt. Structural and categorial (and some functional) issues: English Hungarian

AP LANGUAGE & COMPOSITION SUMMER PROJECT

Regression Model for Politeness Estimation Trained on Examples

Emphasis. Get the reader to NOTICE! (cannot be sound, interjection, or dialogue) The thought was there. Pain. That pain did not stop the murder.

LA CAFÉ. 25 August Could I designate a person to set ipad timer for 9:50 every Monday 8A and 10:42 8B?

Arts, Computers and Artificial Intelligence

English Language Arts 600 Unit Lesson Title Lesson Objectives

Introduction to Semantics and Pragmatics Class 4 Semantic Relations and Semantic Features

Noun Phrase Modifications by Adverb Clauses*

LESSON 26: DEPENDENT CLAUSES (ADVERB)

10 Common Grammatical Errors and How to Fix Them

Helping Metonymy Recognition and Treatment through Named Entity Recognition

The verbal group B2. Grammar-Vocabulary WORKBOOK. A complementary resource to your online TELL ME MORE Training Learning Language: English

General Educational Development (GED ) Objectives 8 10

ANALYTICAL GRAMMAR (UNIT #17) NOTES-PAGE 35 NOUN CLAUSES. surprised. 2.) art n hv lv pro av The champion will be whoever wins.

Name: Date: Verbal Phrases

Cambridge Primary English as a Second Language Curriculum Framework mapping to English World

Compare and Contrast Fables

Lire Journal: Journal of Linguistics and Literature Volume 3 Nomor 2 October 2018

BBM 413 Fundamentals of Image Processing Dec. 11, Erkut Erdem Dept. of Computer Engineering Hacettepe University. Segmentation Part 1

Visual Encoding Design

District of Columbia Standards (Grade 9)

Coolios gangster paradise came out when rap and hip hop was were taking over

Write down the date when you first study a unit or section in Oxford Word Skills Advanced, then write down the date when you study it again.

Code : is a set of practices familiar to users of the medium

LESSON TWELVE VAGUITY AND AMBIGUITY

Seminar 6 Clarity vs Ambiguity & Vagueness

Linking words B2. Grammar-Vocabulary WORKBOOK. A complementary resource to your online TELL ME MORE Training Learning Language: English

LIKE, LOVE, HATE +ING

SAMPLE. Grammar, punctuation and spelling. Paper 1: short answer questions. English tests KEY STAGE LEVELS. First name. Middle name.

3/26/2013. Midterm. Anna Loparev Intro HCI 03/21/2013. Emotional interaction. (Ch 1, 10) Usability Goals

Toward Computational Recognition of Humorous Intent

Lecture 13: Chapter 10: Semantics

UNIT 13: STORYTIME (4 Periods)

Here we go again. The Simple Past tense, is a simple tense to describe actions occurred in the past or past experiences.

Introduction to Semantics and Pragmatics Class 3 Semantic Relations

Positioning and Stance

English nominalizations ending in suffixes -hood and -ness in the framework of cognitive linguistics

Transcription:

Clusters and Correspondences. A comparison of two exploratory statistical techniques for semantic description Dylan Glynn University of Leuven RU Quantitative Lexicology and Variational Linguistics

Aim of Study Compare two simple techniques for exploratory multivariate analysis of semantic structure Show that quantitative semantic analysis is possible

Cognitive Linguistics Symbolic unit Form-meaning pairs - no formal modules (Langacker 1987, Fillmore & al. 1988) Encyclopaedic semantics No semantic modules meaning is all conception and perception (Fillmore 1985, Lakoff 1987) Entrenchment No grammar language is usage no language system, social langue, or individual competence

Quantitative Approaches to Semantic Structure within Cognitive Linguistics Polysemy Lexical Gries (2006) run Glynn (2008) hassle Synonymy Constructional Gries (1999) VPCxs, Heylen (2005) Middle Field Cxs, Grondelaers & al. (2007) 'there' Cxs Speelman & Geeraerts (forth.) Causative Cxs Lexical Divjak (2006) intend verbs, Divjak & Gries (2006) try verbs, Newman & Rice (2004) posture verbs Newman & Rice (2004) prepositions

Hierarchical Cluster Analysis HCA shows grouping 2-way tables agglomerative distance matrix possibility of significance testing (via bootstrapping) HCA visualisation dendograms different distance measures = emphasis different groupings discrete groups = misleading semantic description

Cluster Analysis

Multiple Correspondence Analysis MCA shows correlations n-way tables canonical correlation distance matrix MCA visualisation correspondence maps proximity = correlation conflated multiple spaces = misleading proximity

Multiple Correspondence Analysis transitive Intrans Adjectivee Intrans w/o ob Trans w/o ob

Corpus and Annotation LiveJournal Corpus Online personal diaries Very large, unparsed British vs. American is distinguished, but little register variation Some gender bias toward woman, probably restricted to middle class, 15-25 year olds. Annotation 3 parameters- Semantic, Formal, and Social 120 values 20 variables 2000 occurrences

Breaking Down Lemmata Transitive Saw quite a few people I knew, including the awful stalker guy who's been hassling me... Transitive Oblique If you hassle me about my kinky hair, I'll cut it all off. hat in hand, humble, almost begging. Intransitive Officer McCoy, me and him was hassling and my gun went off, hitting him somewhere... Nominal Mass... because it saves all that ammoying hassle of SOD'S-BLOODY-LAW!!!!!! Nominal Count I rarely paint my nails(it can be such a hassle!) Adjective Attributive It's a very hassily event to do. Adjective Predicative She will not take part in Saturday's 5000m race, saying she is tired and bothered Gerund the technical know-how to do this sort of hassling...

Breaking Down Lemmata Form Occurrences Count Noun hassle (hassle_count) 146 Mass Noun hassle (hassle_mass) 217 Gerund hassle (hassle_gerund) 40 Predicative Adjective bother (bother_pred) 124 Intransitive bother (bother_intrans) 222 Transitive annoy (annoy_trans) 449 Transitive hassle (hassle_trans) 274 Transitive bother (bother_trans) 275

Agent Type Indirect Semantic Variable: Agent Type - Human Specific so im hassling you instead of your mum, haha! - Human Non-Specific but we started to have more people hassling us. - Institution Well, the Church bothers me quite often, - Activity - Event It bothers me everytime by boyfreind talks to, or about his ex girlfriends -Thing I pulled it out but the mouse annoys me too much... - Abstract State of Affairs I have been open to him about everything else except that part.. however, it bothers me and I'm caught in between

Agglomerative Hierarchical Cluster Analysis (Dist: Euclidean/ Met: Average) Construction-Lexeme Agent Type

Multiple Correspondence Analysis Construction-Lexeme Agent Type

Agglomerative Hierarchical Cluster Analysis "pvclust" 2 kinds of p-values: AU (Approximately Unbiased) determined by multiscale bootstrap resampling BP (Bootstrap Probability) value determined by normal bootstrap resampling.

PV Agglomerative Hierarchical Cluster Analysis (Dist: Euclidean/ Met: Ward) Construction-Lexeme Agent Type

Direct Semantic Variables: Cause, Affect, Humour Cause of Event - expenditure of energy - imposition - imposition / request - interruption - request - condemnation - tease Affect on Patient - anger - repetition / boring - concern - thought - emotional pain - physical pain Humour - - Use of humour in the example - No use of humour in the example

Agglomerative Hierarchical Cluster Analysis (Dist: Euclidean/ Met: Average) Construction-Lexeme Dialect Cause Affect Humour less forms

Multiple Correspondence Analysis Construction-Lexeme Dialect Cause Affect Humour - less forms

PV Agglomerative Hierarchical Cluster Analysis (Dist: Euclidean/ Met: Ward) Construction-Lexeme Cause Affect - less forms

Bivariate Correspondence Analysis bother trans Construction-Lexeme Cause Affect - less forms

Russian Adjectival Constructions Discrepancies between HCA and MCA

Russian Adjectival Constructions Discrepancies between HCA and MCA

Bivariate Correspondence Analysis Construction-Lexeme Cause Affect - less forms

Detail of Correspondence Analysis Usage Cluster 1 Class Form Transitive annoy Transitive bother Affect Features anger repetition concern thought emotional pain physical pain interruption aesthetic

Detail of Correspondence Analysis Usage Cluster 2 Class Forms Transitive hassle Cause - Affect Features imposition request imposition request tease condemn

Detail of Correspondence Analysis Usage Cluster 3 Class Forms Count Noun hassle Mass Noun hassle Gerund hassle Adjective bother Intransitive bother Affect Features energy agitation

Summary Pros and Cons for HCA and MCA in Quantitative Approaches to Cog. Sem. HCA - groups usage patterns relative to features + Possibility for significance testing + Clear visualisations - 'Blind' Clustering - Discrete Grouping MCA - maps usage patterns relative to visualised features + Analogue representation of associations + Correlations visible - Misleading visualisations - No significance testing

Summary Quantitative Semantic Study A combination of formal, indirect semantic and direct semantic tagging is possible and can produce coherent verifiable results Although semantic analysis is more subjective than formal analysis, if we are to describe all of language, then we should also include semantic features

for further information: http://wwwling.arts.kuleuven.ac.be/qlvl/ http://perswww.kuleuven.be/dylan_glynn