An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews

Similar documents
Influence of lexical markers on the production of contextual factors inducing irony

Harnessing Context Incongruity for Sarcasm Detection

Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm

KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection

#SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm

World Journal of Engineering Research and Technology WJERT

Sarcasm Detection in Text: Design Document

Automatic Detection of Sarcasm in BBS Posts Based on Sarcasm Classification

arxiv: v1 [cs.cl] 8 Jun 2018

Detecting Sarcasm in English Text. Andrew James Pielage. Artificial Intelligence MSc 2012/2013

Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing

The final publication is available at

Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons

Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder

LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally

Acoustic Prosodic Features In Sarcastic Utterances

arxiv: v2 [cs.cl] 20 Sep 2016

저작권법에따른이용자의권리는위의내용에의하여영향을받지않습니다.

Are Word Embedding-based Features Useful for Sarcasm Detection?

Sarcasm Detection on Facebook: A Supervised Learning Approach

Implementation of Emotional Features on Satire Detection

arxiv: v1 [cs.cl] 3 May 2018

NLPRL-IITBHU at SemEval-2018 Task 3: Combining Linguistic Features and Emoji Pre-trained CNN for Irony Detection in Tweets

A New Analysis of Verbal Irony

Automatic Sarcasm Detection: A Survey

Towards a Contextual Pragmatic Model to Detect Irony in Tweets

Figurative Language Processing in Social Media: Humor Recognition and Irony Detection

DICTIONARY OF SARCASM PDF

This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis.

How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text

Some Experiments in Humour Recognition Using the Italian Wikiquote Collection

Modelling Sarcasm in Twitter, a Novel Approach

Harnessing Sequence Labeling for Sarcasm Detection in Dialogue from TV Series Friends

The Lowest Form of Wit: Identifying Sarcasm in Social Media

Inducing an Ironic Effect in Automated Tweets

Mining Subjective Knowledge from Customer Reviews: A Specific Case of Irony Detection

Modelling Irony in Twitter: Feature Analysis and Evaluation

Formalizing Irony with Doxastic Logic

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

SARCASM DETECTION IN SENTIMENT ANALYSIS

TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION

SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1

ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews

Finding Sarcasm in Reddit Postings: A Deep Learning Approach

Really? Well. Apparently Bootstrapping Improves the Performance of Sarcasm and Nastiness Classifiers for Online Dialogue

Large scale Visual Sentiment Ontology and Detectors Using Adjective Noun Pairs

Dynamic Allocation of Crowd Contributions for Sentiment Analysis during the 2016 U.S. Presidential Election

A Pragmatic Study of the Recognition and Interpretation of Verbal Irony by Malaysian ESL Learners

Document downloaded from: This paper must be cited as:

Sparse, Contextually Informed Models for Irony Detection: Exploiting User Communities, Entities and Sentiment

Communication Mechanism of Ironic Discourse

Outline. Why do we classify? Audio Classification

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset

Modeling Satire in English Text for Automatic Detection

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

LLT-PolyU: Identifying Sentiment Intensity in Ironic Tweets

A COMPREHENSIVE STUDY ON SARCASM DETECTION TECHNIQUES IN SENTIMENT ANALYSIS

REPORT DOCUMENTATION PAGE

Clues for Detecting Irony in User-Generated Contents: Oh...!! It s so easy ;-)

The Roles of Politeness and Humor in the Asymmetry of Affect in Verbal Irony

This is an author-deposited version published in : Eprints ID : 18921

Semantic Role Labeling of Emotions in Tweets. Saif Mohammad, Xiaodan Zhu, and Joel Martin! National Research Council Canada!

INGEOTEC at IberEval 2018 Task HaHa: µtc and EvoMSA to Detect and Score Humor in Texts

ABSOLUTE OR RELATIVE? A NEW APPROACH TO BUILDING FEATURE VECTORS FOR EMOTION TRACKING IN MUSIC

PREDICTING HUMOR RESPONSE IN DIALOGUES FROM TV SITCOMS. Dario Bertero, Pascale Fung

An extensive Survey On Sarcasm Detection Using Various Classifiers

Do we really know what people mean when they tweet? Dr. Diana Maynard University of Sheffield, UK

SemEval-2015 Task 11: Sentiment Analysis of Figurative Language in Twitter

Universität Bamberg Angewandte Informatik. Seminar KI: gestern, heute, morgen. We are Humor Beings. Understanding and Predicting visual Humor

Who would have thought of that! : A Hierarchical Topic Model for Extraction of Sarcasm-prevalent Topics and Sarcasm Detection

Figurative Language Processing: Mining Underlying Knowledge from Social Media

Affect-based Features for Humour Recognition

Sentiment Aggregation using ConceptNet Ontology

Approaches for Computational Sarcasm Detection: A Survey

Neural Network Predicating Movie Box Office Performance

Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest

Implicit Display Theory of Verbal Irony: Towards A Computational Model of Irony

Music Mood. Sheng Xu, Albert Peyton, Ryan Bhular

Tweet Sarcasm Detection Using Deep Neural Network

A COMPUTATIONAL MODEL OF IRONY INTERPRETATION

Verbal Ironv and Situational Ironv: Why do people use verbal irony?

2 o Semestre 2013/2014

Understanding People in Low Resourced Languages

First Stage of an Automated Content-Based Citation Analysis Study: Detection of Citation Sentences 1

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

Analysis of laughables: a preliminary perception study

Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers

Sarcasm as Contrast between a Positive Sentiment and Negative Situation

Using Genre Classification to Make Content-based Music Recommendations

arxiv:submit/ [cs.cv] 8 Aug 2016

Detecting Sarcasm on Twitter: A Behavior Modeling Approach. Ashwin Rajadesingan

VBM683 Machine Learning

Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections

MUSI-6201 Computational Music Analysis

Predicting Time-Varying Musical Emotion Distributions from Multi-Track Audio

Computational Laughing: Automatic Recognition of Humorous One-liners

Perceptual dimensions of short audio clips and corresponding timbre features

Francesco Barbieri. Machine Learning Methods for Understanding Social Media Communication: Modeling Irony and Emojis TESI DOCTORAL UPF / ANY 2017

Detecting Musical Key with Supervised Learning

Transcription:

Universität Bielefeld June 27, 2014 An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews Konstantin Buschmeier, Philipp Cimiano, Roman Klinger Semantic Computing Group, CIT-EC, Bielefeld University Slides are available at http://www.roman-klinger.de/talks/irony.pdf

Outline 1 Introduction 2 Method 3 Experiments 4 Summary Buschmeier, Cimiano, Klinger 2 / 35

Introduction Outline 1 Introduction 2 Method 3 Experiments 4 Summary Buschmeier, Cimiano, Klinger 3 / 35

Introduction What is Irony? Merriam Webster Dictionary, 2014 (excerpt) the use of words that mean the opposite of what you really think especially in order to be funny (verbal irony) the use of words to express something other than and especially the opposite of the literal meaning a situation that is strange or funny because things happen in a way that seems to be the opposite of what you expected (situational irony) incongruity between the actual result of a sequence of events and the normal or expected result Buschmeier, Cimiano, Klinger 3 / 35

Introduction What is Irony? Examples (1) Thanks that you took care of the dirty dishes. Buschmeier, Cimiano, Klinger 4 / 35

Introduction What is Irony? Examples (2) [Scene from breaking bad.] He might be upset. Buschmeier, Cimiano, Klinger 5 / 35

Introduction What is Irony? Examples (3) Buschmeier, Cimiano, Klinger 6 / 35

Introduction What is Irony? Examples (4) Buschmeier, Cimiano, Klinger 7 / 35

Introduction What is sarcasm? Merriam Webster Dictionary, 2014 (excerpt) a sharp and often satirical or ironic utterance designed to cut or give pain Buschmeier, Cimiano, Klinger 8 / 35

Introduction Irony markers and factors S. Attardo (2000). Irony Markers and Functions: Towards a Goal-oriented Theory of Irony and its Processing. In: Rask: Internationalt Tidsskrift for Sprog og Kommunikation Irony factors... are essential for irony to happen Irony markers... are marking the occurrence in irony Irony can happen without markers! Buschmeier, Cimiano, Klinger 9 / 35

Introduction Irony in product reviews (1) From a review for a movie Read the book! From a review for a book i would recomend this book to friends who have insomnia or those who i absolutely despise. Ironic Environment A. Utsumi (2000). Verbal irony as implicit display of ironic environment: Distinguishing ironic utterances from nonirony. In: Journal of Pragmatics Buschmeier, Cimiano, Klinger 10 / 35

Introduction Examples (2)... Pros: Fits my girthy frame, has wolves on it, attracts women Cons: Only 3 wolves [... ], cannot see wolves when sitting with arms crossed, wolves would have been better if they glowed in the dark. Buschmeier, Cimiano, Klinger 11 / 35

Introduction Examples (3) Buschmeier, Cimiano, Klinger 12 / 35

Introduction Examples (4) Buschmeier, Cimiano, Klinger 13 / 35

Introduction Why detect Irony? Error reduction by sarcasm detection in polarity detection of tweets D. Maynard et al. (2014). Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis. In: LREC Supports understanding of irony in language It is fun. Buschmeier, Cimiano, Klinger 14 / 35

Introduction Previous Work Definitions of Irony A. Utsumi (2000). Verbal irony as implicit display of ironic environment: Distinguishing ironic utterances from nonirony. In: Journal of Pragmatics D. Wilson et al. (2012). Explaining Irony. In: Meaning and Relevance H. H. Clark et al. (1984). On the pretense theory of irony. In: Journal of Experimental Psychology: General S. Kumon-Nakamura et al. (1995). How About Another Piece of Pie: The Allusional Pretense Theory of Discourse Irony. In: Journal of Experimental Psychology: General Buschmeier, Cimiano, Klinger 15 / 35

Introduction Previous Work Automatically Detecting Irony (excerpt) Feature Impact analysis in Twitter F. Barbieri et al. (2014). Modelling Irony in Twitter: Feature Analysis and Evaluation. In: LREC A. Reyes et al. (2011). Mining subjective knowledge from customer reviews: a specific case of irony detection. In: WASSA@ACL R. González-Ibáñez et al. (2011). Identifying sarcasm in Twitter: a closer look. In: ACL-HLT Google book search for specific phrases, automated classification M. L. Dress et al. (2008). Regional Variation in the Use of Sarcasm. In: Journal of Language and Social Psychology Portuguese Newspaper comments, specific features P. Carvalho et al. (2009). Clues for detecting irony in user-generated contents: oh...!! it s so easy ;-). In: TSA@CIKM Amazon review sentences, KNN, rich feature set O. Tsur et al. (2010). ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews. In: ICWSM Buschmeier, Cimiano, Klinger 16 / 35

Introduction Data Resource Amazon Corpus published E. Filatova (2012). Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing. In: LREC Amazon Mechanical Turk Annotation of Corpus 1 st step: Selection of an ironic and a regular review for a product each, submission of review ID 2 nd step: Validation of annotation by 5 additional turkers, kept in corpus when majority agreed Additional information was extracted not taken into account in this work 437 ironic, 817 regular reviews, 1254 altogether sarcasm.= verbal irony Buschmeier, Cimiano, Klinger 17 / 35

Method Outline 1 Introduction 2 Method 3 Experiments 4 Summary Buschmeier, Cimiano, Klinger 18 / 35

Method Workflow Supervised classification problem Each review categorized into being ironic or non-ironic Corpus by Filatova, 2012 used Classifiers taken into account: Naïve Bayes, support vector machine (with linear kernel), logistic regression, decision tree, random forest As implemented in Python library scikit-learn Buschmeier, Cimiano, Klinger 18 / 35

Method Problem Specific Features Imbalance Star-rating is positive, more negative words (142/35) Star-rating is negative, more positive words (0/0) Example Avoid that TV show. Highly addictive. (ironic reviews with that feature/non-ironic reviews with that feature) Buschmeier, Cimiano, Klinger 19 / 35

Method Problem Specific Features Hyperbole Three successive positive words (2/4) Three successive negative words (4/4) Example That is the best, awesome, greatest, washing machine ever! Buschmeier, Cimiano, Klinger 20 / 35

Method Problem Specific Features Quotes Two succeeding positive adjectives/nouns in quotes (25/25) Two succeeding negative adjectives/nouns in quotes (16/15) Example They advertise it as very good. Buschmeier, Cimiano, Klinger 21 / 35

Method Problem Specific Features Pos/Neg and Punctuation Positive word, exclamation mark in a distance of four (7/19) Negative word, exclamation mark in a distance of four (4/2) Example Such a great thing! Buschmeier, Cimiano, Klinger 22 / 35

Method Problem Specific Features Pos/Neg and Ellipsis Positive word, ellipsis in a distance of four (27/33) Negative word, ellipsis in a distance of four (28/18) Example Such a great thing... Buschmeier, Cimiano, Klinger 23 / 35

Method Problem Specific Features Ellipsis and Punctuations An ellipsis is followed by multiple punctuation marks (4/1) Example You really say...?!? Buschmeier, Cimiano, Klinger 24 / 35

Method Problem Specific Features Punctuation Existence of multiple exclamation marks (31/51) Existence of multiple question marks (10/6) Combination of question with exclamation mark (12/4) Example!!!!!,??,?! Buschmeier, Cimiano, Klinger 25 / 35

Method Problem Specific Features Interjection Terms like wow and huh, lol (16/18) Laughter Onomatopoeia like haha (1/2) Smilies (6/25) Example That machine is really like... *WOW*... hahahaha :-) Buschmeier, Cimiano, Klinger 26 / 35

Method Bag-of-Words Every occurring term is used to generate a feature Features Example text: This is great. The word This occurs The word is occurs The word great occurs... Buschmeier, Cimiano, Klinger 27 / 35

Experiments Outline 1 Introduction 2 Method 3 Experiments 4 Summary Buschmeier, Cimiano, Klinger 28 / 35

Experiments Baselines Use the star-rating as five features ( star-rating ) Bag-of-Words ( BOW ) Majority of positive/negative words ( sentiment ) Buschmeier, Cimiano, Klinger 28 / 35

Experiments Results, Logistic Regression, 10-fold CV 100 80 71.7 68.8 74.4 67.8 60 58.1 50.8 F 1 40 20 0 Star-Rating BOW Sentiment All+Star-Rating Specific Buschmeier, Cimiano, Klinger 29 / 35

Experiments Distributions 700 600 Irony Non Irony Corpus 700 600 Irony Non Irony Prediction Number of Reviews 500 400 300 200 Number of Reviews 500 400 300 200 100 100 0 1 2 3 4 5 Stars 0 1 2 3 4 5 Stars Buschmeier, Cimiano, Klinger 30 / 35

Experiments Results for different classifiers 100 80 74.4 71.3 72.2 65.0 F 1 60 40 48.2 20 0 Logistic Regr. SVM Decision Tree Random Forest Naive Bayes Buschmeier, Cimiano, Klinger 31 / 35

Experiments Information Gain of Bag-of-Words Which phrases are important to decide for irony? great, I mean, easy, mean, is very, very, stupid, is a, worst, highly, a great, easy to, the worst, excellent, price, fast, a bit, shirt, works, money, man, simple, worse, use, Oh, idea, nothing, and it, How, the best, wrong Buschmeier, Cimiano, Klinger 32 / 35

Summary Outline 1 Introduction 2 Method 3 Experiments 4 Summary Buschmeier, Cimiano, Klinger 33 / 35

Summary Summary Summary & Future work The first feature evaluation for irony detection on a publicly available corpus Meta-information is a strong indicator Setting with actual text based features is more useful Outlook Measure text similarity of reviews of same product Transfer known theories about the use of irony to text Include method in our fine-grained aspect/evaluation phrase extraction model for sentiment analysis (Klinger et al., 2013b; Klinger et al., 2013a) Buschmeier, Cimiano, Klinger 33 / 35

Bibliography Bibliography I Attardo, S. (2000). Irony Markers and Functions: Towards a Goal-oriented Theory of Irony and its Processing. In: Rask: Internationalt Tidsskrift for Sprog og Kommunikation. Barbieri, F. et al. (2014). Modelling Irony in Twitter: Feature Analysis and Evaluation. In: LREC. Carvalho, P. et al. (2009). Clues for detecting irony in user-generated contents: oh...!! it s so easy ;-). In: TSA@CIKM. Clark, H. H. et al. (1984). On the pretense theory of irony. In: Journal of Experimental Psychology: General. Dress, M. L. et al. (2008). Regional Variation in the Use of Sarcasm. In: Journal of Language and Social Psychology. Buschmeier, Cimiano, Klinger 34 / 35

Bibliography Bibliography II Filatova, E. (2012). Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing. In: LREC. González-Ibáñez, R. et al. (2011). Identifying sarcasm in Twitter: a closer look. In: ACL-HLT. Klinger, R. et al. (2013a). Bi-directional Inter-dependencies of Subjective Expressions and Targets and their Value for a Joint Model. In: ACL. (2013b). Joint and Pipeline Probabilistic Models for Fine-Grained Sentiment Analysis: Extracting Aspects, Subjective Phrases and their Relations. In: ICDMW. Kumon-Nakamura, S. et al. (1995). How About Another Piece of Pie: The Allusional Pretense Theory of Discourse Irony. In: Journal of Experimental Psychology: General. Buschmeier, Cimiano, Klinger 35 / 35

Bibliography Bibliography III Maynard, D. et al. (2014). Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis. In: LREC. Reyes, A. et al. (2011). Mining subjective knowledge from customer reviews: a specific case of irony detection. In: WASSA@ACL. Tsur, O. et al. (2010). ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews. In: ICWSM. Utsumi, A. (2000). Verbal irony as implicit display of ironic environment: Distinguishing ironic utterances from nonirony. In: Journal of Pragmatics. Buschmeier, Cimiano, Klinger 36 / 35

Bibliography Bibliography IV Wilson, D. et al. (2012). Explaining Irony. In: Meaning and Relevance. Buschmeier, Cimiano, Klinger 37 / 35

Universität Bielefeld June 27, 2014 An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews Konstantin Buschmeier, Philipp Cimiano, Roman Klinger Semantic Computing Group, CIT-EC, Bielefeld University Slides are available at http://www.roman-klinger.de/talks/irony.pdf