Automatic Detection of Sarcasm in BBS Posts Based on Sarcasm Classification

Similar documents
An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews

Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm

Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder

ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews

Sarcasm Detection in Text: Design Document

Harnessing Context Incongruity for Sarcasm Detection

Mining Subjective Knowledge from Customer Reviews: A Specific Case of Irony Detection

This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis.

Some Experiments in Humour Recognition Using the Italian Wikiquote Collection

arxiv: v1 [cs.cl] 3 May 2018

How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text

Detecting Intentional Lexical Ambiguity in English Puns

Sparse, Contextually Informed Models for Irony Detection: Exploiting User Communities, Entities and Sentiment

Affect-based Features for Humour Recognition

#SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm

World Journal of Engineering Research and Technology WJERT

Sentiment and Sarcasm Classification with Multitask Learning

arxiv: v1 [cs.cl] 8 Jun 2018

DICTIONARY OF SARCASM PDF

Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest

Are Word Embedding-based Features Useful for Sarcasm Detection?

Document downloaded from: This paper must be cited as:

Computational Laughing: Automatic Recognition of Humorous One-liners

Acoustic Prosodic Features In Sarcastic Utterances

Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing

Detecting Sarcasm in English Text. Andrew James Pielage. Artificial Intelligence MSc 2012/2013

Computational Models for Incongruity Detection in Humour

Sentiment Analysis. Andrea Esuli

Introduction to Sentiment Analysis. Text Analytics - Andrea Esuli

Fake News or Truth? Using Satirical Cues to Detect Potentially Misleading News.

Implementation of Emotional Features on Satire Detection

Automatic Polyphonic Music Composition Using the EMILE and ABL Grammar Inductors *

arxiv: v1 [cs.cl] 26 Jun 2015

Humor recognition using deep learning

Stierlitz Meets SVM: Humor Detection in Russian

LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally

TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION

Effects of Semantic Relatedness between Setups and Punchlines in Twitter Hashtag Games

Dynamic Allocation of Crowd Contributions for Sentiment Analysis during the 2016 U.S. Presidential Election

Natural language s creative genres are traditionally considered to be outside the

저작권법에따른이용자의권리는위의내용에의하여영향을받지않습니다.

Identifying functions of citations with CiTalO

Towards a Contextual Pragmatic Model to Detect Irony in Tweets

arxiv: v1 [cs.ai] 10 Jan 2019

Figurative Language Processing in Social Media: Humor Recognition and Irony Detection

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC

The final publication is available at

An extensive Survey On Sarcasm Detection Using Various Classifiers

Sarcasm in Social Media. sites. This research topic posed an interesting question. Sarcasm, being heavily conveyed

Sarcasm Detection on Facebook: A Supervised Learning Approach

NLPRL-IITBHU at SemEval-2018 Task 3: Combining Linguistic Features and Emoji Pre-trained CNN for Irony Detection in Tweets

Homonym Detection For Humor Recognition In Short Text

Sarcasm as Contrast between a Positive Sentiment and Negative Situation

SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1

Semantic Role Labeling of Emotions in Tweets. Saif Mohammad, Xiaodan Zhu, and Joel Martin! National Research Council Canada!

Really? Well. Apparently Bootstrapping Improves the Performance of Sarcasm and Nastiness Classifiers for Online Dialogue

A Survey of Sarcasm Detection in Social Media

First Stage of an Automated Content-Based Citation Analysis Study: Detection of Citation Sentences 1

Linguistic Ethnography: Identifying Dominant Word Classes in Text

REPORT DOCUMENTATION PAGE

Linguistic Features of Humor in Academic Writing

KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection

SemEval-2015 Task 11: Sentiment Analysis of Figurative Language in Twitter

PREDICTING HUMOR RESPONSE IN DIALOGUES FROM TV SITCOMS. Dario Bertero, Pascale Fung

Humor: Prosody Analysis and Automatic Recognition for F * R * I * E * N * D * S *

A COMPREHENSIVE STUDY ON SARCASM DETECTION TECHNIQUES IN SENTIMENT ANALYSIS

SARCASM DETECTION IN SENTIMENT ANALYSIS

Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers

Learning Word Meanings and Descriptive Parameter Spaces from Music. Brian Whitman, Deb Roy and Barry Vercoe MIT Media Lab

Many people struggle with rhetorical analysis theses.

Modelling Sarcasm in Twitter, a Novel Approach

High accuracy citation extraction and named entity recognition for a heterogeneous corpus of academic papers

A Pinch of Humor for Short-Text Conversation: an Information Retrieval Approach

arxiv: v2 [cs.cl] 20 Sep 2016

Improving Frame Based Automatic Laughter Detection

Sentence and Expression Level Annotation of Opinions in User-Generated Discourse

Automatic Classification of Reference Service Records

Automatic Extraction of Popular Music Ringtones Based on Music Structure Analysis

Video-based Vibrato Detection and Analysis for Polyphonic String Music

Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons

Harnessing Sequence Labeling for Sarcasm Detection in Dialogue from TV Series Friends

Tweet Sarcasm Detection Using Deep Neural Network

Filling the Blanks (hint: plural noun) for Mad Libs R Humor

Automatic Sarcasm Detection: A Survey

Approaches for Computational Sarcasm Detection: A Survey

arxiv: v1 [cs.ir] 16 Jan 2019

Humor Recognition and Humor Anchor Extraction

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

Fracking Sarcasm using Neural Network

Modelling Irony in Twitter: Feature Analysis and Evaluation

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

Automatic Laughter Detection

Acoustic Scene Classification

Finding Sarcasm in Reddit Postings: A Deep Learning Approach

CASCADE: Contextual Sarcasm Detection in Online Discussion Forums

ABSOLUTE OR RELATIVE? A NEW APPROACH TO BUILDING FEATURE VECTORS FOR EMOTION TRACKING IN MUSIC

Attending Sentences to detect Satirical Fake News

Modeling Satire in English Text for Automatic Detection

Detecting Hoaxes, Frauds and Deception in Writing Style Online

LLT-PolyU: Identifying Sentiment Intensity in Ironic Tweets

Transcription:

Web 1,a) 2,b) 2,c) Web Web 8 ( ) Support Vector Machine (SVM) F Web Automatic Detection of Sarcasm in BBS Posts Based on Sarcasm Classification Fumiya Isono 1,a) Suguru Matsuyoshi 2,b) Fumiyo Fukumoto 2,c) Abstract: We propose two detection systems that identify sarcasm and slander in posts on bulletin board system (BBS). We made a corpus of sarcasm in BBS, and classified sarcasm instances into eight classes: interrogative, guess, give-up, unbalance, exaggeration, shock, metaphor, and contrast. For each sarcasm class, we constructed syntactic patterns for detection of sarcasm that include sentence structures and polarity conditions of the target sentence, the previous sentence and the next sentence. Our first system detects sarcasm using a database of the syntactic patterns. We made a corpus of slander in BBS and a list of slander expressions extracted from the corpus. Our second system detects slander using Support Vector Machine (SVM), where as features, we use frequencies of words in the list, and positive expressions and negative expressions in the target sentence, the previous sentence and the next sentence. In the experiment, the proposed systems can achieve superior F-measures compared with baseline systems. Keywords: classification, filtering, sarcasm, slander, bulletin board system 1. 1 Department of Education Interdisciplinary Graduate School of Medicine and Engineering, University of Yamanashi 2 Interdisciplinary Graduate School of Medicine and Engineering, University of Yamanashi a) g13mk002@yamanashi.ac.jp b) sugurum@yamanashi.ac.jp c) fukumoto@yamanashi.ac.jp (1) (2) (3) (4) ( 1 ) ( 2 ) ( 3 ) A ()? B! 1

( 4 ) (1) (2) (3) B (B ) B (4) 2 ( ) (5) A B! (6) A B Web *1 1 *1 http://www.yahoo-help.jp/app/home/p/622/ 1 1 Web 2 3 Web 4 5 6 7 2. 2.1 [1] Mihalcea [2] Support Vector Machine (SVM) Burfoot [3] SVM 2

1 2,452 37 73 5,141 336 1,247 2,726 30 95 4,278 234 703 5,178 67 168 9,419 570 1,950 Muh [4] Twitter Amazon 2 SASI[5] k- (!? ) Amazon Twitter 2.2 [6] SVM 8 Adler [7] Wikipedia 4 4 3. 2 ( 1 ) : ( 2 ) Web 3.1 : : *2 ( ) [8] 90% 1 58 10 58 40 5,178 1 1 1 2 1 67 168 3.2 Web 5 Web * 3 ( )!? 9,419 1 1 570 1,950 *2 http://rit.rakuten.co.jp/rdr/index.html *3 http://blog.livedoor.jp/dqnplus/archives/ 1736747.html http://blog.livedoor.jp/dqnplus/archives/ 1736731.html http://blog.livedoor.jp/dqnplus/archives/ 1735211.html http://hamusoku.com/archives/7126094.html http://hamusoku.com/archives/7430403.html ( 2012 12 13 ) 3

1 3.3 40 20 2,452 20 2,726 3 Web 5,141 2 Web 4,278 1 4. 1 4.1 1 Web 2 8 2 4.2 1 *4 F 3 35 Neg + Neg Neg UniDic *5 4.2.1 () 4.2.2 () *4 http://www.cl.ecei.tohoku.ac.jp/resources/sent_lex/ wago.121808.pn *5 http://sourceforge.jp/projects/unidic/releases/57618 4

2 ( ) 15 104 119 0 44 44 0 68 68 8 48 56 0 51 51 ww 3 0 3 5 0 5 6 0 6 0 21 21 4.2.3 () 4.2.4 () 4.2.5 () ww 3 Neg Neg + Neg Neg Neg + Neg Neg Neg Neg Neg Neg + Neg + Neg + Neg + Neg Neg Neg Neg Neg Neg Neg Neg Neg Neg Neg Neg w Neg + Neg + Neg Neg + Neg Neg Neg + Neg + 5

4.2.6 () * 6 4.2.7 () 4.2.8 () 4.3 MeCab * 7 CaboCha *8 *6 *7 http://mecab.googlecode.com/svn/trunk/mecab/doc/ index.html *8 http://code.google.com/p/cabocha 2 5. 2 5.1 3 112 W WWW 5.2 4.3 6

5.3 SVM 4.3 SVM 6. 6.1 SVM SVM-light *9 5 2 2 P R F P = R = F = 2P R P + R 6.2 4 5 4 *9 http://svmlight.joachims.org 4 F 0.04 ( 35/921) 0.95 ( 35/37) 0.07 0.08 (326/4,075) 0.97 (326/336) 0.15 0.20 ( 37/185) 1.00 ( 37/37) 0.34 0.21 ( 211/994) 0.63 (211/336) 0.32 5 F 0.04 ( 72/1,782) 0.99 ( 72/73) 0.08 0.25 (1,234/4,981) 0.99 (1,234/1,247) 0.40 0.33 ( 71/212) 0.97 ( 71/73) 0.50 0.51 ( 560/1,104) 0.45 ( 560/1,247) 0.48 6 F 0.01 ( 6/907) 0.20 ( 6/30) 0.01 0.06 (147/2,435) 0.63 (147/234) 0.11 0.09 ( 14/150) 0.47 ( 14/30) 0.16 0.09 (102/1,150) 0.44 (102/234) 0.15 7 F 0.04 ( 93/2,408) 0.97 ( 93/95) 0.07 0.17 (685/4,045) 0.97 (685/703) 0.29 0.13 ( 60/452) 0.63 ( 60/95) 0.22 0.38 (449/1,176) 0.64 (449/703) 0.48 5 0.97 0.33 0.50 6 7 6 0.09 8 7

7 F 7. Web (B) ( : 25870278: ) [1],,, Vol. 9, No. 6, pp. 875 881 (1993). [2] Mihalcea, R. and Pulman, S. G.: Characterizing humour: An exploration of features in humorous texts, in CICLing, pp. 337 347 (2007). [3] Burfoot, C. and Baldwin, T.: Automatic satire detection: Are you having a laugh?, in Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pp. 161 164 (2009). [4] Muh, M., Tsur, O. and AriRappoport, : Semi-Supervised Recognition of Sarcastic Sentences in Twitter and Amazon, in Proceedings od the Fourteenth Conference on Computational Natural Language Learning, pp. 107 116 (2010). [5] Tsur, O., Davidiv, D. and Rappoport, A.: Icwsm - A Great Catchy Name: Semi-supervised Recognition of Sarcastic Sentences in Product Reviews, in International AAAI Conference on Weblogs and Social Media, pp. 162 169 (2010). [6],,,,,. NLC,, pp. 93 98 (2009). [7] Adler, B., Alfaro L., de, Mola-Velasco, S., Rosso, P. and West, A.: Wikipedia Vandalism Detection: Combining Natural Language, Metadata, and Reputation Features, in ICLing 11: Proceedings of the 12th International Conference on Intelligent Text Processing and Computational Linguistics, LNCS 6609, pp. 277 288 (2011). [8],,, 18, pp. 1188 1191 (2012). 8