Temporal patterns of happiness and sarcasm detection in social media (Twitter)

2 Temporal patterns of happiness and sarcasm detection in social media (Twitter)
Pradeep Kumar
NPSO Innovation Day, November 22, 2017

3 Our Data Science Team
Patricia Prüfer, Pradeep Kumar, Marcia den Uijl, Peter Fontein, Hendri Adriaens, Next member?

4 Content:
1. Average Happiness Measurement
  1.1 Introduction
  1.2 Data Collection from Twitter
  1.3 Data Cleaning
  1.4 Method
  1.5 Result
  1.6 Interpretation
2. Sarcasm Detection in Tweets
  2.1 Introduction
  2.2 Training data collection
  2.3 Method
  2.4 Training Result

5 1.1 Introduction
Why Twitter data?
  Popular microblogging site
  500 million tweets a day, about 200 billion a year
  240+ million active users
  The Twitter audience ranges from ordinary people to celebrities
  Users often discuss current affairs and personal views on various subjects
Challenges
  Tweets are highly unstructured and often ungrammatical
  Non-standard vocabulary and abbreviations
  Lexical variations
  Cultural context of phrases, terms and symbols
  Hidden sarcasm

6 Population of Twitter Users in the Netherlands
2.6 million Dutch users, of which 0.9 million use Twitter daily
[Chart: usage by age category; shares shown: 10%, 19%, 8%, 23%, 25%; age brackets in yrs, up to 80+ yrs]
By 2016: overall use stable, with a decrease in use among young people and an increase among the elderly

7 Social media analytics and process
Capture
  Gather data from various sources
  Preprocess the data
  Extract pertinent information from the data
Understand
  Remove noisy data
  Perform advanced analytics: opinion mining, topic modelling, trend analysis, sentiment analysis
  Temporal (time series) happiness/sentiment analysis
  Sarcasm analysis
Present
  Summarize and evaluate the findings from the Understand stage

8 Twitter structure

9 Data hidden in plain sight
Time, Social network, Author, Tweet, Description, Location, Popularity, Sentiment, Topic

10 Approach: An Overview
1. Tweet download using the Twitter API
2. Preprocessing and cleaning
3. Sanitization and emoticon replacement
4. Tokenizer
5. Happiness calculation, term frequency and topic modelling

11 1.2 Data Collection from Twitter
Tweet streaming criteria:
  Streaming gives access to 1% of the data
  The words in the top 10 are either articles (de, het, een), prepositions (in, van), conjunctions (en, dat), a personal pronoun (ik), a negation (niet) or a conjugation of to be (is) [1][2]
  The words in the top 10 are the same for men and women
[1] Eric Sanders, Collecting and Analysing Chats and Tweets in SoNaR, CLST, Radboud University Nijmegen
[2]

12 1.3 Tweet Cleaning
Step 1: Removing the HTTP links (URLs)
Step 2: Removing the # tags
Step 3: Replacement of emoticons (faces, objects, nature, flags)
Emoji cheat sheet [3] categories: Smileys and People; Animals and Nature; Activity; Travel and Places; Objects; Symbols; Flags
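Steps 1 to 3 can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used in the talk: the `EMOTICONS` table is a hypothetical stand-in for the emoji cheat sheet, and the sketch assumes that "removing the # tags" means dropping the whole hashtag rather than only the `#` character.

```python
import re

# Hypothetical emoticon table; the slides do not list the mapping that was used.
EMOTICONS = {":)": "smile", ":-)": "smile", ":(": "sad", ":-(": "sad", ":D": "laugh"}

URL_RE = re.compile(r"https?://\S+")
HASHTAG_RE = re.compile(r"#\w+")


def clean_tweet(text: str) -> str:
    """Apply cleaning steps 1-3 to a single tweet."""
    text = URL_RE.sub(" ", text)        # Step 1: remove links
    text = HASHTAG_RE.sub(" ", text)    # Step 2: remove hashtags (assumed: whole tag)
    for emoticon, word in EMOTICONS.items():
        text = text.replace(emoticon, f" {word} ")  # Step 3: emoticon -> word
    return " ".join(text.split())       # collapse the leftover whitespace


print(clean_tweet("Mooi weer :) #zomer https://t.co/abc"))
```

Links are removed first so that the `://` inside a URL can never be mistaken for an emoticon.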

13 Step 4: Sanitization (treating the abbreviations and repeated sounds)
Some Dutch abbreviations:
  Hgh: hoe gaat het (how are you)
  Gmj: goed met jou (fine, and you)
  Idk: I don't know
  Gwn: gewoon (just, simply)
  Vgm: volgens mij (in my opinion)
  K: ik (I)
  Vaka: vakantie (holiday)
  Das: dat is (that is)
  Sws: sowieso (anyway)
  Wnr: wanneer (when)
  T: het (it, the)
Step 5: Monogram (unigram) model
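A minimal sketch of steps 4 and 5, using the abbreviation list from the slide. Two choices here are assumptions, since the slides do not specify them: repeated sounds are handled by collapsing any run of three or more identical characters to two, and abbreviations are expanded only when they match a whole token. Whole-token matching matters for one-letter abbreviations such as K: a character-level replacement would also rewrite the k inside ordinary words like ook or leuk.

```python
import re
from collections import Counter

# Abbreviations taken from the slide (lowercased).
ABBREVIATIONS = {
    "hgh": "hoe gaat het", "gmj": "goed met jou", "idk": "i don't know",
    "gwn": "gewoon", "vgm": "volgens mij", "k": "ik", "vaka": "vakantie",
    "das": "dat is", "sws": "sowieso", "wnr": "wanneer", "t": "het",
}

# Assumed rule for repeated sounds: 3+ identical characters become 2.
REPEAT_RE = re.compile(r"(.)\1{2,}")


def sanitize(text: str) -> list[str]:
    """Lowercase, collapse repeated characters, tokenize, expand abbreviations."""
    text = REPEAT_RE.sub(r"\1\1", text.lower())
    tokens = re.findall(r"[\w']+", text)
    expanded = []
    for token in tokens:
        # Whole-token lookup: "k" expands, the k inside "leuk" does not.
        expanded.extend(ABBREVIATIONS.get(token, token).split())
    return expanded


tokens = sanitize("Hgh? K ga gwn op vaka, zoooo leuk")
unigram_counts = Counter(tokens)  # Step 5: unigram (monogram) frequencies
print(tokens)
```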

14 1.4 Method (Word collection and ranking)
Sourcing of words:
  Twitter
  Google Books
  New York Times
  Music Library
Ranking methodology:
  1. The top 5000 most frequent words from each corpus were merged, resulting in 10,222 words [4]
  2. ... evaluations per word
  3. Words ranked on the scale of ...
Top words are:
  Words       Average happiness
  laughter    ...
  happiness   ...
  love        ...
  joy         8.16
Bottom words are:
  Words       Average happiness
  killer      ...
  cancer      ...
  death       ...
  terrorist   1.30
[4] Data set: collected from LabMT [3]. The 10,222 unique words were labeled using Amazon's Mechanical Turk.

15 1.4 Method (Temporal Happiness Calculation)
Mathematical formula:
\[ \text{Average Happiness} = \frac{\sum_{i=1}^{n} h_{avg}(W_i)\, f_i}{\sum_{i=1}^{n} f_i} \]
$f_i$ = frequency of the $i$th word
$h_{avg}(W_i)$ = estimate of the average happiness of the $i$th word
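The formula is a frequency-weighted mean of per-word happiness scores. The sketch below shows one way to compute it per hour of tweets. The scores for joy and terrorist are the two values that appear on the word-ranking slide; the score for weekend is a made-up placeholder, and the choice to skip words without a score is an assumption rather than something the slides state.

```python
from collections import Counter, defaultdict
from datetime import datetime

# "joy" and "terrorist" come from the slide; "weekend" is a hypothetical value.
SCORES = {"joy": 8.16, "terrorist": 1.30, "weekend": 7.0}


def average_happiness(tokens, scores):
    """Frequency-weighted mean happiness; words without a score are skipped."""
    counts = Counter(t for t in tokens if t in scores)   # f_i per scored word
    total = sum(counts.values())                         # sum of f_i
    if total == 0:
        return None                                      # no scored words at all
    return sum(scores[w] * f for w, f in counts.items()) / total


def hourly_happiness(tweets, scores):
    """tweets: iterable of (datetime, tokens). Returns {hour: average happiness}."""
    buckets = defaultdict(list)
    for timestamp, tokens in tweets:
        hour = timestamp.replace(minute=0, second=0, microsecond=0)
        buckets[hour].extend(tokens)
    return {hour: average_happiness(toks, scores) for hour, toks in sorted(buckets.items())}


tweets = [
    (datetime(2017, 8, 20, 16, 5), ["joy", "joy", "the"]),
    (datetime(2017, 8, 20, 16, 40), ["terrorist"]),
]
print(hourly_happiness(tweets, SCORES))
```

Pooling all tokens of an hour before averaging weights each word occurrence equally, which matches the formula; averaging per-tweet scores instead would weight each tweet equally and generally gives a different number.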

16 1.5 Result
Average Happiness Index
An interactive dynamic graph is available at ...

17 1.6 Interpretation
Two instances:

Score: 3.92 (5 AM on 19th August 2017)
Most used terms: {hard, idiot, good, get, knows, like, police, Mexican, struggle, fuck, loves, blonde, fantastic, drug, government, dismissed, care}
Top hashtags: [('#nieuws', 248), ('#nieuwstwitter', 207), ('#vacature', 204), ('#NL', 191), ('#actueel', 120), ('#NieuwsTwitter', 120), ('#Krant', 102), ('#feywil', 73), ('#lab', 64), ('#kkl', 61), ('#brugopen', 60), ('#Nieuws', 55), ('#Nederland', 54), ('#voetbal', 53), ('#Politie', 46), ('#Amsterdam', 43), ('#LaraconEU', 37), ('#HLN', 36), ('#E313', 35), ('#tdd', 33)]

Score: 4.65 (4 PM on 20th August 2017, Sunday)
Most used terms: {good, request, bright, care, okay, victory, lucky, sunday, passion, well, cookies, happy, dismissed, theaters, like, mature, weekend, hard, thought, strange, main, car, personal, social, stole, love, helps, walking, negative, spa, laugh, ride, start, sea, sonic, needed, sitting}
Top hashtags: [('#ajagro', 787), ('#Ajax', 328), ('#nieuwstwitter', 275), ('#nieuws', 254), ('#NieuwsTwitter', 169), ('#actueel', 168), ('#ANDSTV', 163), ('#brugopen', 143), ('#AJAgro', 116), ('#utrwil', 116), ('#excfey', 106), ('#CNBLUE', 105), ('#andstv', 89), ('#PushAwardsKathNiels', 83), ('#FCGroningen', 82), ('#voetbal', 79), ('#PSV', 78), ('#NACpsv', 77), ('#NACpraat', 72)]
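The hashtag lists above are `(tag, count)` pairs sorted by frequency. A minimal sketch of how such a ranking could be produced from raw tweet text follows; the function name and the `fold_case` option are illustrative assumptions. The option is included because the lists count variants such as #nieuwstwitter and #NieuwsTwitter separately, and lowercasing before counting would merge them.

```python
import re
from collections import Counter

HASHTAG_RE = re.compile(r"#\w+")


def top_hashtags(tweets, k=20, fold_case=False):
    """Return the k most frequent hashtags as (tag, count) pairs."""
    counts = Counter()
    for text in tweets:
        tags = HASHTAG_RE.findall(text)
        if fold_case:
            tags = [tag.lower() for tag in tags]  # merge case variants
        counts.update(tags)
    return counts.most_common(k)


sample = ["#Ajax wint #ajagro", "#ajax #ajagro", "wat een goal #ajagro"]
print(top_hashtags(sample, k=2))
print(top_hashtags(sample, k=2, fold_case=True))
```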

18 Topic Modelling on Twitter Data

Score: 4.65 (4 PM on 20th August 2017, Sunday)
Topic 1: [goed, nook, ooik, we, juist, steed, zee, my, kom, wel, meisje, vrouwen, nederland]
Topic 2: [iik, wel, ooik, hebt, waar, echt, heel, denik, weer, erg, gaat, zit, mensen, zin, morgen]
Topic 3: [iik, leuik, vind, video, frans, gelezen, waal, waarom, geld, nl, stuik, wedstrijd, blijft, vragen]
Topic 4: [minder, bedankt, middle, smokkelmaffia, ngo's, knechten, afname, verdrinkingen, verdraait, club, bal, tijd, leuke, blonde]
Topic 5: [weer, nee, niet, ooik, gewoon, volgens, smile, we, gaat, uur, keer, gaan, zomer, wel, ij]

Score: 3.92 (5 AM on 19th August 2017)
Topic 1: [iik, zegt, weer, waar, he, ooik, gaan, allemaal, wel, mooi, nieuwe, bal, barcelona, we, vakantie]
Topic 2: [man, weg, grote, tijdens, krijgt, rood, geik, twee, meisje, no, jullie, mensen, omg, speelt]
Topic 3: [iik, video, vind, leuik, via, wonder, live, wisdom, toegevoegd, ooik, we, afspeellijst, goal, onze, amp]
Topic 4: [nou, minder, juist, gaat, nederland, zee, middel, worden, tijd, ooik, smokkelmaffia, knechten, afname, ngo's]
Topic 5: [my, beste, gemaakt, school, gelijik, you, toe, geniet, pa, gewoon, extra]

19 2. Sarcasm Detection in Tweets
  2.1 Introduction
  2.2 Training data collection
  2.3 Methods
  2.4 Training Result

20 2.1 Introduction
Underlying hypothesis:
Contrast in sarcastic tweets: sarcasm detection relies on the assumption that, in a sarcastic document, a negative situation often appears after a positive sentiment. [5]
[positive verb phrase] + [negative verb phrase]
Examples:
1. "Honesty is the best policy - when there is money in it." - Mark Twain
2. (een ouder tegen een kind met een slecht rapport) Je bent weer eens de beste leerling van de klas!
   In English: (a parent to a child with a bad report card) Once again you are the best student in the class!
The training dataset consists of ... clean sarcastic tweets and ... clean non-sarcastic tweets.
[5] Ellen Riloff, Ashequl Qadir, Prafulla Surve, Lalindra De Silva, Nathan Gilbert and Ruihong Huang, Sarcasm as Contrast between a Positive Sentiment and Negative Situation
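The contrast hypothesis can be illustrated with a toy rule: flag a tweet when a positive cue is followed later by a negative cue. This is only a sketch of the idea, not the method of [5] and not the classifier from the talk. Both word lists are hypothetical and far too small for real use.

```python
# Hypothetical cue lists, for illustration only.
POSITIVE_CUES = {"love", "best", "great", "enjoy"}
NEGATIVE_CUES = {"ignored", "waiting", "late", "broken"}


def has_contrast(tokens):
    """True if some positive cue is followed, later in the tweet, by a negative cue."""
    seen_positive = False
    for token in tokens:
        if token in POSITIVE_CUES:
            seen_positive = True
        elif seen_positive and token in NEGATIVE_CUES:
            return True
    return False


print(has_contrast("i love being ignored".split()))            # positive, then negative
print(has_contrast("being ignored is what i love".split()))    # wrong order for this rule
```

The rule is order-sensitive on purpose: the hypothesis as stated places the negative situation after the positive sentiment.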

21 2.2 Feature Engineering
Sentiment analysis
Topic modeling
Part-of-speech tagging
n-gram model, e.g.
  1. unigrams are single words (examples: really, great, super, awesome, etc.)
  2. bigrams are pairs of consecutive words (examples: really great, super awesome, etc.)
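A minimal sketch of unigram and bigram extraction from a tokenized tweet. The function names are illustrative; the slides do not say which library or tokenizer was used.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return every run of n consecutive tokens, joined by a space."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngram_features(tokens, max_n=2):
    """Count all 1-grams up to max_n-grams in a single bag-of-n-grams."""
    counts = Counter()
    for n in range(1, max_n + 1):
        counts.update(ngrams(tokens, n))
    return counts


tokens = ["really", "great", "super", "awesome"]
print(ngrams(tokens, 1))
print(ngrams(tokens, 2))
```

The resulting counts can serve directly as sparse features for a classifier, with one dimension per distinct n-gram in the training set.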

22 2.3 Method
Step 1: Split each tweet into one, two and three parts
Step 2: Sentiment analysis on all parts

Splitting       Features
No split        Blob sentiment, Blob subjectivity
Two splits      Blob sentiment 1/2, Blob sentiment 2/2, Blob subjectivity 1/2, Blob subjectivity 2/2
Three splits    Blob sentiment 1/3, Blob sentiment 2/3, Blob sentiment 3/3, Blob subjectivity 1/3, Blob subjectivity 2/3, Blob subjectivity 3/3

Step 3: Topic modelling: decompose each tweet as a sum of topics, to be used as features
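Steps 1 and 2 produce twelve numbers per tweet: a sentiment and a subjectivity value for each of the 1 + 2 + 3 parts. The sketch below shows that bookkeeping under stated assumptions. The slides do not say how tweets were split, so splitting into contiguous word chunks of near-equal length is an assumption. The "Blob" feature names suggest a TextBlob-style scorer; to keep the example self-contained, `toy_scorer` is a hypothetical lexicon-based stand-in that returns a `(polarity, subjectivity)` pair and can be swapped for a real scorer.

```python
# Hypothetical cue lists for the stand-in scorer.
POSITIVE = {"love", "great"}
NEGATIVE = {"ignored", "hate"}


def split_parts(tokens, k):
    """Split tokens into k contiguous parts whose lengths differ by at most one."""
    size, extra = divmod(len(tokens), k)
    parts, start = [], 0
    for i in range(k):
        end = start + size + (1 if i < extra else 0)
        parts.append(tokens[start:end])
        start = end
    return parts


def toy_scorer(tokens):
    """Stand-in scorer: polarity in [-1, 1], subjectivity in [0, 1]."""
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    hits = pos + neg
    polarity = (pos - neg) / hits if hits else 0.0
    subjectivity = hits / len(tokens) if tokens else 0.0
    return polarity, subjectivity


def split_features(tokens, scorer=toy_scorer):
    """Sentiment and subjectivity for the whole tweet, its halves and its thirds."""
    features = []
    for k in (1, 2, 3):
        for part in split_parts(tokens, k):
            features.extend(scorer(part))
    return features  # 2 values x (1 + 2 + 3) parts = 12 values


print(split_features("i love being ignored all day".split()))
```

In the example tweet the whole-text polarity is neutral, while the two halves score positive and negative respectively, which is the kind of within-tweet contrast the split features are meant to expose.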

23 2.4 Training Results
Training results, support vector machine (linear kernel)

              Precision   Recall   F1-score
Sarcasm       ...         ...      ...
Non-sarcasm   ...         ...      ...
Avg           ...         ...      ...
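For reference, the three reported metrics can be computed per class from true and predicted labels as sketched below. The labels in the example are made up to show the arithmetic and are not results from the talk.

```python
def precision_recall_f1(y_true, y_pred, positive):
    """Precision, recall and F1 for one class, treated as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Made-up labels: 1 = sarcasm, 0 = non-sarcasm.
y_true = [1, 1, 1, 0, 0]
y_pred = [1, 1, 0, 1, 0]
print(precision_recall_f1(y_true, y_pred, positive=1))
print(precision_recall_f1(y_true, y_pred, positive=0))
```

Calling the function once per class gives the Sarcasm and Non-sarcasm rows; the Avg row is an average over those rows, and whether it is weighted by class size is not stated on the slide.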

24 Conclusions and future work
Enrich the library for acronym expansion and emoticon replacement
Apply deep learning methods to sarcasm analysis
Collect labelled Dutch tweets for training the sarcasm detection model
Explore additional features to tune the algorithm for sarcasm detection in Dutch tweets
Consider retweets as a factor

25 Thanks for your attention!!

Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm

Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm Anupam Khattri 1 Aditya Joshi 2,3,4 Pushpak Bhattacharyya 2 Mark James Carman 3 1 IIT Kharagpur, India, 2 IIT Bombay,

More information

Sarcasm Detection in Text: Design Document

Sarcasm Detection in Text: Design Document CSC 59866 Senior Design Project Specification Professor Jie Wei Wednesday, November 23, 2016 Sarcasm Detection in Text: Design Document Jesse Feinman, James Kasakyan, Jeff Stolzenberg 1 Table of contents

More information

The Lowest Form of Wit: Identifying Sarcasm in Social Media

The Lowest Form of Wit: Identifying Sarcasm in Social Media 1 The Lowest Form of Wit: Identifying Sarcasm in Social Media Saachi Jain, Vivian Hsu Abstract Sarcasm detection is an important problem in text classification and has many applications in areas such as

More information

World Journal of Engineering Research and Technology WJERT

World Journal of Engineering Research and Technology WJERT wjert, 2018, Vol. 4, Issue 4, 218-224. Review Article ISSN 2454-695X Maheswari et al. WJERT www.wjert.org SJIF Impact Factor: 5.218 SARCASM DETECTION AND SURVEYING USER AFFECTATION S. Maheswari* 1 and

More information

Harnessing Context Incongruity for Sarcasm Detection

Harnessing Context Incongruity for Sarcasm Detection Harnessing Context Incongruity for Sarcasm Detection Aditya Joshi 1,2,3 Vinita Sharma 1 Pushpak Bhattacharyya 1 1 IIT Bombay, India, 2 Monash University, Australia 3 IITB-Monash Research Academy, India

More information

arxiv: v1 [cs.cl] 3 May 2018

arxiv: v1 [cs.cl] 3 May 2018 Binarizer at SemEval-2018 Task 3: Parsing dependency and deep learning for irony detection Nishant Nikhil IIT Kharagpur Kharagpur, India nishantnikhil@iitkgp.ac.in Muktabh Mayank Srivastava ParallelDots,

More information

Sarcasm Detection on Facebook: A Supervised Learning Approach

Sarcasm Detection on Facebook: A Supervised Learning Approach Sarcasm Detection on Facebook: A Supervised Learning Approach Dipto Das Anthony J. Clark Missouri State University Springfield, Missouri, USA dipto175@live.missouristate.edu anthonyclark@missouristate.edu

More information

KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection

KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection Luise Dürlich Friedrich-Alexander Universität Erlangen-Nürnberg / Germany luise.duerlich@fau.de Abstract This paper describes the

More information

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

PDF hosted at the Radboud Repository of the Radboud University Nijmegen PDF hosted at the Radboud Repository of the Radboud University Nijmegen The following full text is a publisher's version. For additional information about this publication click this link. http://hdl.handle.net/2066/158815

More information

TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION

TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION Supriya Jyoti Hiwave Technologies, Toronto, Canada Ritu Chaturvedi MCS, University of Toronto, Canada Abstract Internet users go

More information

Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons

Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons Center for Games and Playable Media http://games.soe.ucsc.edu Kendall review of HW 2 Next two weeks

More information

How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text

How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text Aditya Joshi 1,2,3 Pushpak Bhattacharyya 1 Mark Carman 2 Jaya Saraswati 1 Rajita

More information

Modelling Sarcasm in Twitter, a Novel Approach

Modelling Sarcasm in Twitter, a Novel Approach Modelling Sarcasm in Twitter, a Novel Approach Francesco Barbieri and Horacio Saggion and Francesco Ronzano Pompeu Fabra University, Barcelona, Spain .@upf.edu Abstract Automatic detection

More information

#SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm

#SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm Proceedings of the Thirtieth International Florida Artificial Intelligence Research Society Conference #SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm Natalie

More information

arxiv: v1 [cs.cl] 8 Jun 2018

arxiv: v1 [cs.cl] 8 Jun 2018 #SarcasmDetection is soooo general! Towards a Domain-Independent Approach for Detecting Sarcasm Natalie Parde and Rodney D. Nielsen Department of Computer Science and Engineering University of North Texas

More information

Detecting Sarcasm in English Text. Andrew James Pielage. Artificial Intelligence MSc 2012/2013

Detecting Sarcasm in English Text. Andrew James Pielage. Artificial Intelligence MSc 2012/2013 Detecting Sarcasm in English Text Andrew James Pielage Artificial Intelligence MSc 0/0 The candidate confirms that the work submitted is their own and the appropriate credit has been given where reference

More information

SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1

SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1 SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1 Director (Academic Administration) Babaria Institute of Technology, 2 Research Scholar, C.U.Shah University Abstract Sentiment

More information

Are Word Embedding-based Features Useful for Sarcasm Detection?

Are Word Embedding-based Features Useful for Sarcasm Detection? Are Word Embedding-based Features Useful for Sarcasm Detection? Aditya Joshi 1,2,3 Vaibhav Tripathi 1 Kevin Patel 1 Pushpak Bhattacharyya 1 Mark Carman 2 1 Indian Institute of Technology Bombay, India

More information

Sarcasm as Contrast between a Positive Sentiment and Negative Situation

Sarcasm as Contrast between a Positive Sentiment and Negative Situation Sarcasm as Contrast between a Positive Sentiment and Negative Situation Ellen Riloff, Ashequl Qadir, Prafulla Surve, Lalindra De Silva, Nathan Gilbert, Ruihong Huang School Of Computing University of Utah

More information

Text Analysis. Language is complex. The goal of text analysis is to strip away some of that complexity to extract meaning.

Text Analysis. Language is complex. The goal of text analysis is to strip away some of that complexity to extract meaning. Text Analysis Language is complex. The goal of text analysis is to strip away some of that complexity to extract meaning. Image Source How to talk like a Democrat (or a Republican) Reddit N-gram Viewer:

More information

NETFLIX MOVIE RATING ANALYSIS

NETFLIX MOVIE RATING ANALYSIS NETFLIX MOVIE RATING ANALYSIS Danny Dean EXECUTIVE SUMMARY Perhaps only a few us have wondered whether or not the number words in a movie s title could be linked to its success. You may question the relevance

More information

LLT-PolyU: Identifying Sentiment Intensity in Ironic Tweets

LLT-PolyU: Identifying Sentiment Intensity in Ironic Tweets LLT-PolyU: Identifying Sentiment Intensity in Ironic Tweets Hongzhi Xu, Enrico Santus, Anna Laszlo and Chu-Ren Huang The Department of Chinese and Bilingual Studies The Hong Kong Polytechnic University

More information

Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder

Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder Präsentation des Papers ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews

More information

Basic Natural Language Processing

Basic Natural Language Processing Basic Natural Language Processing Why NLP? Understanding Intent Search Engines Question Answering Azure QnA, Bots, Watson Digital Assistants Cortana, Siri, Alexa Translation Systems Azure Language Translation,

More information

SARCASM DETECTION IN SENTIMENT ANALYSIS

SARCASM DETECTION IN SENTIMENT ANALYSIS SARCASM DETECTION IN SENTIMENT ANALYSIS Shruti Kaushik 1, Prof. Mehul P. Barot 2 1 Research Scholar, CE-LDRP-ITR, KSV University Gandhinagar, Gujarat, India 2 Lecturer, CE-LDRP-ITR, KSV University Gandhinagar,

More information

PREDICTING HUMOR RESPONSE IN DIALOGUES FROM TV SITCOMS. Dario Bertero, Pascale Fung

PREDICTING HUMOR RESPONSE IN DIALOGUES FROM TV SITCOMS. Dario Bertero, Pascale Fung PREDICTING HUMOR RESPONSE IN DIALOGUES FROM TV SITCOMS Dario Bertero, Pascale Fung Human Language Technology Center The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong dbertero@connect.ust.hk,

More information

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG? WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG? NICHOLAS BORG AND GEORGE HOKKANEN Abstract. The possibility of a hit song prediction algorithm is both academically interesting and industry motivated.

More information

Finding Sarcasm in Reddit Postings: A Deep Learning Approach

Finding Sarcasm in Reddit Postings: A Deep Learning Approach Finding Sarcasm in Reddit Postings: A Deep Learning Approach Nick Guo, Ruchir Shah {nickguo, ruchirfs}@stanford.edu Abstract We use the recently published Self-Annotated Reddit Corpus (SARC) with a recurrent

More information

WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs

WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs Abstract Large numbers of TV channels are available to TV consumers

More information

Who would have thought of that! : A Hierarchical Topic Model for Extraction of Sarcasm-prevalent Topics and Sarcasm Detection

Who would have thought of that! : A Hierarchical Topic Model for Extraction of Sarcasm-prevalent Topics and Sarcasm Detection Who would have thought of that! : A Hierarchical Topic Model for Extraction of Sarcasm-prevalent Topics and Sarcasm Detection Aditya Joshi 1,2,3 Prayas Jain 4 Pushpak Bhattacharyya 1 Mark James Carman

More information

저작권법에따른이용자의권리는위의내용에의하여영향을받지않습니다.

저작권법에따른이용자의권리는위의내용에의하여영향을받지않습니다. 저작자표시 - 비영리 - 동일조건변경허락 2.0 대한민국 이용자는아래의조건을따르는경우에한하여자유롭게 이저작물을복제, 배포, 전송, 전시, 공연및방송할수있습니다. 이차적저작물을작성할수있습니다. 다음과같은조건을따라야합니다 : 저작자표시. 귀하는원저작자를표시하여야합니다. 비영리. 귀하는이저작물을영리목적으로이용할수없습니다. 동일조건변경허락. 귀하가이저작물을개작, 변형또는가공했을경우에는,

More information

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Kate Park, Annie Hu, Natalie Muenster Email: katepark@stanford.edu, anniehu@stanford.edu, ncm000@stanford.edu Abstract We propose

More information

Citation & Journal Impact Analysis

Citation & Journal Impact Analysis Citation & Journal Impact Analysis Several University Library article databases may be used to gather citation data and journal impact factors. Find them at library.otago.ac.nz under Research. Citation

More information

Fracking Sarcasm using Neural Network

Fracking Sarcasm using Neural Network Fracking Sarcasm using Neural Network Aniruddha Ghosh University College Dublin aniruddha.ghosh@ucdconnect.ie Tony Veale University College Dublin tony.veale@ucd.ie Abstract Precise semantic representation

More information

DOCTORAL DISSERTATIONS OF MAHATMA GANDHI UNIVERSITY A STUDY OF THE REFERENCES CITED

DOCTORAL DISSERTATIONS OF MAHATMA GANDHI UNIVERSITY A STUDY OF THE REFERENCES CITED DOCTORAL DISSERTATIONS OF MAHATMA GANDHI UNIVERSITY A STUDY OF THE REFERENCES CITED UNNIKRISHNAN S* & ANNU GEORGE** *Assistant Librarian Sr. Sc. **Assistant Librarian Sel.Gr. University Library Mahatma

More information

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Kate Park katepark@stanford.edu Annie Hu anniehu@stanford.edu Natalie Muenster ncm000@stanford.edu Abstract We propose detecting

More information

Comparative study of Sentiment Analysis on trending issues on Social Media

Comparative study of Sentiment Analysis on trending issues on Social Media Comparative study of Sentiment Analysis on trending issues on Social Media Vibhore Jain Department of Computer Science and Engineering Bhilai Institute of Technology, Durg vibhorejain@outlook.com M.V.

More information

Semantic Role Labeling of Emotions in Tweets. Saif Mohammad, Xiaodan Zhu, and Joel Martin! National Research Council Canada!

Semantic Role Labeling of Emotions in Tweets. Saif Mohammad, Xiaodan Zhu, and Joel Martin! National Research Council Canada! Semantic Role Labeling of Emotions in Tweets Saif Mohammad, Xiaodan Zhu, and Joel Martin! National Research Council Canada! 1 Early Project Specifications Emotion analysis of tweets! Who is feeling?! What

More information

Sarcasm in Social Media. sites. This research topic posed an interesting question. Sarcasm, being heavily conveyed

Sarcasm in Social Media. sites. This research topic posed an interesting question. Sarcasm, being heavily conveyed Tekin and Clark 1 Michael Tekin and Daniel Clark Dr. Schlitz Structures of English 5/13/13 Sarcasm in Social Media Introduction The research goals for this project were to figure out the different methodologies

More information

Automatic Sarcasm Detection: A Survey

Automatic Sarcasm Detection: A Survey Automatic Sarcasm Detection: A Survey Aditya Joshi 1,2,3 Pushpak Bhattacharyya 2 Mark James Carman 3 1 IITB-Monash Research Academy, India 2 IIT Bombay, India, 3 Monash University, Australia {adityaj,pb}@cse.iitb.ac.in,

More information

Towards a Contextual Pragmatic Model to Detect Irony in Tweets

Towards a Contextual Pragmatic Model to Detect Irony in Tweets Towards a Contextual Pragmatic Model to Detect Irony in Tweets Jihen Karoui Farah Benamara Zitoune IRIT, MIRACL IRIT, CNRS Toulouse University, Sfax University Toulouse University karoui@irit.fr benamara@irit.fr

More information

This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis.

This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis. This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/130763/

More information

Supplementary Note. Supplementary Table 1. Coverage in patent families with a granted. all patent. Nature Biotechnology: doi: /nbt.

Supplementary Note. Supplementary Table 1. Coverage in patent families with a granted. all patent. Nature Biotechnology: doi: /nbt. Supplementary Note Of the 100 million patent documents residing in The Lens, there are 7.6 million patent documents that contain non patent literature citations as strings of free text. These strings have

More information

Information Networks

Information Networks Information Networks World Wide Web Network of a corporate website Vertices: web pages Directed edges: hyperlinks World Wide Web Developed by scientists at the CERN high-energy physics lab in Geneva World

More information

LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally

LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally Cynthia Van Hee, Els Lefever and Véronique hoste LT 3, Language and Translation Technology Team Department of Translation, Interpreting

More information

Promo Mojo: Fox Takes First and Second Place with NFL, '9-1-1'

Promo Mojo: Fox Takes First and Second Place with NFL, '9-1-1' Promo Mojo: Fox Takes First and Second Place with NFL, '9-1-1' 10.10.2018 On the strength of 318 million TV ad impressions, a spot for Thursday Night Football tops this week's chart. An announcer notes

More information

Understanding People in Low Resourced Languages

Understanding People in Low Resourced Languages Understanding People in Low Resourced Languages Thesis submitted in partial fulfillment of the requirements for the degree of Masters of Science in Computer Science by Research by Sahil Swami 201302071

More information

Evaluation Tools. Journal Impact Factor. Journal Ranking. Citations. H-index. Library Service Section Elyachar Central Library.

Evaluation Tools. Journal Impact Factor. Journal Ranking. Citations. H-index. Library Service Section Elyachar Central Library. Evaluation Tools Journal Impact Factor Journal Ranking Citations H-index Page 1 of 12 Journal Impact Factor Journal Citation Reports is a comprehensive resource that allows you to evaluate and compare

More information

An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews

An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews Universität Bielefeld June 27, 2014 An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews Konstantin Buschmeier, Philipp Cimiano, Roman Klinger Semantic Computing

More information

CrystalNest at SemEval-2017 Task 4: Using Sarcasm Detection for Enhancing Sentiment Classification and Quantification

CrystalNest at SemEval-2017 Task 4: Using Sarcasm Detection for Enhancing Sentiment Classification and Quantification CrystalNest at SemEval-2017 Task 4: Using Sarcasm Detection for Enhancing Sentiment Classification and Quantification Raj Kumar Gupta and Yinping Yang Institute of High Performance Computing (IHPC) Agency

More information

Figures in Scientific Open Access Publications

Figures in Scientific Open Access Publications Figures in Scientific Open Access Publications Lucia Sohmen 2[0000 0002 2593 8754], Jean Charbonnier 1[0000 0001 6489 7687], Ina Blümel 1,2[0000 0002 3075 7640], Christian Wartena 1[0000 0001 5483 1529],

More information

Audio Feature Extraction for Corpus Analysis

Audio Feature Extraction for Corpus Analysis Audio Feature Extraction for Corpus Analysis Anja Volk Sound and Music Technology 5 Dec 2017 1 Corpus analysis What is corpus analysis study a large corpus of music for gaining insights on general trends

More information

Introduction to Sentiment Analysis. Text Analytics - Andrea Esuli

Introduction to Sentiment Analysis. Text Analytics - Andrea Esuli Introduction to Sentiment Analysis Text Analytics - Andrea Esuli What is Sentiment Analysis? What is Sentiment Analysis? Sentiment analysis and opinion mining is the field of study that analyzes people

More information

Tweet Sarcasm Detection Using Deep Neural Network

Tweet Sarcasm Detection Using Deep Neural Network Tweet Sarcasm Detection Using Deep Neural Network Meishan Zhang 1, Yue Zhang 2 and Guohong Fu 1 1. School of Computer Science and Technology, Heilongjiang University, China 2. Singapore University of Technology

More information

Discussing some basic critique on Journal Impact Factors: revision of earlier comments

Discussing some basic critique on Journal Impact Factors: revision of earlier comments Scientometrics (2012) 92:443 455 DOI 107/s11192-012-0677-x Discussing some basic critique on Journal Impact Factors: revision of earlier comments Thed van Leeuwen Received: 1 February 2012 / Published

More information

Markers of Literary Language A Computational-Linguistic Odyssey

Markers of Literary Language A Computational-Linguistic Odyssey Markers of Literary Language A Computational-Linguistic Odyssey Andreas van Cranenburgh Huygens ING Royal Netherlands Academy of Arts and Sciences Institute for Logic, Language and Computation University

More information

Do we really know what people mean when they tweet? Dr. Diana Maynard University of Sheffield, UK

Do we really know what people mean when they tweet? Dr. Diana Maynard University of Sheffield, UK Do we really know what people mean when they tweet? Dr. Diana Maynard University of Sheffield, UK We are all connected to each other... Information, thoughts and opinions are shared prolifically on the

More information

arxiv:submit/ [cs.cv] 8 Aug 2016

arxiv:submit/ [cs.cv] 8 Aug 2016 Detecting Sarcasm in Multimodal Social Platforms arxiv:submit/1633907 [cs.cv] 8 Aug 2016 ABSTRACT Rossano Schifanella University of Turin Corso Svizzera 185 10149, Turin, Italy schifane@di.unito.it Sarcasm

More information

Modelling Irony in Twitter: Feature Analysis and Evaluation
