TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION

Size: px
Start display at page:

Download "TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION"

Transcription

1 TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION Supriya Jyoti Hiwave Technologies, Toronto, Canada Ritu Chaturvedi MCS, University of Toronto, Canada Abstract Internet users go to different social media platforms to read about reviews or comments about a product before they buy or invest in one. Thus, for many companies it is very important to keep track of sentiments of their customers review, respond to them in-time, and to keep the brand value high. There is no dearth of models being created for mining sentiments from twitter data but these models fail when sarcasm is involved in tweets. Thus, sarcasm detection (when expressed implicitly, in contrary to being expressed using explicit words) can help in gaining better insight of customer sentiments on their opinion or review about a product or company. In order to fulfil the above mentioned objective of detecting sarcasm, this paper engineers features from tweets and their description in order to capture the sarcasm expressed in tweets. Each registered author on twitter has the opportunity to selfdescribe oneself. We utilize this selfdescription to extract extra information about personality traits of twitter authors. This additional knowledge about the author of a tweet, helps in classifying sarcastic tweets from nonsarcastic ones with a high degree of accuracy. We engineer and test a range of different feature-sets from tweet-text and it s author s self-description using LinearSVM and Logistic Regression predictive models. We achieve reliable results (an F-score of 0.78 and AUC of 0.88), which are comparable to the other sarcasm detection models reported in literature. Keywords: Sarcasm Detection, Sentiment Analysis, Text Classification, Natural Language Processing, Text Mining, Machine Learning, Social Media I. INTRODUCTION Sarcasm has become a common form of communication on social media platforms as people find it entertaining. Detecting sarcasm is considered a difficult task even for humans without the knowledge of the author s personality and without hand or facial gestures. Identifying sarcasm automatically is an even more challenging task, especially when it is expressed implicitly. For example, a tweet written by a high school boy says I LOVE being kicked out of my BBall team based on my FIRST Match!! #Sarcasm. Automated methods that analyse this tweet may (incorrectly) label it as a positive tweet because of the presence of the word LOVE. Detecting sarcasm from unstructured text generated on social media platforms has many applications, both in industry and the government sectors: such as sentiment analysis on political issues, product ranking system, review summarization and opinion mining. From the perspective of a business, it is very important to recognize sentiments of their customers but sarcastic comments make it a difficult task. As most of the internet users go to different social media platforms (Twitter, Facebook, Reddit etc.) to read 94

2 product reviews before buying or investing into any product or service. Thus, it is very important for businesses to know customers true opinions expressed on these social media platforms to keep the brand value high and make better business decisions. Building a machine learning model that can detect sarcasm from written text data is not a trivial task. Twitter has grown to 328 million active users per month in the is quarter of Twitter s growth and ease of using Twitter API for downloading tweets has opened a great opportunity for private businesses and government sectors to analyse twitter data and make informed and timely decisions. Building automatic sarcasm detection model on tweets comes with many challenges. Twitter has a word limit of 140 words, this compels users to use abbreviations, short sentences, unconventional style of writing and incorrect sentence structure to express opinions. In addition, positive words are mostly used to express negative sentiments, and thus, conventional text analysis techniques such as SentiWordNet1 (lexical database for opinion mining), bag-of-words, TF-IDF (term frequency - inverse document frequency), tokenisation, part-of-speech tagging etc. alone cannot be used to detect sarcasm. In our approach of Twitter Sarcasm Detector (TSD), we engineer features from two main pieces of information: a tweet s text and the author s selfdescription. Twitter allows a tweet of maximum 140 characters, which can contain #hashtags, images, to other users and emojis along with the plain text. Throughout this paper, we will use TT to refer to tweet s text. Twitter also gives it s users an option to describe him/herself within a limit of 160 words - we define this as Author s Self- Description, hereafter referred as ASD.Fig 2 shows an example of Twitter user s self-description. In this paper we categories our features into three categories, as explained below: _ Lexical based features L: Features such as n- grams,intensifiers, capital words, double quotes, and count of part-of-speech. We generate these features from TT and refer them as Lexical based features, related to words or vocabulary of a language. These features helps in capturing the nuances of writing style of a Twitter user. Figure 1. Example of a Sarcastic Tweet _ Sentiment based features S: Sarcastic tweets have contrasts in sentiments, as identified by earlier research work [14]. Thus, we calculate positive, negative and difference of sentiment score (using SentiWordNet and Python s TextBlob package) of different parts of TT as features. _ Topic-Modeling based features T: In machine learning and natural language processing, topicmodeling is a type of statistical model for discovering hidden semantic structures from a collection of documents. A document typically consists of multiple topics, where a topic is a cluster of co-occurring words. We apply topic modeling technique on ASD and use these as features. These features help us in capturing our assumption that ASD (author s selfdescription) can give a better idea of one s inclination towards sarcasm and author s twitter behaviour. Details of all the three categories and features can be found in Table II. The motivation of this work is to improve the accuracy of the automatic sarcasm detection model and make it more generalizable so that it can flag sarcastic tweets being generated on Twitter. This is based on the suggestions given in earlier research [3], 95

3 [11], [12] that includes features from tweets and information available about its author can help in detecting sarcasm. Thus, in our TSD model we propose to use Twitter s ASD (author s selfdescription) in order to generate features that can capture a tweet s author s Twitter behaviour / personality traits. For example, a high school teacher who describes himself as Ordinary Next door Guy Love football Live Simple is less likely to make sarcastic comments than a high school student whose self description is Funny Sarcastic Live Life to Fullest Social Media Enthusiast. More often than not, users on all social media platforms, while creating an account, get to write about themselves in a user-description/profiledescription section. For capturing personality traits of the tweet s author, we have applied Latent Dirichlet Allocation (LDA) [4], to create features to train our classifier. LDA is a popular topic modeling technique for finding topics, where a topic is defined as a cluster of co-occurring words with different probabilities of appearance in documents discussing these topics, as it assumes that a document is a mixture of such topics. We first learn these topics from the user s self-description text, then the classifier will have to learn about these specific topics and create an association with authors. This technique will add that extra information about the personality of an author. We propose a Twitter Sarcasm Detector model (TSD), which captures knowledge about a tweet s author in addition to lexical features from TT to enhance performance of the sarcasm classifiers. Fig 3 is a pictorial representation of our model. Our results suggest that lexical features alone are not Figure 2. Example of a User s/author s profile enough to capture sarcasm, so including more information about the author helps in improving the accuracy of the predictive model. The rest of the paper is organized as follows: Section II has the problem definition. Section III discusses the related literature work. Section IV describes our proposed methodology of sarcasm detection in detail through three sub-sections: data description, data cleaning and preprocessing, and feature engineering. A discussion of experimental setup and performance evaluation is given in Section V. Final conclusion and future directions are outlined in Section VI. II. PROBLEM DEFINITION Sarcasm, "remarks that mean the opposite of what someone says, made to criticize someone or something in a way that is amusing to others but annoying to the person criticized, as defined in Cambridge Dictionary 2. A way of expressing words that are the opposite of what you mean in order to be unpleasant to someone or to make fun of someone [12]. We formally define sarcasm detection problem for Twitter as follows: Definition: Sarcasm Detection on Twitter: given an unlabeled tweet T along with its author s profile description d, our aim is to automatically detect whether tweet T is sarcastic or not. 96

4 III. RELATED WORK Sarcasm detection can be formulated as a classification problem [8], in which, given a piece of text, the goal is to classify it either as sarcastic or non-sarcastic. Literature on sarcasm detection is mostly centered around the twitter data. This is mostly because Twitter has become the most popular platform for communication and sharing opinions with 319 million active users per month3. Ease of using Twitter API to download tweets has opened gates to many research topics and application of text mining. Sarcasm has been a widely researched topic under Natural Language Processing. A solution to the problem of sarcasm detection can improve the accuracy of many other important research topics such as Figure 3. Pictorial representation of TSD sentiment analysis, information retrieval, and all its additional linguistic features that characterize applications (example customer service, sarcasm such as intensifiers, exclamations and making/maintaining brand value). Quick response to negative or sarcastic tweets can help in saving a company s reputation. explicit markers. Bamman and Smith [3] focused on four classes of features which included author s feature, tweet features, response features and Davidov, Tsur and Rappoport [5] used a semisupervised audience features. They used binary logistic classification technique on Twitter data and Amazon product reviews. They used patternbased and punctuation-based features to train their weighted knn-like strategy for sarcasm detection. Liebrecht, Kunneman and Bosch [10] experimented on Dutch tweets and used the #sarcasm as their gold standard for collecting labeled data. They proposed regression to run experiments. Their analysis concluded that a significant improvement in accuracy is seen in models with features derived from author s information and not just tweet alone. Riloff et al. [14] used lexicon-based approach to detect sarcasm on an assumption that sarcastic tweets have a contrast in positive situation and negative situations. They used 97

5 SVM classifier for experimentations.rajadesingan, Zafarani and Lui [12] based their model of automatic sarcasm detection on theories from behavioural and psychological studies. They proposed a behavioural modeling framework for sarcasm detection, SCUBA, which captured sarcasm in form of features such as contrast of sentiments, complex forms of expression, mean of conveying emotions, familiarity with language and written expressions. SCUBA also takes help of past tweets of each author as additional knowledge to detect sarcasm. Joshi, Jain, Bhattacharyya and Carman [9] designed an approach which uses topic modeling that detect sarcasmprevalent topics from dataset of tweets and, estimates distributions corresponding to prevalence of a topic, prevalence of a sentiment-bearing words. Mukherjee and Bala [11] proposed feature-set that captured both the content as well as writing style of the author. They experimented with fuzzy clustering and Naive Bayes methods. Most of the work on automatic sarcasm detection is focused on linguistic and lexical features captured from tweets text. However, recently, Mukherjee and Bala [11] suggested that capturing knowledge about the authors of the tweet can improve automatic detection of sarcasm. This paper use a variation of Joshi, Jain, Bhattacharyya and Carman [9] approach. Our approach applies topic modeling on author s selfdescription instead of tweet s text. The rationale behind our approach is that most often, people when describe themselves on social networking sites, like Twitter, are concise while expressing their personality traits. Thus, this feature helps the model in capturing prevalence of sarcasm from self-reported personality traits in profile description of Twitter accounts. IV. METHODOLOGY The following subsections describe the data used in the proposed Twitter Sarcasm Detector (TSD) model and the required steps to prepare and preprocess this data so that it can be used towards extracting relevant features used in predicting the sarcasm in tweets. A. Data Description In order classify tweets into sarcastic or non-sarcastic categories using a machine learning model, we need labeled tweets. Labeling tweets manually is a time consuming task. Self-reporting of moods or sentiments with hashtags is a common practice on social networking sites. Prior work, [3],[9], [13], have utilized this important and common behavior of twitter users to generate labeled dataset. We follow the same standard approach to download tweets from Twitter API4 annotated with #sarcasm or #sarcastic as sarcastic tweet and all the other tweets as nonsarcastic tweets. In addition to just tweet s text (TT), we also download each tweet s author s selfdescription (ASD). The dataset contains 9,991 sarcastic (labeled as positive) and 10,000 nonsarcastic (labeled as negative) tweets, collected on 17 January Taking data directly from Twitter API is always noisy, thus we use many preprocessing steps, as explained in the next section, before applying the feature engineering module. B. Data Cleaning and Preprocessing The two main pieces of information we are using to build our sarcasm model are: tweet s text (TT) and the author s selfdescription (ASD). Both are in the string format, thus, we apply many preprocessing before feeding it into our feature engineering module. We apply separate preprocessing steps on TT and ASD. Twitter allows a tweet of maximum 140 characters, which can contain #hashtags, images, URLs, 98

6 @mentions to other users and emojis along with the plain text. For example, a tweet in Fig 1 contains #hashtags (#Sarcasm, #iphone). Similarly, when a user creates an account on Twitter, he/she gets to write a short bio for his/her profile. For example, a user in Fig 2 describes himself as sarcastic, genius and future billionaire. There is a restriction of 160 characters, thus, most people use different adjectives, separated by punctuation marks. During preprocessing of tweets, we prune all non- English tweets, as our model s focus is on English language tweets only. We exclude hyperlinks as they do not contribute much in capturing sarcasm. We also remove images attached to tweets as we are not handling sarcasm expressed in images in this model. We replace most common contractions (like don t with do not, won t with will not, lol with laughing out loud, etc.) with full forms and also substituted emojis with their meanings such as :-( with sad, :-) with happy, :-@ angry. We remove the #Sarcasm and #Sarcastic hashtags from our positive dataset. Whereas, all the other hashtag words are kept by just removing the # character. Preprocessing steps of ASD includes, replacing missing ASD s with an alphabet and removing all the punctuation marks. Hashtag words are kept by just removing # character. C. Feature Engineering The most important module of our Twitter Sarcasm Detector (TSD) model is feature engineering where we extract features from tweets text (TT) and author s self-description (ASD) that can help in building an automatic sarcasm classifier.we divide these features into three categories L-LEXICAL, S- SENTIMENT, and T-TOPIC_MODELING based features. Table I lists a brief overview of these features, their categories and whether they are applied to TT or ASD. The features are defined as: L-LEXICAL = {N-grams, Intensifiers, Capital Letters, Word-Count, Double-Quotes, Part-of-Speech Tags}; S-SENTIMENT = {Sentiment Score and Contrast in Sentiments}; T-TOPIC_MODELING = {Topic_modeling using LDA}. L-LEXICAL features are based on words or vocabulary of English language, all these features are designed to capture the nuances of writing style of a Twitter user. S- SENTIMENT based features generally capture both positive, negative and difference in sentiment scores of tweet. This feature is based on Riloff et al. [14] suggestion that sarcastic tweets have a positive/negative contrast between a sentiment and a situation. Use of topic modeling technique to create feature category TTOPIC_ MODELING is inspired by Joshi, Jain, Bhattacharyya and Carman [9]. Table II describes each feature proposed with an example and rational behind using that feature. Running Example of TSD: The following is a running example of feature engineering module of TSD. It takes as input a tweet text (TT) and it s author s self-description (ASD). It generates the features shown below for the given TT and ASD. These features are described in Table II. Inputs: _ Tweet: "Dream Date: Love take me to Sephora Give me your credit card - Just Leave #Dream Date #Sarcasm #Sephora _ User self-description: "Optimistic Researcher Stem cell biologist Explorer witty :) sarcasm my 1st language fun loving. TSD-generated-features: _ L-LEXICAL = {n-grams:{ contains(dream, date) : 1, contains(love) : 1.0, contains(take me to sephora) : 1.0, contains(credit card) : 1.0, 99

7 contains(just leave) : 1.0, contains(give me your) : 1.0, contains(credit) : 1.0 etc. }, word_count :13, Capital_Letter : 0, POSVerb : 3.0, POSNoun : 5.0, POSAdverb : 2.0, POSAdjective : 3.0, intensifiers : 0}; _ S-SENTIMENT = { Sentiment_score: { { Negative sentiment : ,Positivesentiment : }, Sentiment_Contrast: { Sentiment contrast : }}; _ T-TOPIC_MODELING={ Topic :184 : , Topic :134 : , Topic :7 : , Topic:8 : , Topic :5 : } The LDA algorithm [4] assigns five topics as features ofthe given ASD. Each feature consist of the topic identifierand the probabilty of its existence in ASD. V. EXPERIMENTAL EVALUATION The goal of our experiment is to show that the feature engineering done from tweet-text (TT) and author s self-description (ASD) in our proposed Twitter Sarcasm Detector (TSD) model can improve the performance of automatic sarcasm detection on Twitter Network. To classify a tweet into SARCASTIC or Table I LIST OF FEATURES, APPLIED TO AND THEIR CATEGORIES Table II DETAILS OF FEATURES ENGINEERED (*NEW FEATURES) 100

8 NON-SARCASTIC category we use SVM with Linear Kernel (C=0.1) and Logistic Regression (l2 penalty and C = 1.0) from python s Scikit-learn package. Performance of these models is evaluated using the 10-fold cross-validation method. Cross validation generalizes the model to unseen datasets by automatically performing a number of trainpredict-evaluate operations on different input data. In 10-fold cross-validation, input data is split into 10 parts, where 9 parts of data are used for training and 1 part is used for testing the model. This process is repeated 10 times and evaluation metrics are averaged. Thus, for an empirical performance evaluation of full and different combinations of feature-set, we use standard Precision, Recall, F- score, Accuracy and AUC of ROC (Area Under the Curve of Receiver Operating Characteristic) as metrics. Precision is a measure of fraction of retrieved instances that are relevant, thus, can be seen as a measure of exactness. Recall is defined as a fraction of relevant instances that are retrieved, thus, seen as a measure of correctness. F-score is the harmonic mean of precision and recall. Accuracy of a binary classifier is the proportion of correctly classified data, and Area Under the ROC Curve of a classifier is equivalent to the probability that the classifier will rank a randomly chosen positive instance higher than a randomly chosen negative instance i.e. P(score(x+) > score(x )). We experiment using different combinations of categories defined in the TSD model to study the impact of proposed additional feature (T- TOPIC_MODELING) on the performance metrics. The combinations used are: (a) L-LEXICAL + S- SENTIMENT + T-TOPIC_MODELING; (b) L- LEXICAL + T-TOPIC_MODELING; (c) L- LEXICAL + S-SENTIMENT; (d) L-LEXICAL + S_SENTIMENT. Table III shows experimental results of Twitter Sarcasm Detector (TSD) model for all the three categories (L-LEXICAL, S-SENTIMENT and T- TOPIC_MODELING) of features. Table IV shows results of two categories (L- LEXICAL & TTOPIC_ MODELING). Similarly, results of experiments for L-LEXICAL & S SENTIMENT and S-SENTIMENT & TTOPIC_ MODELING are shown in Table V and Table VI, respectively. Joachims [7] provides both theoretical and empirical evidence that SVM is well suited for text classification. The rationale behind using Linear Kernel is its faster performance and avoidance to overfitting with sparse matrix, as our model generates a number of nominal features under n-gram feature. A. Performance Evaluation To perform empirical evaluation of our proposed model, we download tweets using Twitter API and create a labeled dataset using the technique explained in Section IV-A. We apply 10-fold cross-validation evaluation method on both LinearSVM (C=0.1) and Logistic Regression (l2 penalty and C= 1.0). Davis and Goadrich [6] regard AUC as a single number summary for a classifier s performance. Thus, we use AUC to compare performance of individual and different combinations of our feature-sets. The AUC value of n-grams and Topic_Modeling features, when tested individually with 101

9 Figure 4. Comparative performance values of AUC LinearSVM, is and , respectively. There is a significant improvement in AUC values (TSD-LinearSVM: and TSD- LogisticRegression: ), when T- TOPIC_MODELING is combined with L-LEXICAL feature category. Both LinearSVM and LogisticRegression perform well and give similar results for all the metrics for the different combinations of features. The AUC score of both the classifiers is around 88%, which is comparatively higher than the results of other automatic sarcasm detection models, provided in literature (See Fig. 4). We compare AUC values of our TSD with reported AUC values of Liebrecht, Kunneman and Bosch [10], SCUBA [12], and Abercrombie and Hovy [1][8], and we observe an improvement in AUC value in the range of 5 to 28% (Fig 4). Other performance measures are also found to be comparable to existing methods, if not higher (e.g. Accuracy and F-score are found to be 79%). Experiments with TSD data also show that excluding L-LEXICAL features (Tables III and VI) give much lower values of Accuracy and AUC, this supports the fact that lexical features extracted from tweets are essential to detecting sarcasm. An analysis of all the features and different combinations assert our hypothesis that providing additional information about the tweet s author s personality helps in improving the performance of automatic detection of sarcasm on Twitter data. VI. CONCLUSIONS AND FUTURE WORK In this paper, we propose a model called Twitter Sarcasm Detector (TSD), for automatically classifying sarcastic tweets by using features generated from tweet-text along with taking it s author s self-description into account. Our proposed approach acknowledges that some people are more sarcastic than others. Experimental results assert our hypothesis that features obtained from author s information in addition to their tweet, perform better in terms of correctly classifying sarcastic tweets. One of the application of our model is that classifying sarcasm can help businesses to analyse sentiments of customers towards their business. Knowing true sentiments of customers can help business make informed and better decisions by having the ability to answer questions like 102

10 What are the true emotions of my customers towards my products/company?. In future this work can be extended to analyse non- English tweets, extending this model to detect sarcasm on other microblogs (such as Facebook and Reddit) and engineer more features from information available about a tweet s author (e.g. number of followers, verified account, frequency of tweets per day). VII. ACKNOWLEDGEMENT This research was sponsored by Hiwave Technologies ( Canada Inc.), Toronto, Canada. The authors gratefully acknowledge the use of services and facilities of Hiwave Technologies. The authors also thank Vibhu Bhan, who provided support towards this research. REFERENCES [1] Gavin Abercrombie and Dirk Hovy. Putting sarcasm detection into context: The effects of class imbalance and manual labelling on supervised machine classification of twitter conversations. In Proceedings of the ACL 2016 Student Research Workshop, pages , Berlin, Germany, August Association for Computational Linguistics. [2] Stefano Baccianella, Andrea Esuli, and Fabrizio Sebastiani. Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. LREC, 10: , [3] David Bamman and Noah Smith. Contextualized sarcasm detection on twitter. International AAAI Confreence on Web and Social Media, Feb [4] David M Blei, Andrew Y Ng, and Michael I Jordan. Latent dirichlet allocation. Journal of machine Learning research, 3(9), Jan

11 [5] Dmitry Davidov, Oren Tsur, and Ari Rappoport. Semi-supervised recognition of sarcastic sentences in twitter and amazon. In Proceedings of the Fourteenth Conference on Computational Natural Language Learning, CoNLL 10, pages , Stroudsburg, PA, USA, Association for Computational Linguistics. [6] Jesse Davis and Mark Goadrich. The relationship between precisionrecall and roc curves. In Proceedings of the 23rd International Conference on Machine Learning, ICML 06, pages , New York, NY, USA, ACM. [7] Thorsten Joachims. Text categorization with support vector machines: Learning with many relevant features. In European conference on machine learning, pages Springer, [8] Aditya Joshi, Pushpak Bhattacharyya, and Mark James Carman. Automatic sarcasm detection: Survey. Computing Research Repository (CoRR), abs/ , [9] Aditya Joshi, Prayas Jain, Pushpak Bhattacharyya, and Mark Carman. Who would have thought of that! : A hierarchical topic model for extraction of sarcasm-prevalent topics and sarcasm detection. arxiv preprint arxiv: , Nov [10] Christine Liebrecht, Florian Kunneman, and Antal Van den Bosch. The perfect solution for detecting sarcasm in tweets #not. In Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pages 29 37, Atlanta, Georgia, June Association for Computational Linguistics. [11] Shubhadeep Mukherjee and Pradip Kumar Bala. Sarcasm detection in microblogs using naive bayes and fuzzy clustering. Technology in Society, 48:19 27, [12] Ashwin Rajadesingan, Reza Zafarani, and Huan Liu. Sarcasm detection on twitter: A behavioral modeling approach. In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, WSDM 15, pages , New York, NY, USA, ACM. [13] Kumar Ravi and Vadlamani Ravi. A novel automatic satire and irony detection using ensembled feature selection and data mining. Knowledge-Based Systems, [14] Ellen Riloff, Ashequl Qadir, Prafulla Surve, Lalindra De Silva, Nathan Gilbert, and Ruihong Huang. Sarcasm as contrast between a positive sentiment and negative situation. Empirical Methods in Natural Language Processing, 13: , Oct [15] Zelin Wang, Zhijian Wu, Ruimin Wang, and Yafeng Ren. Twitter sarcasm detection exploiting a context-based model. Web Information Systems Engineering WISE 2015, 9418 of the series Lecture Notes in Computer Science:77 91, Dec

Sarcasm Detection in Text: Design Document

Sarcasm Detection in Text: Design Document CSC 59866 Senior Design Project Specification Professor Jie Wei Wednesday, November 23, 2016 Sarcasm Detection in Text: Design Document Jesse Feinman, James Kasakyan, Jeff Stolzenberg 1 Table of contents

More information

Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm

Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm Anupam Khattri 1 Aditya Joshi 2,3,4 Pushpak Bhattacharyya 2 Mark James Carman 3 1 IIT Kharagpur, India, 2 IIT Bombay,

More information

World Journal of Engineering Research and Technology WJERT

World Journal of Engineering Research and Technology WJERT wjert, 2018, Vol. 4, Issue 4, 218-224. Review Article ISSN 2454-695X Maheswari et al. WJERT www.wjert.org SJIF Impact Factor: 5.218 SARCASM DETECTION AND SURVEYING USER AFFECTATION S. Maheswari* 1 and

More information

Who would have thought of that! : A Hierarchical Topic Model for Extraction of Sarcasm-prevalent Topics and Sarcasm Detection

Who would have thought of that! : A Hierarchical Topic Model for Extraction of Sarcasm-prevalent Topics and Sarcasm Detection Who would have thought of that! : A Hierarchical Topic Model for Extraction of Sarcasm-prevalent Topics and Sarcasm Detection Aditya Joshi 1,2,3 Prayas Jain 4 Pushpak Bhattacharyya 1 Mark James Carman

More information

The Lowest Form of Wit: Identifying Sarcasm in Social Media

The Lowest Form of Wit: Identifying Sarcasm in Social Media 1 The Lowest Form of Wit: Identifying Sarcasm in Social Media Saachi Jain, Vivian Hsu Abstract Sarcasm detection is an important problem in text classification and has many applications in areas such as

More information

Harnessing Context Incongruity for Sarcasm Detection

Harnessing Context Incongruity for Sarcasm Detection Harnessing Context Incongruity for Sarcasm Detection Aditya Joshi 1,2,3 Vinita Sharma 1 Pushpak Bhattacharyya 1 1 IIT Bombay, India, 2 Monash University, Australia 3 IITB-Monash Research Academy, India

More information

An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews

An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews Universität Bielefeld June 27, 2014 An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews Konstantin Buschmeier, Philipp Cimiano, Roman Klinger Semantic Computing

More information

How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text

How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text Aditya Joshi 1,2,3 Pushpak Bhattacharyya 1 Mark Carman 2 Jaya Saraswati 1 Rajita

More information

Temporal patterns of happiness and sarcasm detection in social media (Twitter)

Temporal patterns of happiness and sarcasm detection in social media (Twitter) Temporal patterns of happiness and sarcasm detection in social media (Twitter) Pradeep Kumar NPSO Innovation Day November 22, 2017 Our Data Science Team Patricia Prüfer Pradeep Kumar Marcia den Uijl Next

More information

arxiv: v1 [cs.cl] 3 May 2018

arxiv: v1 [cs.cl] 3 May 2018 Binarizer at SemEval-2018 Task 3: Parsing dependency and deep learning for irony detection Nishant Nikhil IIT Kharagpur Kharagpur, India nishantnikhil@iitkgp.ac.in Muktabh Mayank Srivastava ParallelDots,

More information

Sarcasm Detection on Facebook: A Supervised Learning Approach

Sarcasm Detection on Facebook: A Supervised Learning Approach Sarcasm Detection on Facebook: A Supervised Learning Approach Dipto Das Anthony J. Clark Missouri State University Springfield, Missouri, USA dipto175@live.missouristate.edu anthonyclark@missouristate.edu

More information

Are Word Embedding-based Features Useful for Sarcasm Detection?

Are Word Embedding-based Features Useful for Sarcasm Detection? Are Word Embedding-based Features Useful for Sarcasm Detection? Aditya Joshi 1,2,3 Vaibhav Tripathi 1 Kevin Patel 1 Pushpak Bhattacharyya 1 Mark Carman 2 1 Indian Institute of Technology Bombay, India

More information

Automatic Sarcasm Detection: A Survey

Automatic Sarcasm Detection: A Survey Automatic Sarcasm Detection: A Survey Aditya Joshi 1,2,3 Pushpak Bhattacharyya 2 Mark James Carman 3 1 IITB-Monash Research Academy, India 2 IIT Bombay, India, 3 Monash University, Australia {adityaj,pb}@cse.iitb.ac.in,

More information

Large scale Visual Sentiment Ontology and Detectors Using Adjective Noun Pairs

Large scale Visual Sentiment Ontology and Detectors Using Adjective Noun Pairs Large scale Visual Sentiment Ontology and Detectors Using Adjective Noun Pairs Damian Borth 1,2, Rongrong Ji 1, Tao Chen 1, Thomas Breuel 2, Shih-Fu Chang 1 1 Columbia University, New York, USA 2 University

More information

SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1

SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1 SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1 Director (Academic Administration) Babaria Institute of Technology, 2 Research Scholar, C.U.Shah University Abstract Sentiment

More information

KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection

KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection Luise Dürlich Friedrich-Alexander Universität Erlangen-Nürnberg / Germany luise.duerlich@fau.de Abstract This paper describes the

More information

Detecting Musical Key with Supervised Learning

Detecting Musical Key with Supervised Learning Detecting Musical Key with Supervised Learning Robert Mahieu Department of Electrical Engineering Stanford University rmahieu@stanford.edu Abstract This paper proposes and tests performance of two different

More information

SARCASM DETECTION IN SENTIMENT ANALYSIS

SARCASM DETECTION IN SENTIMENT ANALYSIS SARCASM DETECTION IN SENTIMENT ANALYSIS Shruti Kaushik 1, Prof. Mehul P. Barot 2 1 Research Scholar, CE-LDRP-ITR, KSV University Gandhinagar, Gujarat, India 2 Lecturer, CE-LDRP-ITR, KSV University Gandhinagar,

More information

Music Mood. Sheng Xu, Albert Peyton, Ryan Bhular

Music Mood. Sheng Xu, Albert Peyton, Ryan Bhular Music Mood Sheng Xu, Albert Peyton, Ryan Bhular What is Music Mood A psychological & musical topic Human emotions conveyed in music can be comprehended from two aspects: Lyrics Music Factors that affect

More information

Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder

Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder Präsentation des Papers ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews

More information

This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis.

This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis. This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/130763/

More information

Finding Sarcasm in Reddit Postings: A Deep Learning Approach

Finding Sarcasm in Reddit Postings: A Deep Learning Approach Finding Sarcasm in Reddit Postings: A Deep Learning Approach Nick Guo, Ruchir Shah {nickguo, ruchirfs}@stanford.edu Abstract We use the recently published Self-Annotated Reddit Corpus (SARC) with a recurrent

More information

#SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm

#SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm Proceedings of the Thirtieth International Florida Artificial Intelligence Research Society Conference #SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm Natalie

More information

Lyric-Based Music Mood Recognition

Lyric-Based Music Mood Recognition Lyric-Based Music Mood Recognition Emil Ian V. Ascalon, Rafael Cabredo De La Salle University Manila, Philippines emil.ascalon@yahoo.com, rafael.cabredo@dlsu.edu.ph Abstract: In psychology, emotion is

More information

Modelling Sarcasm in Twitter, a Novel Approach

Modelling Sarcasm in Twitter, a Novel Approach Modelling Sarcasm in Twitter, a Novel Approach Francesco Barbieri and Horacio Saggion and Francesco Ronzano Pompeu Fabra University, Barcelona, Spain .@upf.edu Abstract Automatic detection

More information

Harnessing Sequence Labeling for Sarcasm Detection in Dialogue from TV Series Friends

Harnessing Sequence Labeling for Sarcasm Detection in Dialogue from TV Series Friends Harnessing Sequence Labeling for Sarcasm Detection in Dialogue from TV Series Friends Aditya Joshi 1,2,3 Vaibhav Tripathi 1 Pushpak Bhattacharyya 1 Mark Carman 2 1 Indian Institute of Technology Bombay,

More information

Lyrics Classification using Naive Bayes

Lyrics Classification using Naive Bayes Lyrics Classification using Naive Bayes Dalibor Bužić *, Jasminka Dobša ** * College for Information Technologies, Klaićeva 7, Zagreb, Croatia ** Faculty of Organization and Informatics, Pavlinska 2, Varaždin,

More information

arxiv: v1 [cs.cl] 8 Jun 2018

arxiv: v1 [cs.cl] 8 Jun 2018 #SarcasmDetection is soooo general! Towards a Domain-Independent Approach for Detecting Sarcasm Natalie Parde and Rodney D. Nielsen Department of Computer Science and Engineering University of North Texas

More information

Sentiment and Sarcasm Classification with Multitask Learning

Sentiment and Sarcasm Classification with Multitask Learning 1 Sentiment and Sarcasm Classification with Multitask Learning Navonil Majumder, Soujanya Poria, Haiyun Peng, Niyati Chhaya, Erik Cambria, and Alexander Gelbukh arxiv:1901.08014v1 [cs.cl] 23 Jan 2019 Abstract

More information

Approaches for Computational Sarcasm Detection: A Survey

Approaches for Computational Sarcasm Detection: A Survey Approaches for Computational Sarcasm Detection: A Survey Lakshya Kumar, Arpan Somani and Pushpak Bhattacharyya Dept. of Computer Science and Engineering Indian Institute of Technology, Powai Mumbai, Maharashtra,

More information

Tweet Sarcasm Detection Using Deep Neural Network

Tweet Sarcasm Detection Using Deep Neural Network Tweet Sarcasm Detection Using Deep Neural Network Meishan Zhang 1, Yue Zhang 2 and Guohong Fu 1 1. School of Computer Science and Technology, Heilongjiang University, China 2. Singapore University of Technology

More information

Article Title: Discovering the Influence of Sarcasm in Social Media Responses

Article Title: Discovering the Influence of Sarcasm in Social Media Responses Article Title: Discovering the Influence of Sarcasm in Social Media Responses Article Type: Opinion Wei Peng (W.Peng@latrobe.edu.au) a, Achini Adikari (A.Adikari@latrobe.edu.au) a, Damminda Alahakoon (D.Alahakoon@latrobe.edu.au)

More information

arxiv: v2 [cs.cl] 20 Sep 2016

arxiv: v2 [cs.cl] 20 Sep 2016 A Automatic Sarcasm Detection: A Survey ADITYA JOSHI, IITB-Monash Research Academy PUSHPAK BHATTACHARYYA, Indian Institute of Technology Bombay MARK J CARMAN, Monash University arxiv:1602.03426v2 [cs.cl]

More information

Implementation of Emotional Features on Satire Detection

Implementation of Emotional Features on Satire Detection Implementation of Emotional Features on Satire Detection Pyae Phyo Thu1, Than Nwe Aung2 1 University of Computer Studies, Mandalay, Patheingyi Mandalay 1001, Myanmar pyaephyothu149@gmail.com 2 University

More information

Automatic Rhythmic Notation from Single Voice Audio Sources

Automatic Rhythmic Notation from Single Voice Audio Sources Automatic Rhythmic Notation from Single Voice Audio Sources Jack O Reilly, Shashwat Udit Introduction In this project we used machine learning technique to make estimations of rhythmic notation of a sung

More information

LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally

LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally Cynthia Van Hee, Els Lefever and Véronique hoste LT 3, Language and Translation Technology Team Department of Translation, Interpreting

More information

NLPRL-IITBHU at SemEval-2018 Task 3: Combining Linguistic Features and Emoji Pre-trained CNN for Irony Detection in Tweets

NLPRL-IITBHU at SemEval-2018 Task 3: Combining Linguistic Features and Emoji Pre-trained CNN for Irony Detection in Tweets NLPRL-IITBHU at SemEval-2018 Task 3: Combining Linguistic Features and Emoji Pre-trained CNN for Irony Detection in Tweets Harsh Rangwani, Devang Kulshreshtha and Anil Kumar Singh Indian Institute of Technology

More information

Automatic Music Clustering using Audio Attributes

Automatic Music Clustering using Audio Attributes Automatic Music Clustering using Audio Attributes Abhishek Sen BTech (Electronics) Veermata Jijabai Technological Institute (VJTI), Mumbai, India abhishekpsen@gmail.com Abstract Music brings people together,

More information

DICTIONARY OF SARCASM PDF

DICTIONARY OF SARCASM PDF DICTIONARY OF SARCASM PDF ==> Download: DICTIONARY OF SARCASM PDF DICTIONARY OF SARCASM PDF - Are you searching for Dictionary Of Sarcasm Books? Now, you will be happy that at this time Dictionary Of Sarcasm

More information

Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers

Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers Amal Htait, Sebastien Fournier and Patrice Bellot Aix Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,13397,

More information

Automatic Detection of Sarcasm in BBS Posts Based on Sarcasm Classification

Automatic Detection of Sarcasm in BBS Posts Based on Sarcasm Classification Web 1,a) 2,b) 2,c) Web Web 8 ( ) Support Vector Machine (SVM) F Web Automatic Detection of Sarcasm in BBS Posts Based on Sarcasm Classification Fumiya Isono 1,a) Suguru Matsuyoshi 2,b) Fumiyo Fukumoto

More information

An extensive Survey On Sarcasm Detection Using Various Classifiers

An extensive Survey On Sarcasm Detection Using Various Classifiers Volume 119 No. 12 2018, 13183-13187 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu An extensive Survey On Sarcasm Detection Using Various Classifiers K.R.Jansi* Department of Computer

More information

WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs

WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs Abstract Large numbers of TV channels are available to TV consumers

More information

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

Music Emotion Recognition. Jaesung Lee. Chung-Ang University Music Emotion Recognition Jaesung Lee Chung-Ang University Introduction Searching Music in Music Information Retrieval Some information about target music is available Query by Text: Title, Artist, or

More information

Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections

Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections 1/23 Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections Rudolf Mayer, Andreas Rauber Vienna University of Technology {mayer,rauber}@ifs.tuwien.ac.at Robert Neumayer

More information

Some Experiments in Humour Recognition Using the Italian Wikiquote Collection

Some Experiments in Humour Recognition Using the Italian Wikiquote Collection Some Experiments in Humour Recognition Using the Italian Wikiquote Collection Davide Buscaldi and Paolo Rosso Dpto. de Sistemas Informáticos y Computación (DSIC), Universidad Politécnica de Valencia, Spain

More information

Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset

Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset Ricardo Malheiro, Renato Panda, Paulo Gomes, Rui Paiva CISUC Centre for Informatics and Systems of the University of Coimbra {rsmal,

More information

Understanding Book Popularity on Goodreads

Understanding Book Popularity on Goodreads Understanding Book Popularity on Goodreads Suman Kalyan Maity sumankalyan.maity@ cse.iitkgp.ernet.in Ayush Kumar ayush235317@gmail.com Ankan Mullick Bing Microsoft India ankan.mullick@microsoft.com Vishnu

More information

PREDICTING HUMOR RESPONSE IN DIALOGUES FROM TV SITCOMS. Dario Bertero, Pascale Fung

PREDICTING HUMOR RESPONSE IN DIALOGUES FROM TV SITCOMS. Dario Bertero, Pascale Fung PREDICTING HUMOR RESPONSE IN DIALOGUES FROM TV SITCOMS Dario Bertero, Pascale Fung Human Language Technology Center The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong dbertero@connect.ust.hk,

More information

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval

DAY 1. Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval DAY 1 Intelligent Audio Systems: A review of the foundations and applications of semantic audio analysis and music information retrieval Jay LeBoeuf Imagine Research jay{at}imagine-research.com Rebecca

More information

An Introduction to Deep Image Aesthetics

An Introduction to Deep Image Aesthetics Seminar in Laboratory of Visual Intelligence and Pattern Analysis (VIPA) An Introduction to Deep Image Aesthetics Yongcheng Jing College of Computer Science and Technology Zhejiang University Zhenchuan

More information

BIBLIOGRAPHIC DATA: A DIFFERENT ANALYSIS PERSPECTIVE. Francesca De Battisti *, Silvia Salini

BIBLIOGRAPHIC DATA: A DIFFERENT ANALYSIS PERSPECTIVE. Francesca De Battisti *, Silvia Salini Electronic Journal of Applied Statistical Analysis EJASA (2012), Electron. J. App. Stat. Anal., Vol. 5, Issue 3, 353 359 e-issn 2070-5948, DOI 10.1285/i20705948v5n3p353 2012 Università del Salento http://siba-ese.unile.it/index.php/ejasa/index

More information

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Kate Park, Annie Hu, Natalie Muenster Email: katepark@stanford.edu, anniehu@stanford.edu, ncm000@stanford.edu Abstract We propose

More information

Melody classification using patterns

Melody classification using patterns Melody classification using patterns Darrell Conklin Department of Computing City University London United Kingdom conklin@city.ac.uk Abstract. A new method for symbolic music classification is proposed,

More information

Sentiment Analysis. Andrea Esuli

Sentiment Analysis. Andrea Esuli Sentiment Analysis Andrea Esuli What is Sentiment Analysis? What is Sentiment Analysis? Sentiment analysis and opinion mining is the field of study that analyzes people s opinions, sentiments, evaluations,

More information

Introduction to Sentiment Analysis. Text Analytics - Andrea Esuli

Introduction to Sentiment Analysis. Text Analytics - Andrea Esuli Introduction to Sentiment Analysis Text Analytics - Andrea Esuli What is Sentiment Analysis? What is Sentiment Analysis? Sentiment analysis and opinion mining is the field of study that analyzes people

More information

Acoustic Prosodic Features In Sarcastic Utterances

Acoustic Prosodic Features In Sarcastic Utterances Acoustic Prosodic Features In Sarcastic Utterances Introduction: The main goal of this study is to determine if sarcasm can be detected through the analysis of prosodic cues or acoustic features automatically.

More information

Automatic Polyphonic Music Composition Using the EMILE and ABL Grammar Inductors *

Automatic Polyphonic Music Composition Using the EMILE and ABL Grammar Inductors * Automatic Polyphonic Music Composition Using the EMILE and ABL Grammar Inductors * David Ortega-Pacheco and Hiram Calvo Centro de Investigación en Computación, Instituto Politécnico Nacional, Av. Juan

More information

Sarcasm in Social Media. sites. This research topic posed an interesting question. Sarcasm, being heavily conveyed

Sarcasm in Social Media. sites. This research topic posed an interesting question. Sarcasm, being heavily conveyed Tekin and Clark 1 Michael Tekin and Daniel Clark Dr. Schlitz Structures of English 5/13/13 Sarcasm in Social Media Introduction The research goals for this project were to figure out the different methodologies

More information

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes hello Jay Biernat Third author University of Rochester University of Rochester Affiliation3 words jbiernat@ur.rochester.edu author3@ismir.edu

More information

Subjective Similarity of Music: Data Collection for Individuality Analysis

Subjective Similarity of Music: Data Collection for Individuality Analysis Subjective Similarity of Music: Data Collection for Individuality Analysis Shota Kawabuchi and Chiyomi Miyajima and Norihide Kitaoka and Kazuya Takeda Nagoya University, Nagoya, Japan E-mail: shota.kawabuchi@g.sp.m.is.nagoya-u.ac.jp

More information

LLT-PolyU: Identifying Sentiment Intensity in Ironic Tweets

LLT-PolyU: Identifying Sentiment Intensity in Ironic Tweets LLT-PolyU: Identifying Sentiment Intensity in Ironic Tweets Hongzhi Xu, Enrico Santus, Anna Laszlo and Chu-Ren Huang The Department of Chinese and Bilingual Studies The Hong Kong Polytechnic University

More information

Computational Laughing: Automatic Recognition of Humorous One-liners

Computational Laughing: Automatic Recognition of Humorous One-liners Computational Laughing: Automatic Recognition of Humorous One-liners Rada Mihalcea (rada@cs.unt.edu) Department of Computer Science, University of North Texas Denton, Texas, USA Carlo Strapparava (strappa@itc.it)

More information

Sarcasm as Contrast between a Positive Sentiment and Negative Situation

Sarcasm as Contrast between a Positive Sentiment and Negative Situation Sarcasm as Contrast between a Positive Sentiment and Negative Situation Ellen Riloff, Ashequl Qadir, Prafulla Surve, Lalindra De Silva, Nathan Gilbert, Ruihong Huang School Of Computing University of Utah

More information

First Stage of an Automated Content-Based Citation Analysis Study: Detection of Citation Sentences 1

First Stage of an Automated Content-Based Citation Analysis Study: Detection of Citation Sentences 1 First Stage of an Automated Content-Based Citation Analysis Study: Detection of Citation Sentences 1 Zehra Taşkın *, Umut Al * and Umut Sezen ** * {ztaskin; umutal}@hacettepe.edu.tr Department of Information

More information

MUSI-6201 Computational Music Analysis

MUSI-6201 Computational Music Analysis MUSI-6201 Computational Music Analysis Part 9.1: Genre Classification alexander lerch November 4, 2015 temporal analysis overview text book Chapter 8: Musical Genre, Similarity, and Mood (pp. 151 155)

More information

Multilabel Subject-Based Classification of Poetry

Multilabel Subject-Based Classification of Poetry Proceedings of the Twenty-Eighth International Florida Artificial Intelligence Research Society Conference Multilabel Subject-Based Classification of Poetry Andrés Lou, Diana Inkpen and Chris Tǎnasescu

More information

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG? WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG? NICHOLAS BORG AND GEORGE HOKKANEN Abstract. The possibility of a hit song prediction algorithm is both academically interesting and industry motivated.

More information

Multimodal Music Mood Classification Framework for Christian Kokborok Music

Multimodal Music Mood Classification Framework for Christian Kokborok Music Journal of Engineering Technology (ISSN. 0747-9964) Volume 8, Issue 1, Jan. 2019, PP.506-515 Multimodal Music Mood Classification Framework for Christian Kokborok Music Sanchali Das 1*, Sambit Satpathy

More information

Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons

Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons Center for Games and Playable Media http://games.soe.ucsc.edu Kendall review of HW 2 Next two weeks

More information

arxiv:submit/ [cs.cv] 8 Aug 2016

arxiv:submit/ [cs.cv] 8 Aug 2016 Detecting Sarcasm in Multimodal Social Platforms arxiv:submit/1633907 [cs.cv] 8 Aug 2016 ABSTRACT Rossano Schifanella University of Turin Corso Svizzera 185 10149, Turin, Italy schifane@di.unito.it Sarcasm

More information

Modelling Irony in Twitter: Feature Analysis and Evaluation

Modelling Irony in Twitter: Feature Analysis and Evaluation Modelling Irony in Twitter: Feature Analysis and Evaluation Francesco Barbieri, Horacio Saggion Pompeu Fabra University Barcelona, Spain francesco.barbieri@upf.edu, horacio.saggion@upf.edu Abstract Irony,

More information

A Fast Alignment Scheme for Automatic OCR Evaluation of Books

A Fast Alignment Scheme for Automatic OCR Evaluation of Books A Fast Alignment Scheme for Automatic OCR Evaluation of Books Ismet Zeki Yalniz, R. Manmatha Multimedia Indexing and Retrieval Group Dept. of Computer Science, University of Massachusetts Amherst, MA,

More information

Enhancing Music Maps

Enhancing Music Maps Enhancing Music Maps Jakob Frank Vienna University of Technology, Vienna, Austria http://www.ifs.tuwien.ac.at/mir frank@ifs.tuwien.ac.at Abstract. Private as well as commercial music collections keep growing

More information

Outline. Why do we classify? Audio Classification

Outline. Why do we classify? Audio Classification Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification Implementation Future Work Why do we classify

More information

Joint Image and Text Representation for Aesthetics Analysis

Joint Image and Text Representation for Aesthetics Analysis Joint Image and Text Representation for Aesthetics Analysis Ye Zhou 1, Xin Lu 2, Junping Zhang 1, James Z. Wang 3 1 Fudan University, China 2 Adobe Systems Inc., USA 3 The Pennsylvania State University,

More information

A Survey of Sarcasm Detection in Social Media

A Survey of Sarcasm Detection in Social Media A Survey of Sarcasm Detection in Social Media V. Haripriya 1, Dr. Poornima G Patil 2 1 Department of MCA Jain University Bangalore, India. 2 Department of MCA Visweswaraya Technological University Belagavi,

More information

arxiv: v1 [cs.ir] 16 Jan 2019

arxiv: v1 [cs.ir] 16 Jan 2019 It s Only Words And Words Are All I Have Manash Pratim Barman 1, Kavish Dahekar 2, Abhinav Anshuman 3, and Amit Awekar 4 1 Indian Institute of Information Technology, Guwahati 2 SAP Labs, Bengaluru 3 Dell

More information

A Corpus of English-Hindi Code-Mixed Tweets for Sarcasm Detection

A Corpus of English-Hindi Code-Mixed Tweets for Sarcasm Detection A Corpus of English-Hindi Code-Mixed Tweets for Sarcasm Detection by Sahil Swami, Ankush Khandelwal, Vinay Singh, Syed S. Akhtar, Manish Shrivastava in 19th International Conference on Computational Linguistics

More information

Music Composition with RNN

Music Composition with RNN Music Composition with RNN Jason Wang Department of Statistics Stanford University zwang01@stanford.edu Abstract Music composition is an interesting problem that tests the creativity capacities of artificial

More information

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Kate Park katepark@stanford.edu Annie Hu anniehu@stanford.edu Natalie Muenster ncm000@stanford.edu Abstract We propose detecting

More information

Fracking Sarcasm using Neural Network

Fracking Sarcasm using Neural Network Fracking Sarcasm using Neural Network Aniruddha Ghosh University College Dublin aniruddha.ghosh@ucdconnect.ie Tony Veale University College Dublin tony.veale@ucd.ie Abstract Precise semantic representation

More information

Automatic Music Genre Classification

Automatic Music Genre Classification Automatic Music Genre Classification Nathan YongHoon Kwon, SUNY Binghamton Ingrid Tchakoua, Jackson State University Matthew Pietrosanu, University of Alberta Freya Fu, Colorado State University Yue Wang,

More information

Using Genre Classification to Make Content-based Music Recommendations

Using Genre Classification to Make Content-based Music Recommendations Using Genre Classification to Make Content-based Music Recommendations Robbie Jones (rmjones@stanford.edu) and Karen Lu (karenlu@stanford.edu) CS 221, Autumn 2016 Stanford University I. Introduction Our

More information

Composer Style Attribution

Composer Style Attribution Composer Style Attribution Jacqueline Speiser, Vishesh Gupta Introduction Josquin des Prez (1450 1521) is one of the most famous composers of the Renaissance. Despite his fame, there exists a significant

More information

Detecting Sarcasm in English Text. Andrew James Pielage. Artificial Intelligence MSc 2012/2013

Detecting Sarcasm in English Text. Andrew James Pielage. Artificial Intelligence MSc 2012/2013 Detecting Sarcasm in English Text Andrew James Pielage Artificial Intelligence MSc 0/0 The candidate confirms that the work submitted is their own and the appropriate credit has been given where reference

More information

Automatic Labelling of tabla signals

Automatic Labelling of tabla signals ISMIR 2003 Oct. 27th 30th 2003 Baltimore (USA) Automatic Labelling of tabla signals Olivier K. GILLET, Gaël RICHARD Introduction Exponential growth of available digital information need for Indexing and

More information

MUSICAL MOODS: A MASS PARTICIPATION EXPERIMENT FOR AFFECTIVE CLASSIFICATION OF MUSIC

MUSICAL MOODS: A MASS PARTICIPATION EXPERIMENT FOR AFFECTIVE CLASSIFICATION OF MUSIC 12th International Society for Music Information Retrieval Conference (ISMIR 2011) MUSICAL MOODS: A MASS PARTICIPATION EXPERIMENT FOR AFFECTIVE CLASSIFICATION OF MUSIC Sam Davies, Penelope Allen, Mark

More information

Comparative study of Sentiment Analysis on trending issues on Social Media

Comparative study of Sentiment Analysis on trending issues on Social Media Comparative study of Sentiment Analysis on trending issues on Social Media Vibhore Jain Department of Computer Science and Engineering Bhilai Institute of Technology, Durg vibhorejain@outlook.com M.V.

More information

Formalizing Irony with Doxastic Logic

Formalizing Irony with Doxastic Logic Formalizing Irony with Doxastic Logic WANG ZHONGQUAN National University of Singapore April 22, 2015 1 Introduction Verbal irony is a fundamental rhetoric device in human communication. It is often characterized

More information

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng Introduction In this project we were interested in extracting the melody from generic audio files. Due to the

More information

Sentiment Aggregation using ConceptNet Ontology

Sentiment Aggregation using ConceptNet Ontology Sentiment Aggregation using ConceptNet Ontology Subhabrata Mukherjee Sachindra Joshi IBM Research - India 7th International Joint Conference on Natural Language Processing (IJCNLP 2013), Nagoya, Japan

More information

The final publication is available at

The final publication is available at Document downloaded from: http://hdl.handle.net/10251/64255 This paper must be cited as: Hernández Farías, I.; Benedí Ruiz, JM.; Rosso, P. (2015). Applying basic features from sentiment analysis on automatic

More information

Centre for Economic Policy Research

Centre for Economic Policy Research The Australian National University Centre for Economic Policy Research DISCUSSION PAPER The Reliability of Matches in the 2002-2004 Vietnam Household Living Standards Survey Panel Brian McCaig DISCUSSION

More information

ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews

ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews Oren Tsur Institute of Computer Science The Hebrew University Jerusalem, Israel oren@cs.huji.ac.il

More information

EasyChair Preprint. How good is good enough? Establishing quality thresholds for the automatic text analysis of retro-digitized comics

EasyChair Preprint. How good is good enough? Establishing quality thresholds for the automatic text analysis of retro-digitized comics EasyChair Preprint 573 How good is good enough? Establishing quality thresholds for the automatic text analysis of retro-digitized comics Rita Hartel and Alexander Dunst EasyChair preprints are intended

More information

CASCADE: Contextual Sarcasm Detection in Online Discussion Forums

CASCADE: Contextual Sarcasm Detection in Online Discussion Forums CASCADE: Contextual Sarcasm Detection in Online Discussion Forums Devamanyu Hazarika School of Computing, National University of Singapore hazarika@comp.nus.edu.sg Erik Cambria School of Computer Science

More information

VBM683 Machine Learning

VBM683 Machine Learning VBM683 Machine Learning Pinar Duygulu Slides are adapted from Dhruv Batra, David Sontag, Aykut Erdem Quotes If you were a current computer science student what area would you start studying heavily? Answer:

More information

Universität Bamberg Angewandte Informatik. Seminar KI: gestern, heute, morgen. We are Humor Beings. Understanding and Predicting visual Humor

Universität Bamberg Angewandte Informatik. Seminar KI: gestern, heute, morgen. We are Humor Beings. Understanding and Predicting visual Humor Universität Bamberg Angewandte Informatik Seminar KI: gestern, heute, morgen We are Humor Beings. Understanding and Predicting visual Humor by Daniel Tremmel 18. Februar 2017 advised by Professor Dr. Ute

More information

Regression Model for Politeness Estimation Trained on Examples

Regression Model for Politeness Estimation Trained on Examples Regression Model for Politeness Estimation Trained on Examples Mikhail Alexandrov 1, Natalia Ponomareva 2, Xavier Blanco 1 1 Universidad Autonoma de Barcelona, Spain 2 University of Wolverhampton, UK Email:

More information