Sarcasm Detection in Text: Design Document

Size: px
Start display at page:

Download "Sarcasm Detection in Text: Design Document"

Transcription

1 CSC Senior Design Project Specification Professor Jie Wei Wednesday, November 23, 2016 Sarcasm Detection in Text: Design Document Jesse Feinman, James Kasakyan, Jeff Stolzenberg

2 1 Table of contents Overview 2 Literature Review 3 Contextualized Sarcasm Detection on Twitter 3 Overview 3 Dataset 3 Features 4 Tweet features 4 Author features 4 Audience features 4 Response features 5 Results 5 Semi-Supervised Recognition of Sarcastic Sentences in Twitter and Amazon 6 Overview 6 Datasets 6 Twitter Dataset 6 Amazon Dataset 7 Classification Algorithm 7 Results 9 Design outline 9 Dataset 10 Classification 11 N-Gram Frequency Classification 11 Suffixes 11 Term Frequency-Inverse Document Frequency 12 Sentiment Analysis 12 N-Gram Sentiment 12 Capitalization, Punctuation, Hashtags, and Emoji 12 Hashtags and Emoji 13 Hashtags 13 Emoji 13 Long words and vowel-less words 14 Pattern Collection and Matching 15 Part of speech patterns 15

3 2 GloVe 15 N-Grams 16 Patterns 16 Part of Speech Patterns 16 Context 16 N-fold Cross-Validation 17 Neural Network 17 References 18 Overview This document outlines our design methodology for building a classification model to detect sarcasm in text. First, in the literature review section, we review recent attempts by professionals in the field to tackle the problem of modeling sarcasm, and examine their methodologies and the machine learning techniques used in their models. We review two papers by Bamman and Smith [1] and Davidov et. al [2]. We then describe the design methodology we intend to follow for our approach to the problem, drawing on the techniques in the referenced papers but also expanding upon them with our own ideas.

4 3 Literature Review Contextualized Sarcasm Detection on Twitter Overview This paper [1] written in 2015 by David Bamman and Nolan Smith, professors of Computer Science at Carnegie Mellon University, attempts to tackle the problem of building a classification model to detect sarcasm in tweets. Because of the unique structure of tweets, they are able to gather data that is both pre-labeled as sarcastic, and that contains information about the context of the text. Dataset When considering the source for their dataset, Bamman and Smith noted that in previous attempts to design systems to classify sarcasm the datasets were labeled by human judges who were prone to error, claiming they found low agreement rates between human annotators at the task of judging the sarcasm of others tweets [1]. They also noted that previous attempts to model sarcasm treated it as a text categorization problem, while they felt that sarcasm requires shared knowledge between speaker and audience; it is a profoundly contextual phenomenon [1]. For this reason, Bamman and Smith wanted to capture contextual features for their model. To achieve these goals, they crawled through the last 3,200 tweets of all tweet authors between a nine month period spanning From this set, they took 9,767 tweets that were replies to another tweet (context), and that contained at least three words and had #sarcastic or

5 4 #sarcasm as their final term. For the negative sample, they examined tweets during that same time period that were not self labeled with #sarcastic or #sarcasm. This yielded a balanced training set with 9,767 self labeled sarcastic tweets, and 9,767 non self labeled tweets. Features Features were divided into four classes according to the type of information they captured. The four classes were tweet features, author features, audience features, and response features. Tweet features Tweet features are those derived completely from the text of the tweet to be predicted. These include binary indicators of unigrams and bigrams, as well as binary indicators of unigrams and bigrams in a reduced 1000 Brown cluster space. Part of speech features like ratio of nouns to verbs and density of hashtags or emoticons are also included as tweet features, as well as capitalization features, and both tweet level and word level sentiment features. Author features Author features are derived from information about the user who wrote the tweet to be predicted. These include binary indicators of the top 100 terms in the author corpus scored by TF-IDF. Bamman and Smith note that this is the single most informative feature, where a binary logistic regression classifier scores an accuracy of 81.2% when trained only on this feature. Other author features include profile information like gender and number of followers, as well as historical author sentiment features. Audience features Audience features attempt to capture information about the shared context between the author of the tweet to be predicted, and the author of the tweet being replied to. These included all the

6 5 features listed above as author features, but computed for the author of the original tweet that was replied to by the author of the tweet being predicted. Features that capture the historical communication between these two users like number of previous messages sent are also included as audience features. Response features Response features are derived from information about the contents of the original and reply tweets. These include binary indicators of pairwise Brown features between the two tweets, as well as binary indicators of unigrams in the original tweet. Results Bamman and Smith trained binary logistic regression models on all possible combinations of features. Using only tweet level features, their model achieved an average accuracy of 75.4% across 10 fold cross validation. Adding response features increased the accuracy of the model by under 2% to 77.3%, and combining tweet level features with audience features increased accuracy by 3.6% to 79.0%. Combing tweet features and author features provided the largest jump in accuracy, going from 75.4% using only tweet features to 84.9% when combining tweet and author level features. This is just.2% lower than the accuracy of a model trained on all features, which scored an accuracy of 85.1%. From these results, Bamman and Smith conclude that capturing context is vital for models that attempt to predict sarcasm, since the features design to capture context provide significant improvements in accuracy over tweet level features alone.

7 6 Semi-Supervised Recognition of Sarcastic Sentences in Twitter and Amazon Overview This paper [2] written in 2010 by Dmitry Davidov, Oren Tsur, and Ari Rappoport, PhD students at The Hebrew University, focuses on using a semi-supervised approach to sarcasm identification. This experiment was performed on two very different data sets, the first being a set of tweets from Twitter and the second a collection of Amazon reviews. Utilizing sentences that were ranked and pre-labeled based on level of sarcasm, the team constructed feature vectors that were in turn used to build a classifier model that assigned scores to unlabeled examples. Datasets Twitter Dataset The first dataset that this team utilized came from Twitter. Twitter is a very popular microblogging service. It allows users to publish and read short messages called tweets. Tweets are restricted to 140 characters and may contain references to url addresses, references to other Twitter users (these appear and content tags (called hashtags) assigned by the tweeter (#). Due to Twitter s informal nature and its constraint on character length the team found that users are often forced to use a large amount of slang, shortened lingo, ascii emoticons and other tokens absent from formal lexicons. The three experimenters stated that These characteristics make Twitter a fascinating domain for NLP applications, although posing great challenges due

8 7 to the length constraint, the complete freedom of style and the out of discourse nature of tweets [2]. The Twitter dataset that was used was comprised of 5.9 million unique tweets. In this dataset, the average number of words per tweet was Additionally, 18.7% of the tweets contained a url, 35.5% contained a reference to another twitter user, and 6.9% contained at least one hashtag. Amazon Dataset The second dataset that was used in this experiment was a collection of reviews from Amazon.com. This dataset contained 66,000 reviews of 120 different products found on Amazon. The reason the researchers selected this dataset was because of its stark contrast to the Twitter dataset. The Amazon reviews averaged 953 characters which are much longer than the tweets. They were more structured and grammatical than tweets and are delivered in a known context. Classification Algorithm The algorithm used by this team of researchers was semi-supervised. The input was a small seed of labeled sentences that had been annotated by three humans. The annotated sentences were ranked on a scale from 1 to 5 in which a score of a 5 indicted a clearly sarcastic sentence and a score of a 1 indicated a clear absence of sarcasm. Once the team had the labeled sentences they extracted a set of features to be used in feature vectors. The main feature types that were utilized were syntactic and pattern based features. Feature vectors for each of the labeled examples in the training set were constructed and used to build a classifier model that assigned scores to the unlabeled examples. Data Preprocessing

9 8 The first aspect of the framework for the algorithm this team used was the preprocessing of the data. To facilitate pattern matching the team had specific information replaced with meta data tags. Each appearance of a product, author, company, book name, user, url, and hashtag were replaced with the following corresponding generalized tags: [PRODUCT], [COMPANY], [TITLE], [AUTHOR], [USER], [LINK] and [HASHTAG]. Pattern Extraction The main feature type for the algorithm was based on surface patterns. The team classified words into two types. The first type was high frequency words (HFW) for words with a frequency greater than 1,000 words per million. The second type were content words (CW) for words with a frequency of less than 100 words per million. A pattern was then defined as an ordered sequence of 2-6 HFW s and 1-6 CW s [2]. Pattern Matching Once patterns are identified, a single entry was constructed in the feature vectors for each sentence. A feature value was then calculated for each pattern. An exact match to a sentence labeled sarcastic in the training set scored a 1. Sparse and Incomplete matches scored slightly lower respectively. Sentences with no pattern matches scored a zero. Additional Features In addition to pattern-based features some generic features were used as well. These included the sentence length in words, the number of exclamation point! characters in the sentence,

10 9 the number of question mark? characters in the sentence, the number of quotes in the sentence, and the number of capitalized words in the sentence. Classification Lastly the team needed to assign scores to the new examples in the test set. To do this they use a k-nearest neighbors (knn)-like strategy. Feature vectors were constructed for each example in the training and test sets. For each feature vector v in the test set, they computed the Euclidean distance to each of the matching vectors in the extended training set, where matching vectors share at least one pattern feature with v. The score was then a weighted average of the k closest training set vectors. Results The experiment conducted by these three PhD students yielded promising results. The researchers found that on average, the semi-supervised algorithm achieved a precision of 77% and a recall of 83.1%. They were surprised to find that punctuation marks served as the weakest indicator for sarcasm. However, the use of excessive exclamation marks and capital letters were moderately useful sarcasm indicators. The use of three consecutive dots which when combined with other features constituted a strong predictor. Design outline Our methodology to detect sarcasm is based upon the research we have done. Going step by step, from simpler to more advanced classification methods, we will evaluate the efficacy of

11 10 each using cross validation. The end result will be a system that uses a variety of classification tools and which we have eliminated the classifiers which produced no benefit or hurt the results. Dataset The data we will gather to do the project with will likely be live streaming tweets that we gather for a period of a few days or weeks until we have a sufficiently large amount to train our system with. However twitter data is problematic, while the data is readily available there is little to no context due to the short messages. In order to gain context with twitter data we need to look at replies, past tweets, and the user s profile. While these may be possible it is a more advanced option that we hope to be able to get to by the end of the project, but using and gathering the context for each tweet is more of a stretch goal. Ideally we will be able to attain a data set that has more context in the surrounding text and does not require specific background of the actors. To narrow the scope, using twitter data initially, we will select #sarcasm, #sarcastic, etc as well as other hashtags that allow us to tailor our system to a specific niche area to focus upon. This focus will make our system less generalizable but, in theory, be more accurate with that particular data set. If we are able to attain a sufficiently high accuracy with a niche focus then testing the system on a more general data selection would be the next goal. Our data will be tagged as sarcasm or not sarcasm so we will be using primarily supervised learning techniques. Tweets will require pre-processing to remove erroneous hashtags and replace proper nouns with generics so that we are analyzing the sarcasm of the language not of the subject of the message. We can also process the tweets with the proper nouns included in case subject matter expertise yields better results.

12 11 Classification We will be implementing multiple systems to attempt to classify sarcasm. Each system will return a normalized value between 0 to 1 which can then be processed by a neural network. Using python s scipy and nltk libraries we will create classifiers that indicate on a scale of 0 to 1 with 0 being entirely non-sarcastic and 1 being completely sarcastic, what the confidence of an individual classifier s results are. N-Gram Frequency Classification We plan to explore using n-gram frequencies with different sizes of n. By looking at words and phrases that are common in sarcastic remarks we hope to be able to train the system to recognize and be able to classify sarcastic remarks. Creating 2 frequency tables, 1 for sarcasm and 1 for non-sarcasm then comparing the frequency of a given n-gram to the frequency tables. Returning the percent match to each of the tables. We would then repeat the n-gram frequency analysis with a lemmatized version of the message to see if there s a difference in results. Suffixes Using the same technique as n-gram frequency analysis we would also create a frequency analysis using the suffixes of words in the tweets by lemmatizing the words and subtracting the

13 12 lemmatized word from the original word. For the same reasoning as n-gram frequency analysis, we hope to discover patterns in suffix frequency that can help classify sarcastic remarks. Term Frequency-Inverse Document Frequency With the same n-grams above, we will also look at the frequency of each n-gram and its relation to the inverse frequency of that term in the corpus of all messages we are looking at. This may tell us the importance of certain terms and we may be able to draw trends towards some words indicating sarcasm or not sarcasm. This is mainly useful on the tweets that have proper nouns left in them to see if certain people, places, or things indicate sarcasm. Sentiment Analysis We are going to look into the sentiment of each message we are going to analyze. If we are able to determine a trend in the sentiment that can help us classify sarcastic remarks then we will include full message sentiment analysis. We will be using the Minqing Hu and Bing Liu s sentiment word list to train the sentiment system that is provided in NLTK. N-Gram Sentiment Whole message sentiment may not be very revealing but if we look at partial sentiment, looking at n-gram sentiment of various sizes, we hope to be able to identify a trend in sarcastic remarks that can help us better classify sarcasm. Capitalization, Punctuation, Hashtags, and Emoji Most of the time with NLP we would make everything the same case and not pay much attention to punctuation. In our case we will attempt to find patterns in capitalization and punctuation that can help us determine sarcasm classifications. For instance text that appears

14 13 in quotes may be treated differently than text that is surrounding the quoted message or multiple exclamation and question marks may indicate a different meaning than a message without them. This analysis will be difficult as it can be done both with the context of the words that are surrounded by relevant punctuation or without the given word and just looking at the punctuation patterns. Hashtags and Emoji With social media being more involved many users use hashtags and emoji to provide implicit information that helps the reader understand the true intent of their message. Hashtags We will remove the obvious hashtags such as sarcasm, serious, and anything that can be used to definitively identify a message as sarcasm but we will try and analyze the remaining hashtags that are present to see if theres trends in hashtags that can elude to the sarcastic or serious nature of a remark. Emoji There are 2 types of emoji that we will be looking at, the first is strings of non word characters such as :) : ( >_<, etc which are supposed to represent faces and express an emotion towards the topic that s discussed. These will not necessarily be easy to identify and may require us to compile a database of existing ones prior to analyzing, then looking for them in messages and using frequency analysis and patterns to try and identify trends. The second type are single character emoji that are often used by users from the emoji selection on mobile keyboards. These come in as single characters, usually a unicode identifier,

15 14 that uniquely represents that symbol. Attempting to analyze the emoji by itself would be difficult and would be futile given that they render differently on different systems. However we will take the character identifier and use frequency and pattern analysis to try and derive the meaning of the emoji as it relates to sarcastic remarks given the context in which it occurs in our training data set. For instance, if a positive sentiment message is followed by a particular emoji: Yay, Trump Won! Ύ To us it s clear that the message writer is not happy because that face is usually associated with disappointment. WIth enough examples in a training set of emoji being used in context we hope to be able to establish their meaning. Additionally we have the option to pull a database of existing emoji and the words used to describe them then assign them with sentiments based on the database. Using these sentiments we could substitute the emoji for the given sentiment or for a synonym to the emotion it s intended to mean. Long words and vowel-less words Looking at words with a large number of syllables and words that don t have vowels has shown to be a possible method of sarcasm detection that we are going to explore. Looking at the frequency of vowel-less words in a message and the frequency of messages with large numbers of syllables may help us in our classification.

16 15 Pattern Collection and Matching We will attempt to find phrase patterns that occur in sarcastic remarks but not, or far less frequently, in non sarcastic remarks. Patterns are n-grams which have generic values for some of the words. For instance I went to the [generic]. would be a pattern. Then by checking with the common words from our previous analysis, check if that pattern matches in the message and if it does, if the generic term is high in our sarcasm frequency table. Part of speech patterns Previous research has suggested that the parts of speech and the frequency, density, and patterns of those parts may be another useful tool. Using N-gram and pattern matching analysis on the POS tags of a message we hope to be able to extract useful classification information. GloVe The research we looked at used Brown clustering to establish context however we will be using Global Vectors for Word Representation or GloVe to plot words in multidimensional vector space based upon their context due to the more advanced nature of GloVe we are expecting better results. When vectors are closer to one another in this space it means they have a more similar context, which for us may indicate that words which occur in certain contexts are more likely to be sarcastic or not.

17 16 N-Grams Using the n-grams we previously looked at, we intend to compute the n-grams position in multidimensional space with GloVe based on the context in which the n-grams. This can allow us to draw similarities to the context of the n-grams we found to the trained n-grams to see if there s context that s similar to sarcastic remarks. Patterns Using the same technique for the n-grams above, we would look for the context in which the patterns occur in multidimensional space based on their context. By looking at the language patterns that are previously discussed to see if certain patterns occur in concert with other patterns that may indicate sarcasm. Part of Speech Patterns In the same way both the n-grams and patterns are checked in multidimensional space, we would, too, check the part of speech patterns against this space. Context This is not as practical for tweets given the inherent lack of contextual information that they provide however if possible, we would like to be able to analyze the surrounding sentences to a potentially sarcastic remark then use that context to help drive our decision when classifying the potentially sarcastic phrase. Using the sentiment and the subjects of the preceding and succeeding sentence we may be able to establish that a given sentence is sarcastic.

18 17 N-fold Cross-Validation With the various techniques we will be using to try and detect sarcasm we need to be able to analyze which are working the best and why. Using cross validation on the various classifiers we will use will help to validate the results of each individual tool and to allow us to fine tune each one. Neural Network The number of individual classifiers we are going to try to explore is quite large so in order to try and appropriately balance how much weight each classifier should get, besides basic hard coded cross validation, we will create a neural network using the Scikit-learn library s MLP Classifier that can take the outputs of all these classifiers and integrate their outputs into a fully connected neural network and adjust the weights within the network to give us optimal results in as many circumstances as possible.

19 18 References [1] D. Bamman and N. A. Smith, Contextualized Sarcasm Detection on Twitter in International Conference on Web and Social Media, [2] D. Davidov, O. Tsur, and A. Rappaport, Semi-Supervised Recognition of Sarcastic Sentences in Twitter and Amazon in Proceedings of the Fourteenth Conference on Computational Natural Language Learning, 2010, pp

Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder

Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder Präsentation des Papers ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews

More information

Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm

Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm Your Sentiment Precedes You: Using an author s historical tweets to predict sarcasm Anupam Khattri 1 Aditya Joshi 2,3,4 Pushpak Bhattacharyya 2 Mark James Carman 3 1 IIT Kharagpur, India, 2 IIT Bombay,

More information

Finding Sarcasm in Reddit Postings: A Deep Learning Approach

Finding Sarcasm in Reddit Postings: A Deep Learning Approach Finding Sarcasm in Reddit Postings: A Deep Learning Approach Nick Guo, Ruchir Shah {nickguo, ruchirfs}@stanford.edu Abstract We use the recently published Self-Annotated Reddit Corpus (SARC) with a recurrent

More information

An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews

An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews Universität Bielefeld June 27, 2014 An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews Konstantin Buschmeier, Philipp Cimiano, Roman Klinger Semantic Computing

More information

Detecting Sarcasm in English Text. Andrew James Pielage. Artificial Intelligence MSc 2012/2013

Detecting Sarcasm in English Text. Andrew James Pielage. Artificial Intelligence MSc 2012/2013 Detecting Sarcasm in English Text Andrew James Pielage Artificial Intelligence MSc 0/0 The candidate confirms that the work submitted is their own and the appropriate credit has been given where reference

More information

arxiv: v1 [cs.cl] 3 May 2018

arxiv: v1 [cs.cl] 3 May 2018 Binarizer at SemEval-2018 Task 3: Parsing dependency and deep learning for irony detection Nishant Nikhil IIT Kharagpur Kharagpur, India nishantnikhil@iitkgp.ac.in Muktabh Mayank Srivastava ParallelDots,

More information

Harnessing Context Incongruity for Sarcasm Detection

Harnessing Context Incongruity for Sarcasm Detection Harnessing Context Incongruity for Sarcasm Detection Aditya Joshi 1,2,3 Vinita Sharma 1 Pushpak Bhattacharyya 1 1 IIT Bombay, India, 2 Monash University, Australia 3 IITB-Monash Research Academy, India

More information

World Journal of Engineering Research and Technology WJERT

World Journal of Engineering Research and Technology WJERT wjert, 2018, Vol. 4, Issue 4, 218-224. Review Article ISSN 2454-695X Maheswari et al. WJERT www.wjert.org SJIF Impact Factor: 5.218 SARCASM DETECTION AND SURVEYING USER AFFECTATION S. Maheswari* 1 and

More information

Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons

Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons Center for Games and Playable Media http://games.soe.ucsc.edu Kendall review of HW 2 Next two weeks

More information

This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis.

This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis. This is a repository copy of Who cares about sarcastic tweets? Investigating the impact of sarcasm on sentiment analysis. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/130763/

More information

TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION

TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION TWITTER SARCASM DETECTOR (TSD) USING TOPIC MODELING ON USER DESCRIPTION Supriya Jyoti Hiwave Technologies, Toronto, Canada Ritu Chaturvedi MCS, University of Toronto, Canada Abstract Internet users go

More information

#SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm

#SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm Proceedings of the Thirtieth International Florida Artificial Intelligence Research Society Conference #SarcasmDetection Is Soooo General! Towards a Domain-Independent Approach for Detecting Sarcasm Natalie

More information

The Lowest Form of Wit: Identifying Sarcasm in Social Media

The Lowest Form of Wit: Identifying Sarcasm in Social Media 1 The Lowest Form of Wit: Identifying Sarcasm in Social Media Saachi Jain, Vivian Hsu Abstract Sarcasm detection is an important problem in text classification and has many applications in areas such as

More information

How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text

How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text How Do Cultural Differences Impact the Quality of Sarcasm Annotation?: A Case Study of Indian Annotators and American Text Aditya Joshi 1,2,3 Pushpak Bhattacharyya 1 Mark Carman 2 Jaya Saraswati 1 Rajita

More information

WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs

WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs Abstract Large numbers of TV channels are available to TV consumers

More information

arxiv: v1 [cs.cl] 8 Jun 2018

arxiv: v1 [cs.cl] 8 Jun 2018 #SarcasmDetection is soooo general! Towards a Domain-Independent Approach for Detecting Sarcasm Natalie Parde and Rodney D. Nielsen Department of Computer Science and Engineering University of North Texas

More information

Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset

Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset Ricardo Malheiro, Renato Panda, Paulo Gomes, Rui Paiva CISUC Centre for Informatics and Systems of the University of Coimbra {rsmal,

More information

An Introduction to Deep Image Aesthetics

An Introduction to Deep Image Aesthetics Seminar in Laboratory of Visual Intelligence and Pattern Analysis (VIPA) An Introduction to Deep Image Aesthetics Yongcheng Jing College of Computer Science and Technology Zhejiang University Zhenchuan

More information

Temporal patterns of happiness and sarcasm detection in social media (Twitter)

Temporal patterns of happiness and sarcasm detection in social media (Twitter) Temporal patterns of happiness and sarcasm detection in social media (Twitter) Pradeep Kumar NPSO Innovation Day November 22, 2017 Our Data Science Team Patricia Prüfer Pradeep Kumar Marcia den Uijl Next

More information

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Kate Park katepark@stanford.edu Annie Hu anniehu@stanford.edu Natalie Muenster ncm000@stanford.edu Abstract We propose detecting

More information

Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections

Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections 1/23 Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections Rudolf Mayer, Andreas Rauber Vienna University of Technology {mayer,rauber}@ifs.tuwien.ac.at Robert Neumayer

More information

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG? WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG? NICHOLAS BORG AND GEORGE HOKKANEN Abstract. The possibility of a hit song prediction algorithm is both academically interesting and industry motivated.

More information

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues Kate Park, Annie Hu, Natalie Muenster Email: katepark@stanford.edu, anniehu@stanford.edu, ncm000@stanford.edu Abstract We propose

More information

ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews

ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews ICWSM A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews Oren Tsur Institute of Computer Science The Hebrew University Jerusalem, Israel oren@cs.huji.ac.il

More information

A COMPREHENSIVE STUDY ON SARCASM DETECTION TECHNIQUES IN SENTIMENT ANALYSIS

A COMPREHENSIVE STUDY ON SARCASM DETECTION TECHNIQUES IN SENTIMENT ANALYSIS Volume 118 No. 22 2018, 433-442 ISSN: 1314-3395 (on-line version) url: http://acadpubl.eu/hub ijpam.eu A COMPREHENSIVE STUDY ON SARCASM DETECTION TECHNIQUES IN SENTIMENT ANALYSIS 1 Sindhu. C, 2 G.Vadivu,

More information

LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally

LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally LT3: Sentiment Analysis of Figurative Tweets: piece of cake #NotReally Cynthia Van Hee, Els Lefever and Véronique hoste LT 3, Language and Translation Technology Team Department of Translation, Interpreting

More information

Basic Natural Language Processing

Basic Natural Language Processing Basic Natural Language Processing Why NLP? Understanding Intent Search Engines Question Answering Azure QnA, Bots, Watson Digital Assistants Cortana, Siri, Alexa Translation Systems Azure Language Translation,

More information

Automatic Detection of Sarcasm in BBS Posts Based on Sarcasm Classification

Automatic Detection of Sarcasm in BBS Posts Based on Sarcasm Classification Web 1,a) 2,b) 2,c) Web Web 8 ( ) Support Vector Machine (SVM) F Web Automatic Detection of Sarcasm in BBS Posts Based on Sarcasm Classification Fumiya Isono 1,a) Suguru Matsuyoshi 2,b) Fumiyo Fukumoto

More information

Music Mood. Sheng Xu, Albert Peyton, Ryan Bhular

Music Mood. Sheng Xu, Albert Peyton, Ryan Bhular Music Mood Sheng Xu, Albert Peyton, Ryan Bhular What is Music Mood A psychological & musical topic Human emotions conveyed in music can be comprehended from two aspects: Lyrics Music Factors that affect

More information

Can Song Lyrics Predict Genre? Danny Diekroeger Stanford University

Can Song Lyrics Predict Genre? Danny Diekroeger Stanford University Can Song Lyrics Predict Genre? Danny Diekroeger Stanford University danny1@stanford.edu 1. Motivation and Goal Music has long been a way for people to express their emotions. And because we all have a

More information

Lyric-Based Music Mood Recognition

Lyric-Based Music Mood Recognition Lyric-Based Music Mood Recognition Emil Ian V. Ascalon, Rafael Cabredo De La Salle University Manila, Philippines emil.ascalon@yahoo.com, rafael.cabredo@dlsu.edu.ph Abstract: In psychology, emotion is

More information

Automatic Music Clustering using Audio Attributes

Automatic Music Clustering using Audio Attributes Automatic Music Clustering using Audio Attributes Abhishek Sen BTech (Electronics) Veermata Jijabai Technological Institute (VJTI), Mumbai, India abhishekpsen@gmail.com Abstract Music brings people together,

More information

Supervised Learning in Genre Classification

Supervised Learning in Genre Classification Supervised Learning in Genre Classification Introduction & Motivation Mohit Rajani and Luke Ekkizogloy {i.mohit,luke.ekkizogloy}@gmail.com Stanford University, CS229: Machine Learning, 2009 Now that music

More information

Neural Network Predicating Movie Box Office Performance

Neural Network Predicating Movie Box Office Performance Neural Network Predicating Movie Box Office Performance Alex Larson ECE 539 Fall 2013 Abstract The movie industry is a large part of modern day culture. With the rise of websites like Netflix, where people

More information

Music Genre Classification and Variance Comparison on Number of Genres

Music Genre Classification and Variance Comparison on Number of Genres Music Genre Classification and Variance Comparison on Number of Genres Miguel Francisco, miguelf@stanford.edu Dong Myung Kim, dmk8265@stanford.edu 1 Abstract In this project we apply machine learning techniques

More information

Implementation of Emotional Features on Satire Detection

Implementation of Emotional Features on Satire Detection Implementation of Emotional Features on Satire Detection Pyae Phyo Thu1, Than Nwe Aung2 1 University of Computer Studies, Mandalay, Patheingyi Mandalay 1001, Myanmar pyaephyothu149@gmail.com 2 University

More information

REPORT DOCUMENTATION PAGE

REPORT DOCUMENTATION PAGE REPORT DOCUMENTATION PAGE Form Approved OMB NO. 0704-0188 The public reporting burden for this collection of information is estimated to average 1 hour per response, including the time for reviewing instructions,

More information

SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1

SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1 SARCASM DETECTION IN SENTIMENT ANALYSIS Dr. Kalpesh H. Wandra 1, Mehul Barot 2 1 Director (Academic Administration) Babaria Institute of Technology, 2 Research Scholar, C.U.Shah University Abstract Sentiment

More information

KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection

KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection Luise Dürlich Friedrich-Alexander Universität Erlangen-Nürnberg / Germany luise.duerlich@fau.de Abstract This paper describes the

More information

Sarcasm as Contrast between a Positive Sentiment and Negative Situation

Sarcasm as Contrast between a Positive Sentiment and Negative Situation Sarcasm as Contrast between a Positive Sentiment and Negative Situation Ellen Riloff, Ashequl Qadir, Prafulla Surve, Lalindra De Silva, Nathan Gilbert, Ruihong Huang School Of Computing University of Utah

More information

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC

International Journal of Advance Engineering and Research Development MUSICAL INSTRUMENT IDENTIFICATION AND STATUS FINDING WITH MFCC Scientific Journal of Impact Factor (SJIF): 5.71 International Journal of Advance Engineering and Research Development Volume 5, Issue 04, April -2018 e-issn (O): 2348-4470 p-issn (P): 2348-6406 MUSICAL

More information

Measuring #GamerGate: A Tale of Hate, Sexism, and Bullying

Measuring #GamerGate: A Tale of Hate, Sexism, and Bullying Measuring #GamerGate: A Tale of Hate, Sexism, and Bullying Despoina Chatzakou, Nicolas Kourtellis, Jeremy Blackburn Emiliano De Cristofaro, Gianluca Stringhini, Athena Vakali Aristotle University of Thessaloniki

More information

arxiv: v1 [cs.ir] 16 Jan 2019

arxiv: v1 [cs.ir] 16 Jan 2019 It s Only Words And Words Are All I Have Manash Pratim Barman 1, Kavish Dahekar 2, Abhinav Anshuman 3, and Amit Awekar 4 1 Indian Institute of Information Technology, Guwahati 2 SAP Labs, Bengaluru 3 Dell

More information

Creating Mindmaps of Documents

Creating Mindmaps of Documents Creating Mindmaps of Documents Using an Example of a News Surveillance System Oskar Gross Hannu Toivonen Teemu Hynonen Esther Galbrun February 6, 2011 Outline Motivation Bisociation Network Tpf-Idf-Tpu

More information

Musical Hit Detection

Musical Hit Detection Musical Hit Detection CS 229 Project Milestone Report Eleanor Crane Sarah Houts Kiran Murthy December 12, 2008 1 Problem Statement Musical visualizers are programs that process audio input in order to

More information

CS229 Project Report Polyphonic Piano Transcription

CS229 Project Report Polyphonic Piano Transcription CS229 Project Report Polyphonic Piano Transcription Mohammad Sadegh Ebrahimi Stanford University Jean-Baptiste Boin Stanford University sadegh@stanford.edu jbboin@stanford.edu 1. Introduction In this project

More information

저작권법에따른이용자의권리는위의내용에의하여영향을받지않습니다.

저작권법에따른이용자의권리는위의내용에의하여영향을받지않습니다. 저작자표시 - 비영리 - 동일조건변경허락 2.0 대한민국 이용자는아래의조건을따르는경우에한하여자유롭게 이저작물을복제, 배포, 전송, 전시, 공연및방송할수있습니다. 이차적저작물을작성할수있습니다. 다음과같은조건을따라야합니다 : 저작자표시. 귀하는원저작자를표시하여야합니다. 비영리. 귀하는이저작물을영리목적으로이용할수없습니다. 동일조건변경허락. 귀하가이저작물을개작, 변형또는가공했을경우에는,

More information

Semi-supervised Musical Instrument Recognition

Semi-supervised Musical Instrument Recognition Semi-supervised Musical Instrument Recognition Master s Thesis Presentation Aleksandr Diment 1 1 Tampere niversity of Technology, Finland Supervisors: Adj.Prof. Tuomas Virtanen, MSc Toni Heittola 17 May

More information

Sarcasm in Social Media. sites. This research topic posed an interesting question. Sarcasm, being heavily conveyed

Sarcasm in Social Media. sites. This research topic posed an interesting question. Sarcasm, being heavily conveyed Tekin and Clark 1 Michael Tekin and Daniel Clark Dr. Schlitz Structures of English 5/13/13 Sarcasm in Social Media Introduction The research goals for this project were to figure out the different methodologies

More information

Using Genre Classification to Make Content-based Music Recommendations

Using Genre Classification to Make Content-based Music Recommendations Using Genre Classification to Make Content-based Music Recommendations Robbie Jones (rmjones@stanford.edu) and Karen Lu (karenlu@stanford.edu) CS 221, Autumn 2016 Stanford University I. Introduction Our

More information

Determining sentiment in citation text and analyzing its impact on the proposed ranking index

Determining sentiment in citation text and analyzing its impact on the proposed ranking index Determining sentiment in citation text and analyzing its impact on the proposed ranking index Souvick Ghosh 1, Dipankar Das 1 and Tanmoy Chakraborty 2 1 Jadavpur University, Kolkata 700032, WB, India {

More information

An extensive Survey On Sarcasm Detection Using Various Classifiers

An extensive Survey On Sarcasm Detection Using Various Classifiers Volume 119 No. 12 2018, 13183-13187 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu An extensive Survey On Sarcasm Detection Using Various Classifiers K.R.Jansi* Department of Computer

More information

UWaterloo at SemEval-2017 Task 7: Locating the Pun Using Syntactic Characteristics and Corpus-based Metrics

UWaterloo at SemEval-2017 Task 7: Locating the Pun Using Syntactic Characteristics and Corpus-based Metrics UWaterloo at SemEval-2017 Task 7: Locating the Pun Using Syntactic Characteristics and Corpus-based Metrics Olga Vechtomova University of Waterloo Waterloo, ON, Canada ovechtom@uwaterloo.ca Abstract The

More information

Enabling editors through machine learning

Enabling editors through machine learning Meta Follow Meta is an AI company that provides academics & innovation-driven companies with powerful views of t Dec 9, 2016 9 min read Enabling editors through machine learning Examining the data science

More information

arxiv:submit/ [cs.cv] 8 Aug 2016

arxiv:submit/ [cs.cv] 8 Aug 2016 Detecting Sarcasm in Multimodal Social Platforms arxiv:submit/1633907 [cs.cv] 8 Aug 2016 ABSTRACT Rossano Schifanella University of Turin Corso Svizzera 185 10149, Turin, Italy schifane@di.unito.it Sarcasm

More information

A repetition-based framework for lyric alignment in popular songs

A repetition-based framework for lyric alignment in popular songs A repetition-based framework for lyric alignment in popular songs ABSTRACT LUONG Minh Thang and KAN Min Yen Department of Computer Science, School of Computing, National University of Singapore We examine

More information

SARCASM DETECTION IN SENTIMENT ANALYSIS

SARCASM DETECTION IN SENTIMENT ANALYSIS SARCASM DETECTION IN SENTIMENT ANALYSIS Shruti Kaushik 1, Prof. Mehul P. Barot 2 1 Research Scholar, CE-LDRP-ITR, KSV University Gandhinagar, Gujarat, India 2 Lecturer, CE-LDRP-ITR, KSV University Gandhinagar,

More information

Text Analysis. Language is complex. The goal of text analysis is to strip away some of that complexity to extract meaning.

Text Analysis. Language is complex. The goal of text analysis is to strip away some of that complexity to extract meaning. Text Analysis Language is complex. The goal of text analysis is to strip away some of that complexity to extract meaning. Image Source How to talk like a Democrat (or a Republican) Reddit N-gram Viewer:

More information

Melody classification using patterns

Melody classification using patterns Melody classification using patterns Darrell Conklin Department of Computing City University London United Kingdom conklin@city.ac.uk Abstract. A new method for symbolic music classification is proposed,

More information

Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers

Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers Amal Htait, Sebastien Fournier and Patrice Bellot Aix Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,13397,

More information

Sentiment Analysis. Andrea Esuli

Sentiment Analysis. Andrea Esuli Sentiment Analysis Andrea Esuli What is Sentiment Analysis? What is Sentiment Analysis? Sentiment analysis and opinion mining is the field of study that analyzes people s opinions, sentiments, evaluations,

More information

Introduction to Sentiment Analysis. Text Analytics - Andrea Esuli

Introduction to Sentiment Analysis. Text Analytics - Andrea Esuli Introduction to Sentiment Analysis Text Analytics - Andrea Esuli What is Sentiment Analysis? What is Sentiment Analysis? Sentiment analysis and opinion mining is the field of study that analyzes people

More information

Music Genre Classification

Music Genre Classification Music Genre Classification chunya25 Fall 2017 1 Introduction A genre is defined as a category of artistic composition, characterized by similarities in form, style, or subject matter. [1] Some researchers

More information

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models Aric Bartle (abartle@stanford.edu) December 14, 2012 1 Background The field of composer recognition has

More information

First Stage of an Automated Content-Based Citation Analysis Study: Detection of Citation Sentences 1

First Stage of an Automated Content-Based Citation Analysis Study: Detection of Citation Sentences 1 First Stage of an Automated Content-Based Citation Analysis Study: Detection of Citation Sentences 1 Zehra Taşkın *, Umut Al * and Umut Sezen ** * {ztaskin; umutal}@hacettepe.edu.tr Department of Information

More information

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

Music Emotion Recognition. Jaesung Lee. Chung-Ang University Music Emotion Recognition Jaesung Lee Chung-Ang University Introduction Searching Music in Music Information Retrieval Some information about target music is available Query by Text: Title, Artist, or

More information

MUSI-6201 Computational Music Analysis

MUSI-6201 Computational Music Analysis MUSI-6201 Computational Music Analysis Part 9.1: Genre Classification alexander lerch November 4, 2015 temporal analysis overview text book Chapter 8: Musical Genre, Similarity, and Mood (pp. 151 155)

More information

Detecting Musical Key with Supervised Learning

Detecting Musical Key with Supervised Learning Detecting Musical Key with Supervised Learning Robert Mahieu Department of Electrical Engineering Stanford University rmahieu@stanford.edu Abstract This paper proposes and tests performance of two different

More information

Understanding People in Low Resourced Languages

Understanding People in Low Resourced Languages Understanding People in Low Resourced Languages Thesis submitted in partial fulfillment of the requirements for the degree of Masters of Science in Computer Science by Research by Sahil Swami 201302071

More information

Topics in Computer Music Instrument Identification. Ioanna Karydi

Topics in Computer Music Instrument Identification. Ioanna Karydi Topics in Computer Music Instrument Identification Ioanna Karydi Presentation overview What is instrument identification? Sound attributes & Timbre Human performance The ideal algorithm Selected approaches

More information

Feature-Based Analysis of Haydn String Quartets

Feature-Based Analysis of Haydn String Quartets Feature-Based Analysis of Haydn String Quartets Lawson Wong 5/5/2 Introduction When listening to multi-movement works, amateur listeners have almost certainly asked the following situation : Am I still

More information

Sentiment Aggregation using ConceptNet Ontology

Sentiment Aggregation using ConceptNet Ontology Sentiment Aggregation using ConceptNet Ontology Subhabrata Mukherjee Sachindra Joshi IBM Research - India 7th International Joint Conference on Natural Language Processing (IJCNLP 2013), Nagoya, Japan

More information

Universität Bamberg Angewandte Informatik. Seminar KI: gestern, heute, morgen. We are Humor Beings. Understanding and Predicting visual Humor

Universität Bamberg Angewandte Informatik. Seminar KI: gestern, heute, morgen. We are Humor Beings. Understanding and Predicting visual Humor Universität Bamberg Angewandte Informatik Seminar KI: gestern, heute, morgen We are Humor Beings. Understanding and Predicting visual Humor by Daniel Tremmel 18. Februar 2017 advised by Professor Dr. Ute

More information

Are Word Embedding-based Features Useful for Sarcasm Detection?

Are Word Embedding-based Features Useful for Sarcasm Detection? Are Word Embedding-based Features Useful for Sarcasm Detection? Aditya Joshi 1,2,3 Vaibhav Tripathi 1 Kevin Patel 1 Pushpak Bhattacharyya 1 Mark Carman 2 1 Indian Institute of Technology Bombay, India

More information

Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing

Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing Irony and Sarcasm: Corpus Generation and Analysis Using Crowdsourcing Elena Filatova Computer and Information Science Department Fordham University filatova@cis.fordham.edu Abstract The ability to reliably

More information

MPEG has been established as an international standard

MPEG has been established as an international standard 1100 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, VOL. 9, NO. 7, OCTOBER 1999 Fast Extraction of Spatially Reduced Image Sequences from MPEG-2 Compressed Video Junehwa Song, Member,

More information

Lyrics Classification using Naive Bayes

Lyrics Classification using Naive Bayes Lyrics Classification using Naive Bayes Dalibor Bužić *, Jasminka Dobša ** * College for Information Technologies, Klaićeva 7, Zagreb, Croatia ** Faculty of Organization and Informatics, Pavlinska 2, Varaždin,

More information

Outline. Why do we classify? Audio Classification

Outline. Why do we classify? Audio Classification Outline Introduction Music Information Retrieval Classification Process Steps Pitch Histograms Multiple Pitch Detection Algorithm Musical Genre Classification Implementation Future Work Why do we classify

More information

A combination of approaches to solve Task How Many Ratings? of the KDD CUP 2007

A combination of approaches to solve Task How Many Ratings? of the KDD CUP 2007 A combination of approaches to solve Tas How Many Ratings? of the KDD CUP 2007 Jorge Sueiras C/ Arequipa +34 9 382 45 54 orge.sueiras@neo-metrics.com Daniel Vélez C/ Arequipa +34 9 382 45 54 José Luis

More information

Hidden Markov Model based dance recognition

Hidden Markov Model based dance recognition Hidden Markov Model based dance recognition Dragutin Hrenek, Nenad Mikša, Robert Perica, Pavle Prentašić and Boris Trubić University of Zagreb, Faculty of Electrical Engineering and Computing Unska 3,

More information

arxiv: v2 [cs.cl] 20 Sep 2016

arxiv: v2 [cs.cl] 20 Sep 2016 A Automatic Sarcasm Detection: A Survey ADITYA JOSHI, IITB-Monash Research Academy PUSHPAK BHATTACHARYYA, Indian Institute of Technology Bombay MARK J CARMAN, Monash University arxiv:1602.03426v2 [cs.cl]

More information

Singer Traits Identification using Deep Neural Network

Singer Traits Identification using Deep Neural Network Singer Traits Identification using Deep Neural Network Zhengshan Shi Center for Computer Research in Music and Acoustics Stanford University kittyshi@stanford.edu Abstract The author investigates automatic

More information

1) New Paths to New Machine Learning Science. 2) How an Unruly Mob Almost Stole. Jeff Howbert University of Washington

1) New Paths to New Machine Learning Science. 2) How an Unruly Mob Almost Stole. Jeff Howbert University of Washington 1) New Paths to New Machine Learning Science 2) How an Unruly Mob Almost Stole the Grand Prize at the Last Moment Jeff Howbert University of Washington February 4, 2014 Netflix Viewing Recommendations

More information

Composer Style Attribution

Composer Style Attribution Composer Style Attribution Jacqueline Speiser, Vishesh Gupta Introduction Josquin des Prez (1450 1521) is one of the most famous composers of the Renaissance. Despite his fame, there exists a significant

More information

Automatic Music Genre Classification

Automatic Music Genre Classification Automatic Music Genre Classification Nathan YongHoon Kwon, SUNY Binghamton Ingrid Tchakoua, Jackson State University Matthew Pietrosanu, University of Alberta Freya Fu, Colorado State University Yue Wang,

More information

Understanding Book Popularity on Goodreads

Understanding Book Popularity on Goodreads Understanding Book Popularity on Goodreads Suman Kalyan Maity sumankalyan.maity@ cse.iitkgp.ernet.in Ayush Kumar ayush235317@gmail.com Ankan Mullick Bing Microsoft India ankan.mullick@microsoft.com Vishnu

More information

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes hello Jay Biernat Third author University of Rochester University of Rochester Affiliation3 words jbiernat@ur.rochester.edu author3@ismir.edu

More information

Automatic Polyphonic Music Composition Using the EMILE and ABL Grammar Inductors *

Automatic Polyphonic Music Composition Using the EMILE and ABL Grammar Inductors * Automatic Polyphonic Music Composition Using the EMILE and ABL Grammar Inductors * David Ortega-Pacheco and Hiram Calvo Centro de Investigación en Computación, Instituto Politécnico Nacional, Av. Juan

More information

Some Experiments in Humour Recognition Using the Italian Wikiquote Collection

Some Experiments in Humour Recognition Using the Italian Wikiquote Collection Some Experiments in Humour Recognition Using the Italian Wikiquote Collection Davide Buscaldi and Paolo Rosso Dpto. de Sistemas Informáticos y Computación (DSIC), Universidad Politécnica de Valencia, Spain

More information

Dynamic Allocation of Crowd Contributions for Sentiment Analysis during the 2016 U.S. Presidential Election

Dynamic Allocation of Crowd Contributions for Sentiment Analysis during the 2016 U.S. Presidential Election Dynamic Allocation of Crowd Contributions for Sentiment Analysis during the 2016 U.S. Presidential Election Mehrnoosh Sameki, Mattia Gentil, Kate K. Mays, Lei Guo, and Margrit Betke Boston University Abstract

More information

DISCOURSE ANALYSIS OF LYRIC AND LYRIC-BASED CLASSIFICATION OF MUSIC

DISCOURSE ANALYSIS OF LYRIC AND LYRIC-BASED CLASSIFICATION OF MUSIC DISCOURSE ANALYSIS OF LYRIC AND LYRIC-BASED CLASSIFICATION OF MUSIC Jiakun Fang 1 David Grunberg 1 Diane Litman 2 Ye Wang 1 1 School of Computing, National University of Singapore, Singapore 2 Department

More information

Enhancing Music Maps

Enhancing Music Maps Enhancing Music Maps Jakob Frank Vienna University of Technology, Vienna, Austria http://www.ifs.tuwien.ac.at/mir frank@ifs.tuwien.ac.at Abstract. Private as well as commercial music collections keep growing

More information

Lyric-based Sentiment Polarity Classification of Thai Songs

Lyric-based Sentiment Polarity Classification of Thai Songs Lyric-based Sentiment Polarity Classification of Thai Songs Chutimet Srinilta, Wisuwat Sunhem, Suchat Tungjitnob, Saruta Thasanthiah, and Supawit Vatathanavaro Abstract Song sentiment polarity provides

More information

A Framework for Segmentation of Interview Videos

A Framework for Segmentation of Interview Videos A Framework for Segmentation of Interview Videos Omar Javed, Sohaib Khan, Zeeshan Rasheed, Mubarak Shah Computer Vision Lab School of Electrical Engineering and Computer Science University of Central Florida

More information

INGEOTEC at IberEval 2018 Task HaHa: µtc and EvoMSA to Detect and Score Humor in Texts

INGEOTEC at IberEval 2018 Task HaHa: µtc and EvoMSA to Detect and Score Humor in Texts INGEOTEC at IberEval 2018 Task HaHa: µtc and EvoMSA to Detect and Score Humor in Texts José Ortiz-Bejar 1,3, Vladimir Salgado 3, Mario Graff 2,3, Daniela Moctezuma 3,4, Sabino Miranda-Jiménez 2,3, and

More information

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng

Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng Melody Extraction from Generic Audio Clips Thaminda Edirisooriya, Hansohl Kim, Connie Zeng Introduction In this project we were interested in extracting the melody from generic audio files. Due to the

More information

A Fast Alignment Scheme for Automatic OCR Evaluation of Books

A Fast Alignment Scheme for Automatic OCR Evaluation of Books A Fast Alignment Scheme for Automatic OCR Evaluation of Books Ismet Zeki Yalniz, R. Manmatha Multimedia Indexing and Retrieval Group Dept. of Computer Science, University of Massachusetts Amherst, MA,

More information

Automatic Sarcasm Detection: A Survey

Automatic Sarcasm Detection: A Survey Automatic Sarcasm Detection: A Survey Aditya Joshi 1,2,3 Pushpak Bhattacharyya 2 Mark James Carman 3 1 IITB-Monash Research Academy, India 2 IIT Bombay, India, 3 Monash University, Australia {adityaj,pb}@cse.iitb.ac.in,

More information

MUSICAL MOODS: A MASS PARTICIPATION EXPERIMENT FOR AFFECTIVE CLASSIFICATION OF MUSIC

MUSICAL MOODS: A MASS PARTICIPATION EXPERIMENT FOR AFFECTIVE CLASSIFICATION OF MUSIC 12th International Society for Music Information Retrieval Conference (ISMIR 2011) MUSICAL MOODS: A MASS PARTICIPATION EXPERIMENT FOR AFFECTIVE CLASSIFICATION OF MUSIC Sam Davies, Penelope Allen, Mark

More information

Formalizing Irony with Doxastic Logic

Formalizing Irony with Doxastic Logic Formalizing Irony with Doxastic Logic WANG ZHONGQUAN National University of Singapore April 22, 2015 1 Introduction Verbal irony is a fundamental rhetoric device in human communication. It is often characterized

More information