Some Experiments in Humour Recognition Using the Italian Wikiquote Collection

Similar documents
Affect-based Features for Humour Recognition

Computational Laughing: Automatic Recognition of Humorous One-liners

Natural language s creative genres are traditionally considered to be outside the

Automatic Polyphonic Music Composition Using the EMILE and ABL Grammar Inductors *

Evaluating Humorous Features: Towards a Humour Taxonomy

Automatic Detection of Sarcasm in BBS Posts Based on Sarcasm Classification

Humorist Bot: Bringing Computational Humour in a Chat-Bot System

An Impact Analysis of Features in a Classification Approach to Irony Detection in Product Reviews

Bi-Modal Music Emotion Recognition: Novel Lyrical Features and Dataset

Mood Tracking of Radio Station Broadcasts

Multimodal Music Mood Classification Framework for Christian Kokborok Music

The final publication is available at

Music Information Retrieval with Temporal Features and Timbre

Melody classification using patterns

STRING QUARTET CLASSIFICATION WITH MONOPHONIC MODELS

First Stage of an Automated Content-Based Citation Analysis Study: Detection of Citation Sentences 1

Mining Subjective Knowledge from Customer Reviews: A Specific Case of Irony Detection

STYLE RECOGNITION THROUGH STATISTICAL EVENT MODELS

Lyric-Based Music Mood Recognition

Humor recognition using deep learning

Exploring Relationships between Audio Features and Emotion in Music

Homonym Detection For Humor Recognition In Short Text

Instrument Recognition in Polyphonic Mixtures Using Spectral Envelopes

Sarcasm Detection in Text: Design Document

Introduction to Natural Language Processing This week & next week: Classification Sentiment Lexicons

Document downloaded from: This paper must be cited as:

Toward Multi-Modal Music Emotion Classification

Speech Recognition Combining MFCCs and Image Features

Automatic Rhythmic Notation from Single Voice Audio Sources

A Pattern Recognition Approach for Melody Track Selection in MIDI Files

Sentiment Analysis. Andrea Esuli

Robust Radio Broadcast Monitoring Using a Multi-Band Spectral Entropy Signature

Introduction to Sentiment Analysis. Text Analytics - Andrea Esuli

Lyrics Classification using Naive Bayes

Finding Sarcasm in Reddit Postings: A Deep Learning Approach

Humor: Prosody Analysis and Automatic Recognition for F * R * I * E * N * D * S *

Music Understanding and the Future of Music

Stierlitz Meets SVM: Humor Detection in Russian

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

TREE MODEL OF SYMBOLIC MUSIC FOR TONALITY GUESSING

INTER GENRE SIMILARITY MODELLING FOR AUTOMATIC MUSIC GENRE CLASSIFICATION

Computational Models for Incongruity Detection in Humour

Implementation of Emotional Features on Satire Detection

Music Emotion Recognition. Jaesung Lee. Chung-Ang University

Composer Style Attribution

Categorization of ICMR Using Feature Extraction Strategy And MIR With Ensemble Learning

Linguistic Features of Humor in Academic Writing

Bilbo-Val: Automatic Identification of Bibliographical Zone in Papers

Identifying Humor in Reviews using Background Text Sources

Figurative Language Processing in Social Media: Humor Recognition and Irony Detection

Outline. Why do we classify? Audio Classification

Combination of Audio & Lyrics Features for Genre Classication in Digital Audio Collections

Automatic Laughter Detection

Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest

Determining sentiment in citation text and analyzing its impact on the proposed ranking index

A New Scheme for Citation Classification based on Convolutional Neural Networks

Research & Development. White Paper WHP 232. A Large Scale Experiment for Mood-based Classification of TV Programmes BRITISH BROADCASTING CORPORATION

BayesianBand: Jam Session System based on Mutual Prediction by User and System

ALF-200k: Towards Extensive Multimodal Analyses of Music Tracks and Playlists

Laughbot: Detecting Humor in Spoken Language with Language and Audio Cues

Identifying Related Work and Plagiarism by Citation Analysis

Automatically Creating Word-Play Jokes in Japanese

arxiv: v1 [cs.cl] 26 Jun 2015

WHAT MAKES FOR A HIT POP SONG? WHAT MAKES FOR A POP SONG?

Exploring Citations for Conflict of Interest Detection in Peer Review System

MindMouse. This project is written in C++ and uses the following Libraries: LibSvm, kissfft, BOOST File System, and Emotiv Research Edition SDK.

Music Mood Classification - an SVM based approach. Sebastian Napiorkowski

Universität Bamberg Angewandte Informatik. Seminar KI: gestern, heute, morgen. We are Humor Beings. Understanding and Predicting visual Humor

Detecting Musical Key with Supervised Learning

Improving Frame Based Automatic Laughter Detection

Composer Identification of Digital Audio Modeling Content Specific Features Through Markov Models

Lyrical Features of Popular Music of the 20th and 21st Centuries: Distinguishing by Decade

Music Mood. Sheng Xu, Albert Peyton, Ryan Bhular

A Large Scale Experiment for Mood-Based Classification of TV Programmes

Singer Traits Identification using Deep Neural Network

DISCOURSE ANALYSIS OF LYRIC AND LYRIC-BASED CLASSIFICATION OF MUSIC

Detecting Hoaxes, Frauds and Deception in Writing Style Online

Overview of the SBS 2016 Mining Track

Multilabel Subject-Based Classification of Poetry

Regression Model for Politeness Estimation Trained on Examples

Privacy Level Indicating Data Leakage Prevention System

A CLASSIFICATION APPROACH TO MELODY TRANSCRIPTION

LEARNING SPECTRAL FILTERS FOR SINGLE- AND MULTI-LABEL CLASSIFICATION OF MUSICAL INSTRUMENTS. Patrick Joseph Donnelly

Lyric-based Sentiment Polarity Classification of Thai Songs

Improving MeSH Classification of Biomedical Articles using Citation Contexts

Authorship Verification with the Minmax Metric

Normalized Cumulative Spectral Distribution in Music

Name Identification of People in News Video by Face Matching

Automatic Laughter Detection

MINING THE CORRELATION BETWEEN LYRICAL AND AUDIO FEATURES AND THE EMERGENCE OF MOOD

Projektseminar: Sentimentanalyse Dozenten: Michael Wiegand und Marc Schulder

Exploring the Design Space of Symbolic Music Genre Classification Using Data Mining Techniques Ortiz-Arroyo, Daniel; Kofod, Christian

Identifying Related Documents For Research Paper Recommender By CPA and COA

SemEval-2015 Task 11: Sentiment Analysis of Figurative Language in Twitter

A QUERY BY EXAMPLE MUSIC RETRIEVAL ALGORITHM

Sean O Driscoll x

A personalized TV Guide System

UC San Diego UC San Diego Previously Published Works

Headings: Machine Learning. Text Mining. Music Emotion Recognition

Mood Classification Using Lyrics and Audio: A Case-Study in Greek Music

Transcription:

Some Experiments in Humour Recognition Using the Italian Wikiquote Collection Davide Buscaldi and Paolo Rosso Dpto. de Sistemas Informáticos y Computación (DSIC), Universidad Politécnica de Valencia, Spain {dbuscaldi,prosso}@dsic.upv.es Abstract. In this paper we present some results obtained in humour classification over a corpus of Italian quotations manually extracted and tagged from the Wikiquote project. The experiments were carried out using both a multinomial Naïve Bayes classifier and a Support Vector Machine (SVM). The considered features range from single words to n- grams and sentence length. The obtained results show that it is possible to identify the funny quotes even with the simplest features (bag of words); the bayesian classifier performed better than the SVM. However, the size of the corpus size is too small to support definitive assertions. 1 Introduction Nowadays, the discipline of Natural Language Processing (NLP) embraces a large quantity of specific tasks, aimed at the solution of practical problems related to the access of human users to machine-readable textual information. For instance, thanks to Machine Translation, people can read and understand documents that are written in a language they do not know; Information Retrieval techniques allow to find almost immediately some kind of information on the web or in a digital collection. Less prosaic tasks, related to emotional aspects of natural language, have, until now, obtained less attention by the NLP research community, despite their close correlation to the understanding of human language. One of such tasks is the automatic recognition of humour. In the words of the psychologist Edward De Bono[1]: Humor is by far the most significant activity of the human brain. Why has it been so neglected by traditional philosophers, psychologists and information scientists? The nature of humour is elusive, it is expressed in many different forms and styles; for instance, the amusing elements of jokes are not the same of irony or satire. The sense of humour is also particularly subjective. All these characteristics were considered to represent a major obstacle to process it in an automated way. The work by Mihalcea and Strapparava [2,3] in the classification of one-liners mined these beliefs, demonstrating that it is possible to apply computational approaches to the automatic recognition and use of humour. F. Masulli, S. Mitra, and G. Pasi (Eds.): WILF 2007, LNAI 4578, pp. 464 468, 2007. c Springer-Verlag Berlin Heidelberg 2007

Some Experiments in Humour Recognition 465 In this paper we investigated the use of Wikiquote 1 as a corpus for automatic humour recognition in Italian. Wikiquote is a section of Wikipedia 2 that stores famous quotes from a plethora of sources, from movies to writers, from anchormen to proverbs. We manually annotated a part of the quotes as humourous or not, and we used this corpus for some experiments. We used a Multinomial Naïve Bayes classifier and a Support Vector Machine (SVM) [4], as in [2]. In the following Section we describe how we selected the quotes for the corpus and its characteristics. In Section 3 we describe the experiments carried out and the obtained results. 2 Corpus Construction The Italian Wikiquote currently contains about 4, 000 pages of quotations, aphorisms and proverbs. We decided to include only the quotations with an author assigned and Italian proverbs, that is, we excluded anonymous citations and phrases extracted from movies, television shows and category pages (for instance, Category:Love). A quantity of data was also removed due to formatting issues. The quotations were extracted and presented to a human annotator by means of a simple Java interface (see Fig. 1). Fig. 1. Interface of the Wikiquote annotation tool The annotator had the options of skipping the quotation or the whole author, eventually adjust typos or remove exceeding informations (such as the source of the citation in Fig. 1), and label the quote as funny. 1 http://en.wikiquote.org 2 http://www.wikipedia.org

466 D. Buscaldi and P. Rosso The results of such processing is a corpus consisting of 1, 966 citations from 89 authors, of which 471 labeled as funny. For each quote we stored also the information about the author and its category. The amusing quotes include various types of humour: for instance, there are simple one-liners such as Per te sono ateo, ma per Dio sono una leale opposizione. ( To you I m an atheist; to God, I m the Loyal Opposition., by Woody Allen), and jokes such as Lo sa che io ho perduto due figli - Signora lei una donna piuttosto distratta ( You see, I lost two sons - Madame, you are quite a scatterbrain, byfabriziode Andrè). The corpus has been made publicly available in the web at the following direction: http://www.dsic.upv.es/~dbuscaldi/resources/emoticorpus.xml.bz2. 3 Experiments and Results The experiments were carried out using the Multinomial Naïve Bayes classifier of Weka [5] and the SVM light 3 implementation of SVM by Thorsten Joachims. The motivation of this choice is that these classifiers have been already used in the current state-of-the-art work on humour recognition [2]. Each classifier was evaluated using various sets of features: bag-of-words (the set of words in the quote, including stop-words), n-grams (from unigrams to trigrams), quotation length. For SVM we also considered using a linear and a polynomial (d =2)kernel and two weight schemes for features: binary and tf idf. The cross-validation method used in all the experiments was the leave-one-out. We used precision and recall as performance measures. Precision is the probability that a document predicted to be in class A truly belongs to this class. Recall is the probability that a document belonging to class A is classified into this class. We calculated also the F -measure, that is calculated as 2 p r/(p+r), where p is precision and r recall. In Table 1 we show the baselines, obtained by assigning to the whole collection all the same label, either Humorous or Not-humorous. Table 1. Baselines. BL H is the baseline obtained by assigning to all samples the Humourous label. BL N is the baseline obtained by assigning to all samples a Nonhumorous label. p H, r H and F H indicate precision, recall and F -measure over the Humorous set of samples, p N, r N and F N indicate the same for the Non-humorous samples. r O indicates overall recall. BL H 0.240 0.240 1.000 0.387 0.000 0.000 0.000 BL N 0.760 0.000 0.000 0.000 0.760 1.000 0.864 In Table 2 we show the results obtained with the Naïve Bayes classifier, using different sets of features. We used the author feature only in order to check the 3 http://www.cs.cornell.edu/people/tj/svm light/

Some Experiments in Humour Recognition 467 Table 2. Multinomial Naïve Bayes results. bow: bag-of-words features; n-grams: from unigrams to trigrams; n-grams +length: n-grams and sentence length; n-grams +author: n-grams and author name. bow 0.807 0.619 0.501 0.554 0.852 0.903 0.877 n-grams 0.788 0.556 0.584 0.570 0.867 0.853 0.860 n-grams +length 0.796 0.584 0.516 0.548 0.853 0.884 0.868 n-grams +author 0.870 0.724 0.737 0.730 0.917 0.912 0.914 importance of knowing the source of a quotation. Actually, humour classification should be blind with respect to the author s name. In Tables 2 and 4 we display the results obatined using, respectively, SVM with linear and polynomial kernels. Table 3. Results for SVM with linear kernel. bow: bag-of-words features; n-grams: from unigrams to trigrams; n-grams +length: n-grams and sentence length; tf idf: features weighted by means of the tf idf. bow 0.796 0.748 0.221 0.341 0.799 0.977 0.879 n-grams 0.789 0.721 0.197 0.310 0.794 0.976 0.876 n-grams +length 0.795 0.743 0.221 0.340 0.799 0.976 0.879 bow, tf idf 0.767 0.591 0.083 0.145 0.773 0.982 0.865 n-grams, tf idf 0.770 0.714 0.064 0.117 0.771 0.992 0.867 Table 4. Results for SVM with polynomial kernel. bow: bag-of-words features; n-grams: from unigrams to trigrams; tf idf: features weighted by means of the tf idf. bow 0.767 0.615 0.068 0.122 0.771 0.980 0.863 n-grams 0.758 0.417 0.021 0.040 0.763 0.991 0.862 bow, tf idf 0.761 1.000 0.002 0.004 0.761 1.000 0.864 n-grams, tf idf 0.760 0.333 0.002 0.004 0.761 0.999 0.864 Generally, the obtained results are in line with those obtained by [2] for a corpus of dimensions similar to our Wikiquote-based one. The results obtained with SVMs show that they obtained a very low recall over humorous quotes; we think this is due to the nature of the corpus, that contains three times nonhumorous quotes more than funny ones. Quite surprisingly, the best results were obtained with the simplest model, the Naïve Bayes with bag-of-words as features. In other words, this means that terminology is quite distinctive for humourous quotes. We suppose the n-grams features are more useful in other tasks where style is more important, such as authorship identification [6].

468 D. Buscaldi and P. Rosso 4 Conclusions and Further Work We built a corpus of 1966 citations in Italian, extracted from Wikiquote, where each citation has been labeled as funny or not. The corpus was used to perform experiments with the leave-one-out method, using a Naïve Bayes and a SVM classifier. The results show that it is actually possible to identify the humorous quotes even with simple features such as the bag-of-words of each sentence, and that the bayesian classifier performs better than the SVM one. However, the corpus is too small (about 10% the corpus used in [2]) and the results are not decisive. Further investigation will be conditioned by the acquisition of a larger corpus, possibly by working on the English edition of Wikiquote. Acknowledgements We would like to thank the TIN2006-15265-C06-04 research project for partially supporting this work. References 1. De Bono, E.: I am Right, You are Wrong: From This to the New Renaissance, From Rock Logic to Water Logic. Penguin (1991) 2. Mihalcea, R., Strapparava, C.: Computational Laughing: Automatic Recognition of Humorous One-liners. In: Proc. 27th Ann. Conf. Cognitive Science Soc (CogSci 05), Stresa, Italy, pp. 1513 1518 (2005) 3. Mihalcea, R., Strapparava, C.: Technologies That Make You Smile: Adding Humor to Text-Based Applications. IEEE Intelligent Systems 21(5), 33 39 (2006) 4. Joachims, T.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. In: Springer (ed.) Proc. 10th European Conf. on Machine Learning (ECML 98), pp. 137 142 (1998) 5. Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005) 6. Coyotl, R.M.: Villaseñor, L. In: Martínez-Trinidad, J.F., Carrasco Ochoa, J.A., Kittler, J. (eds.) CIARP 2006. LNCS, vol. 4225, pp. 844 853. Springer, Heidelberg (2006)