Measuring #GamerGate: A Tale of Hate, Sexism, and Bullying

Similar documents
Sarcasm Detection in Text: Design Document

World Journal of Engineering Research and Technology WJERT

Semantic Role Labeling of Emotions in Tweets. Saif Mohammad, Xiaodan Zhu, and Joel Martin! National Research Council Canada!

The Best Men Can Be Language level Learner type Time Activity Topic Language Materials

Sentiment Analysis. Andrea Esuli

Introduction to Sentiment Analysis. Text Analytics - Andrea Esuli

Do we really know what people mean when they tweet? Dr. Diana Maynard University of Sheffield, UK

Victim s Stalking and Harassment Risk Identification Checklist (VS-DASH 2009) 1

WHAT'S HOT: LINEAR POPULARITY PREDICTION FROM TV AND SOCIAL USAGE DATA Jan Neumann, Xiaodong Yu, and Mohamad Ali Torkamani Comcast Labs

S-DASH (2009) Risk Identification Checklist For Use in Stalking and Harassment Cases

English 10-Persuasive Research Paper

Text Analysis. Language is complex. The goal of text analysis is to strip away some of that complexity to extract meaning.

Stalking in Supervised Visitation

MUSICAL MOODS: A MASS PARTICIPATION EXPERIMENT FOR AFFECTIVE CLASSIFICATION OF MUSIC

Article Title: Discovering the Influence of Sarcasm in Social Media Responses

Online community dialogue conducted in March Summary: evolving TV distribution models

Learning to Listen.. and Defusing a Hostile Situation. Course Outline

Stalking in Supervised Visitation

PHIL 480: Seminar in the History of Philosophy Building Moral Character: Neo-Confucianism and Moral Psychology

Understanding Book Popularity on Goodreads

KLUEnicorn at SemEval-2018 Task 3: A Naïve Approach to Irony Detection

A Fast Alignment Scheme for Automatic OCR Evaluation of Books

1 Introduction to the life course perspective. 2 Working with life course data. 3 Familial life course analysis. 4 Visualization.

Writing Workshops-Grade 6 Some topics are supported with WriteSmart models to assist students during the writing process. *=Collected in red writing

M1 OSCILLOSCOPE TOOLS

Research & Development. White Paper WHP 228. Musical Moods: A Mass Participation Experiment for the Affective Classification of Music

9.00 AM AM. you understand the main ideas and important details in the passage in other words, what the writer has said (Understanding U);

THE USE OF THOMSON REUTERS RESEARCH ANALYTIC RESOURCES IN ACADEMIC PERFORMANCE EVALUATION DR. EVANGELIA A.E.C. LIPITAKIS SEPTEMBER 2014

OMNICHANNEL MARKETING AUTOMATION AUTOMATE OMNICHANNEL MARKETING STRATEGIES TO IMPROVE THE CUSTOMER JOURNEY

Conditional Probability and Bayes

MEASURING EMERGING SCIENTIFIC IMPACT AND CURRENT RESEARCH TRENDS: A COMPARISON OF ALTMETRIC AND HOT PAPERS INDICATORS

Analyzing Electoral Tweets for Affect, Purpose, and Style

Are you ready to Publish? Understanding the publishing process. Presenter: Andrea Hoogenkamp-OBrien

Demystifying Citation Metrics. Michael Ladisch Pacific Libraries

Can Human Assistance Improve a Computational Poet?

DATA! NOW WHAT? Preparing your ERP data for analysis

Lyrics Classification using Naive Bayes

Sarcasm Detection on Facebook: A Supervised Learning Approach

Data Dissemination and Broadcasting Systems Lesson 05 Data Dissemination Broadcast-disk Models

Appendix: The ACUMEN Portfolio

How can you use Orion to get your publications registered with ACU?

Spread of hoax in Social Media

Library Trends Style Sheet for Articles

Wing-Man. Study Guide Page. Mark Gindick

Running head: AN INTERMEDIATE-LEVEL APA STYLE PAPER 1. Example of an Intermediate-Level APA Style Paper. Justine Berry. Austin Peay State University

Exploring Millennials Meaningful Relationship With TV Programming

Temporal patterns of happiness and sarcasm detection in social media (Twitter)

FINDING RESOURCES FROM YOUR READING LIST. Guide 2

Celebrities and Social Media. Before social media and social networking became the latest craze, the general

National Stalking Awareness Month

Therapeutic Sound for Tinnitus Management: Subjective Helpfulness Ratings. VA M e d i c a l C e n t e r D e c a t u r, G A

Analyzing Second Screen Based Social Soundtrack of TV Viewers from Diverse Cultural Settings

Gabby Douglas partners with antibullying group after Olympics backlash

Gamergate controversy From Wikipedia, the free encyclopedia

Write to be read. Dr B. Pochet. BSA Gembloux Agro-Bio Tech - ULiège. Write to be read B. Pochet

A Framework for Segmentation of Interview Videos

Writing Posters. The Bubbly Blonde

How to write a scientific paper for an international journal

Recognizing Stalking in Intimate Partner Cases. Part I: Recognizing Stalking in Intimate Partner Violence Cases 8/8/2017

Responding to Stalking

Recognizing and Responding to Stalking on Campus

Viewers and Voters: Attitudes to television coverage of the 2005 General Election

Sarcasm in Social Media. sites. This research topic posed an interesting question. Sarcasm, being heavily conveyed

American Chemical Society Publication Guidelines

Facilitator Guide Know It. Name It. Stop It. Public Awareness Program

`Health Literacy Quizzes Project

Reebok Reaches Light TV Viewers with Google and YouTube

V I D E O A D V E R T I S I N G B U R E A U - R E P O R T TV Preferred. Understanding YouTube Enthusiasts Affinity For Video Content

Creating Mindmaps of Documents

European Commission 7th Framework Programme SP4 - Capacities Science in Society 2010 Grant Agreement:

Basic Natural Language Processing

BBC Trust Review of the BBC s Speech Radio Services

Improve Your Storytelling Through Restorative Narrative

LMS301: Reference Management Software (Mendeley)

ManageMyTvs App for the ipadtm

Reducing IPTV Channel Zapping Time Based on Viewer s Surfing Behavior and Preference

CONQUERING CONTENT EXCERPT OF FINDINGS

CANADIAN AUDIENCE REPORT. Full report

The transition to Digital Terrestrial TV and utilisation of the digital dividend in Europe

TongArk: a Human-Machine Ensemble

Pacific s The Vagina Monologues 2011 Schedule & Expectations

Consider the following quote: What does the quote mean? Be prepared to share your thoughts.

10 Most Disturbing Celebrity Controversies And Scandals Of The Year 2015

A probabilistic approach to determining bass voice leading in melodic harmonisation

Publishing your research in a peer reviewed journal: Tips for success. Los Angeles London New Delhi Singapore Washington DC

7/10/2014. Supplemental Handout (Not on website) Itunes Playlist PRIZE SURPRISE!!!!!

User Guide. Best Seat Help Desk 24 hours a day/7 days a week

For the SIA. Applications of Propagation Delay & Skew tool. Introduction. Theory of Operation. Propagation Delay & Skew Tool

GLOSSARY of Internet and Library Terms

SEARCHING FOR SCHOLARLY ARTICLES

Library Acquisition Patterns Preliminary Findings

REPORT TO CONGRESS ON STALKING AND DOMESTIC VIOLENCE, 2005 THROUGH 2006

Social TV and the Social Soundtrack: Significance of Second Screen Interaction during Television Viewing

Stalking and Domestic Violence

EndNote X6 Workshop Michigan State University Libraries

UCSB LIBRARY COLLECTION SPACE PLANNING INITIATIVE: REPORT ON THE UCSB LIBRARY COLLECTIONS SURVEY OUTCOMES AND PLANNING STRATEGIES

Influence of Discovery Search Tools on Science and Engineering e-books Usage

Literary Studies; Sponsored Books Commissioning Editor: Jackie Jones

Large Scale Concepts and Classifiers for Describing Visual Sentiment in Social Multimedia

Measuring Your Research Impact: Citation and Altmetrics Tools

Transcription:

Measuring #GamerGate: A Tale of Hate, Sexism, and Bullying Despoina Chatzakou, Nicolas Kourtellis, Jeremy Blackburn Emiliano De Cristofaro, Gianluca Stringhini, Athena Vakali Aristotle University of Thessaloniki Telefonica Research University College London International Workshop on Computational Methods in CyberSafety, 2017 WWW 17 1 / 27

Social Networking Services WWW 17 2 / 27

Aggressive & Bullying behavior Cyberbullying. Repeated and hostile behavior by a group or an individual, using electronic forms of contact. Cyber-aggression. Intentional harm delivered by the use of electronic means to a person or a group of people who perceive such acts as offensive, derogatory, harmful, or unwanted. WWW 17 3 / 27

Gamergate controversy A coordinated campaign of harassment in the online world. It started with a blog post by an ex-boyfriend of independent game developer Zoe Quinn, alleging sexual improprieties. It quickly evolved into a polarizing issue, involving sexism, feminism, and social justice, taking place on social media like Twitter. Gamergate controversy provides us a unique point of view into online harassment campaigns. WWW 17 4 / 27

Our goals Proposal of a principled methodology to collect content related to aggressive and bullying activities. Gamergate specific: Quantification of this controversy. Exploration of the existing differences between Gamergaters and random Twitter users. WWW 17 5 / 27

Outline 1 Abusive dataset building Data collection Data processing 2 Measurement results How Active are Gamergaters? How Social are Gamergaters? Are Gamergaters Suspended More Often? 3 Conclusions WWW 17 6 / 27

Outline 1 Abusive dataset building Data collection Data processing 2 Measurement results How Active are Gamergaters? How Social are Gamergaters? Are Gamergaters Suspended More Often? 3 Conclusions WWW 17 7 / 27

Outline 1 Abusive dataset building Data collection Data processing 2 Measurement results How Active are Gamergaters? How Social are Gamergaters? Are Gamergaters Suspended More Often? 3 Conclusions WWW 17 8 / 27

Overall process Steps. 1. Select seed keyword(s). 2. Create dynamic list of keywords. 3. Crawl tweets. 4. Collect a random sample*. * Complements the abusive-related dataset with cases that are less likely to contain abusive content. WWW 17 9 / 27

Seed keyword(s) Select seed keyword(s) which are likely to relate with abusive incidents. E.g., #GamerGate, #BlackLivesMatter, #PizzaGate. Set of hate- or curse-related words, e.g., Hatebase database. At the time, t i, the lists of words to be used for filtering posted texts includes only the seed word(s): L(t 1 ) =< seed(s) >. WWW 17 10 / 27

Dynamic list of keywords (I) Filter keywords list to select abusive-related content. Update dynamically - in consecutive time intervals - the filtering list. Depending on the topic under examination: update the filtering list at different time intervals. Keywords list, L(T ) In T = {t 1, t 2,..., t n } the L(T ) equals to: L(t i ) =< seed(s), kw 1, kw 2, kw N >, where kw j is the jth top keyword in time period T = t i t i 1. WWW 17 11 / 27

Dynamic list of keywords (II) Update dynamic list for t i t i+1 Step 1. Investigate the texts posted at t i 1 t i. Step 2. Extract the top N keywords based on their frequency of appearance. Step 3. Update L(t i ) with the up-to-date top N keywords along with the seed word(s). Use of the updated list at the time period: t i t i+1. WWW 17 12 / 27

Outline 1 Abusive dataset building Data collection Data processing 2 Measurement results How Active are Gamergaters? How Social are Gamergaters? Are Gamergaters Suspended More Often? 3 Conclusions WWW 17 13 / 27

Preprocessing Cleaning. Removal of stop words, URLs, punctuations marks, normalization (repetitive characters elimination). Spam removal. Based on the number of hashtags, and duplications. Study of hashtags and duplication distributions to find optimal cutoffs. Avg. # hashtags: 0 to 17. Hashtags: we set the limit to 5. Similarity of tweets: Levenshtein distance. About 5% of the users have a high percentage of similar posts. Final dataset: 659k GG-related tweets, 1M random tweets. WWW 17 14 / 27

Outline 1 Abusive dataset building Data collection Data processing 2 Measurement results How Active are Gamergaters? How Social are Gamergaters? Are Gamergaters Suspended More Often? 3 Conclusions WWW 17 15 / 27

Outline 1 Abusive dataset building Data collection Data processing 2 Measurement results How Active are Gamergaters? How Social are Gamergaters? Are Gamergaters Suspended More Often? 3 Conclusions WWW 17 16 / 27

Account age, posts, hashtags GGers tend to have older accounts > They are not bots. GGers are significantly more active than random Twitter users (more posts and hashtags). WWW 17 17 / 27

Favorites, lists, URLs, mentions GGers have more favorites and topical lists declared than random users. GGers post more URLs in an attempt to disseminate information about their cause. GGers make more mentions within their posts > higher number of direct attacks compared to random users. WWW 17 18 / 27

Outline 1 Abusive dataset building Data collection Data processing 2 Measurement results How Active are Gamergaters? How Social are Gamergaters? Are Gamergaters Suspended More Often? 3 Conclusions WWW 17 19 / 27

Followers, friends GGers tend to have more friends and followers than random users. The controversy appears to be a clear us vs. them situation. Existence of in-group membership > heightens the likelihood of relationship formation. WWW 17 20 / 27

Emoticons, uppercases, sentiment, emotion Emoticons and shouting by using all capital letters: two common ways to express emotion. GGers and random users use emoticons at about the same rate. GGers tend to use all uppercase less often than random users. Sentiment, Offense, & Emotion GGers post tweets with a generally more negative sentiment > large proportion of offensive posts. GGers use more hate words than random users (Hatebase database). GGers and random users do not differ substantially in a variety of emotions: anger, disgust, fear, sadness, surprise. GGers are less joyful > they are not necessarily angry, but they are apparently not happy. WWW 17 21 / 27

Outline 1 Abusive dataset building Data collection Data processing 2 Measurement results How Active are Gamergaters? How Social are Gamergaters? Are Gamergaters Suspended More Often? 3 Conclusions WWW 17 22 / 27

Twitter Reaction to Aggression active deleted suspended Random users 67% 13% 20% Gamergate 86% 5% 9% Focus on a sample of 33k users from both the GG and random datasets. Users tend to be suspended more often than deleting their accounts by choice. Random users are more prone to be suspended or delete their accounts than GGers. WWW 17 23 / 27

Outline 1 Abusive dataset building Data collection Data processing 2 Measurement results How Active are Gamergaters? How Social are Gamergaters? Are Gamergaters Suspended More Often? 3 Conclusions WWW 17 24 / 27

Summary GGers use Twitter as a mechanism for broadcasting their ideals (hashtags, mentions). GGers appear to be Twitter savvy users and quite engaged with the platform (posts, participating lists, favorites). GGers are more well-connected within their network (followers, friends). GGers express with more negative sentiment overall, but they only differ significantly from random users with respect to joy. GGers are less likely to be suspended due to the inherent difficulties in detecting and combating online harassment activities. WWW 17 25 / 27

Future work Conduction of a more in-depth study of Gamergate controversy, focusing on how it evolved over time. Consideration of additional features, e.g., network-based, to further examine the differences among the GGers and random users. Automatically detect abusive users (upcoming HyperText paper: stay tuned!) WWW 17 26 / 27

Questions? This work has been funded by the European Commission as part of the ENCASE project (H2020-MSCA-RISE), under GA number 691025. WWW 17 27 / 27