Less is more: Semi-supervised causal inference for detecting pathogenic users in social media

Hamidreza Alvari, Elham Shaabani, Soumajyoti Sarkar, Ghazaleh Beigi, Paulo Shakarian

Research output: Chapter in Book/Report/Conference proceedingConference contribution

22 Scopus citations

Abstract

Recent years have witnessed a surge of manipulation of public opinion and political events by malicious social media actors. These users are referred to as �Pathogenic Social Media (PSM)" accounts. PSMs are key users in spreading misinformation in social media to viral proportions. These accounts can be either controlled by real users or automated bots. Identification of PSMs is thus of utmost importance for social media authorities. The burden usually falls to automatic approaches that can identify these accounts and protect social media reputation. However, lack of sufficient labeled examples for devising and training sophisticated approaches to combat these accounts is still one of the foremost challenges facing social media firms. In contrast, unlabeled data is abundant and cheap to obtain thanks to massive user-generated data. In this paper, we propose a semi-supervised causal inference PSM detection framework, SemiPsm, to compensate for the lack of labeled data. In particular, the proposed method leverages unlabeled data in the form of manifold regularization and only relies on cascade information. This is in contrast to the existing approaches that use exhaustive feature engineering (e.g., profile information, network structure, etc.). Evidence from empirical experiments on a real-world ISIS-related dataset from Twitter suggests promising results of utilizing unlabeled instances for detecting PSMs.

Original languageEnglish (US)
Title of host publicationThe Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019
PublisherAssociation for Computing Machinery, Inc
Pages154-161
Number of pages8
ISBN (Electronic)9781450366755
DOIs
StatePublished - May 13 2019
Event2019 World Wide Web Conference, WWW 2019 - San Francisco, United States
Duration: May 13 2019May 17 2019

Publication series

NameThe Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019

Conference

Conference2019 World Wide Web Conference, WWW 2019
Country/TerritoryUnited States
CitySan Francisco
Period5/13/195/17/19

Keywords

  • Causal inference
  • Pathogenic users
  • Semi-supervised learning
  • Social media

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Software

Fingerprint

Dive into the research topics of 'Less is more: Semi-supervised causal inference for detecting pathogenic users in social media'. Together they form a unique fingerprint.

Cite this