Abstract

Various socio-political organizations, from activist groups to propaganda campaigners, create accounts on Twitter to reach out, influence and gain followers. In order to analyze the impact of these organizational accounts, the first step is to identify them. In this paper, we develop and experiment with a set of network-based, behavioral, temporal and spatial characteristics in these accounts, independent of domain or language, to identify features that can be useful in detecting organizational accounts. In order to assess this model, we experimented with a microblog corpus comprised of over 7 million tweets from 150,000 Twitter users in Bangladesh, tweeted between June and October 2016. We sampled 31,139 accounts using cold-start heuristics to locate and label nearly 200 organizational accounts, distributed as 68 NGOs, 62 news outlets, 35 political groups, and 17 public intellectual and iconic figures. The remaining accounts were labeled as individuals. Next, we developed a set of features and experimented with a set of linear and non-linear classifiers. The highest performing sparse logistic regression classifier achieved an accuracy of 68.2% precision and 64.4% recall leading to a 66.2% F1-score in detecting less than 1% rare organizational accounts using a set of content- and language-independent features.

Original languageEnglish (US)
Title of host publicationSocial, Cultural, and Behavioral Modeling - 11th International Conference, SBP-BRiMS 2018, Proceedings
EditorsHalil Bisgin, Robert Thomson, Ayaz Hyder, Christopher Dancy
PublisherSpringer Verlag
Pages164-175
Number of pages12
ISBN (Print)9783319933719
DOIs
StatePublished - Jan 1 2018
Event11th International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction conference and Behavior Representation in Modeling and Simulation, SBP-BRiMS 2018 - Washington, United States
Duration: Jul 10 2018Jul 13 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10899 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other11th International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction conference and Behavior Representation in Modeling and Simulation, SBP-BRiMS 2018
CountryUnited States
CityWashington
Period7/10/187/13/18

Fingerprint

Classifiers
Logistics
Labels
Classifier
Logistic Regression
Figure
Experiments
Heuristics
Experiment
Language
Model

Keywords

  • Automatic identification
  • Social network
  • Social network analysis

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Alzahrani, S., Gore, C., Salehi, A., & Davulcu, H. (2018). Finding organizational accounts based on structural and behavioral factors on twitter. In H. Bisgin, R. Thomson, A. Hyder, & C. Dancy (Eds.), Social, Cultural, and Behavioral Modeling - 11th International Conference, SBP-BRiMS 2018, Proceedings (pp. 164-175). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10899 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-93372-6_18

Finding organizational accounts based on structural and behavioral factors on twitter. / Alzahrani, Sultan; Gore, Chinmay; Salehi, Amin; Davulcu, Hasan.

Social, Cultural, and Behavioral Modeling - 11th International Conference, SBP-BRiMS 2018, Proceedings. ed. / Halil Bisgin; Robert Thomson; Ayaz Hyder; Christopher Dancy. Springer Verlag, 2018. p. 164-175 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10899 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Alzahrani, S, Gore, C, Salehi, A & Davulcu, H 2018, Finding organizational accounts based on structural and behavioral factors on twitter. in H Bisgin, R Thomson, A Hyder & C Dancy (eds), Social, Cultural, and Behavioral Modeling - 11th International Conference, SBP-BRiMS 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10899 LNCS, Springer Verlag, pp. 164-175, 11th International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction conference and Behavior Representation in Modeling and Simulation, SBP-BRiMS 2018, Washington, United States, 7/10/18. https://doi.org/10.1007/978-3-319-93372-6_18
Alzahrani S, Gore C, Salehi A, Davulcu H. Finding organizational accounts based on structural and behavioral factors on twitter. In Bisgin H, Thomson R, Hyder A, Dancy C, editors, Social, Cultural, and Behavioral Modeling - 11th International Conference, SBP-BRiMS 2018, Proceedings. Springer Verlag. 2018. p. 164-175. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-93372-6_18
Alzahrani, Sultan ; Gore, Chinmay ; Salehi, Amin ; Davulcu, Hasan. / Finding organizational accounts based on structural and behavioral factors on twitter. Social, Cultural, and Behavioral Modeling - 11th International Conference, SBP-BRiMS 2018, Proceedings. editor / Halil Bisgin ; Robert Thomson ; Ayaz Hyder ; Christopher Dancy. Springer Verlag, 2018. pp. 164-175 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{a2ebd7479c4144e49c3217bfb1562f1e,
title = "Finding organizational accounts based on structural and behavioral factors on twitter",
abstract = "Various socio-political organizations, from activist groups to propaganda campaigners, create accounts on Twitter to reach out, influence and gain followers. In order to analyze the impact of these organizational accounts, the first step is to identify them. In this paper, we develop and experiment with a set of network-based, behavioral, temporal and spatial characteristics in these accounts, independent of domain or language, to identify features that can be useful in detecting organizational accounts. In order to assess this model, we experimented with a microblog corpus comprised of over 7 million tweets from 150,000 Twitter users in Bangladesh, tweeted between June and October 2016. We sampled 31,139 accounts using cold-start heuristics to locate and label nearly 200 organizational accounts, distributed as 68 NGOs, 62 news outlets, 35 political groups, and 17 public intellectual and iconic figures. The remaining accounts were labeled as individuals. Next, we developed a set of features and experimented with a set of linear and non-linear classifiers. The highest performing sparse logistic regression classifier achieved an accuracy of 68.2{\%} precision and 64.4{\%} recall leading to a 66.2{\%} F1-score in detecting less than 1{\%} rare organizational accounts using a set of content- and language-independent features.",
keywords = "Automatic identification, Social network, Social network analysis",
author = "Sultan Alzahrani and Chinmay Gore and Amin Salehi and Hasan Davulcu",
year = "2018",
month = "1",
day = "1",
doi = "10.1007/978-3-319-93372-6_18",
language = "English (US)",
isbn = "9783319933719",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "164--175",
editor = "Halil Bisgin and Robert Thomson and Ayaz Hyder and Christopher Dancy",
booktitle = "Social, Cultural, and Behavioral Modeling - 11th International Conference, SBP-BRiMS 2018, Proceedings",

}

TY - GEN

T1 - Finding organizational accounts based on structural and behavioral factors on twitter

AU - Alzahrani, Sultan

AU - Gore, Chinmay

AU - Salehi, Amin

AU - Davulcu, Hasan

PY - 2018/1/1

Y1 - 2018/1/1

N2 - Various socio-political organizations, from activist groups to propaganda campaigners, create accounts on Twitter to reach out, influence and gain followers. In order to analyze the impact of these organizational accounts, the first step is to identify them. In this paper, we develop and experiment with a set of network-based, behavioral, temporal and spatial characteristics in these accounts, independent of domain or language, to identify features that can be useful in detecting organizational accounts. In order to assess this model, we experimented with a microblog corpus comprised of over 7 million tweets from 150,000 Twitter users in Bangladesh, tweeted between June and October 2016. We sampled 31,139 accounts using cold-start heuristics to locate and label nearly 200 organizational accounts, distributed as 68 NGOs, 62 news outlets, 35 political groups, and 17 public intellectual and iconic figures. The remaining accounts were labeled as individuals. Next, we developed a set of features and experimented with a set of linear and non-linear classifiers. The highest performing sparse logistic regression classifier achieved an accuracy of 68.2% precision and 64.4% recall leading to a 66.2% F1-score in detecting less than 1% rare organizational accounts using a set of content- and language-independent features.

AB - Various socio-political organizations, from activist groups to propaganda campaigners, create accounts on Twitter to reach out, influence and gain followers. In order to analyze the impact of these organizational accounts, the first step is to identify them. In this paper, we develop and experiment with a set of network-based, behavioral, temporal and spatial characteristics in these accounts, independent of domain or language, to identify features that can be useful in detecting organizational accounts. In order to assess this model, we experimented with a microblog corpus comprised of over 7 million tweets from 150,000 Twitter users in Bangladesh, tweeted between June and October 2016. We sampled 31,139 accounts using cold-start heuristics to locate and label nearly 200 organizational accounts, distributed as 68 NGOs, 62 news outlets, 35 political groups, and 17 public intellectual and iconic figures. The remaining accounts were labeled as individuals. Next, we developed a set of features and experimented with a set of linear and non-linear classifiers. The highest performing sparse logistic regression classifier achieved an accuracy of 68.2% precision and 64.4% recall leading to a 66.2% F1-score in detecting less than 1% rare organizational accounts using a set of content- and language-independent features.

KW - Automatic identification

KW - Social network

KW - Social network analysis

UR - http://www.scopus.com/inward/record.url?scp=85049801841&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85049801841&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-93372-6_18

DO - 10.1007/978-3-319-93372-6_18

M3 - Conference contribution

SN - 9783319933719

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 164

EP - 175

BT - Social, Cultural, and Behavioral Modeling - 11th International Conference, SBP-BRiMS 2018, Proceedings

A2 - Bisgin, Halil

A2 - Thomson, Robert

A2 - Hyder, Ayaz

A2 - Dancy, Christopher

PB - Springer Verlag

ER -