TY - GEN
T1 - Bot detection
T2 - 12th International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction and Behavior Representation in Modeling and Simulation, SBP-BRiMS 2019
AU - H. Nazer, Tahora
AU - Davis, Matthew
AU - Karami, Mansooreh
AU - Akoglu, Leman
AU - Koelle, David
AU - Liu, Huan
N1 - Funding Information:
Acknowledgements. Support was provided, in part, by NSF grant 1461886 on “Disaster Preparation and Response via Big Data Analysis and Robust Networking” and ONR grants N000141612257 (on “Intelligent Analysis of Big Social Media Data for Crisis Tracking”) and N000141812108 (on “Bot Hunter”). We would like to thank anonymous reviewers for their valuable feedback.
Publisher Copyright:
© Springer Nature Switzerland AG 2019.
PY - 2019
Y1 - 2019
N2 - Social bots are an effective tool in the arsenal of malicious actors who manipulate discussions on social media. Bots help spread misinformation, promote political propaganda, and inflate the popularity of users and content. Hence, it is necessary to differentiate bot accounts and human users. There are several bot detection methods that approach this problem. Conventional methods either focus on precision regardless of the overall performance or optimize overall performance, say F1, without monitoring its effect on precision or recall. Focusing on precision means that those users marked as bots are more likely than not bots but a large portion of the bots could remain undetected. From a user’s perspective, however, it is more desirable to have less interaction with bots, even if it would incur a loss in precision. This can be achieved by a detection method with higher recall. A trivial, but useless, solution for high recall is to classify every account (human or bot) as bot, hence, resulting in poor overall performance. In this work, we investigate if it is feasible for a method to focus on recall without considerable loss in overall performance. Extensive experiments with recall and precision trade-off suggest that high recall can be achieved without much overall performance deterioration. This research leads to a recall-focused approach to bot detection, REFOCUS, with some lessons learned and future directions.
AB - Social bots are an effective tool in the arsenal of malicious actors who manipulate discussions on social media. Bots help spread misinformation, promote political propaganda, and inflate the popularity of users and content. Hence, it is necessary to differentiate bot accounts and human users. There are several bot detection methods that approach this problem. Conventional methods either focus on precision regardless of the overall performance or optimize overall performance, say F1, without monitoring its effect on precision or recall. Focusing on precision means that those users marked as bots are more likely than not bots but a large portion of the bots could remain undetected. From a user’s perspective, however, it is more desirable to have less interaction with bots, even if it would incur a loss in precision. This can be achieved by a detection method with higher recall. A trivial, but useless, solution for high recall is to classify every account (human or bot) as bot, hence, resulting in poor overall performance. In this work, we investigate if it is feasible for a method to focus on recall without considerable loss in overall performance. Extensive experiments with recall and precision trade-off suggest that high recall can be achieved without much overall performance deterioration. This research leads to a recall-focused approach to bot detection, REFOCUS, with some lessons learned and future directions.
KW - Bot detection
KW - Recall
KW - Social bots
KW - Social media
KW - Twitter
UR - http://www.scopus.com/inward/record.url?scp=85068154080&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85068154080&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-21741-9_5
DO - 10.1007/978-3-030-21741-9_5
M3 - Conference contribution
AN - SCOPUS:85068154080
SN - 9783030217402
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 39
EP - 49
BT - Social, Cultural, and Behavioral Modeling - 12th International Conference, SBP-BRiMS 2019, Proceedings
A2 - Thomson, Robert
A2 - Bisgin, Halil
A2 - Dancy, Christopher
A2 - Hyder, Ayaz
PB - Springer Verlag
Y2 - 9 July 2019 through 12 July 2019
ER -