Toward order-of-magnitude cascade prediction

Ruocheng Guo; Elham Shaabani; Abhinav Bhatnagar; Paulo Shakarian

doi:10.1145/2808797.2809358

Toward order-of-magnitude cascade prediction

Ruocheng Guo, Elham Shaabani, Abhinav Bhatnagar, Paulo Shakarian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

9 Scopus citations

Abstract

When a piece of information (microblog, photograph, video, link, etc.) starts to spread in a social network, an important question arises: will it spread to "viral" proportions - where "viral" is defined as an order-of-magnitude increase. However, several previous studies have established that cascade size and frequency are related through a power-law - which leads to a severe imbalance in this classification problem. In this paper, we devise a suite of measurements based on "structural diversity" - the variety of social contexts (communities) in which individuals partaking in a given cascade engage. We demonstrate these measures are able to distinguish viral from non-viral cascades, despite the severe imbalance of the data for this problem. Further, we leverage these measurements as features in a classification approach, successfully predicting microblogs that grow from 50 to 500 reposts with precision of 0.69 and recall of 0.52 for the viral class - despite this class comprising under 2% of samples. This significantly outperforms our baseline approach as well as the current state-of-the-art. Our work also demonstrates how we can tradeoff between precision and recall.

Original language	English (US)
Title of host publication	Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015
Editors	Jian Pei, Jie Tang, Fabrizio Silvestri
Publisher	Association for Computing Machinery, Inc
Pages	1610-1613
Number of pages	4
ISBN (Electronic)	9781450338547
DOIs	https://doi.org/10.1145/2808797.2809358
State	Published - Aug 25 2015
Event	IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015 - Paris, France Duration: Aug 25 2015 → Aug 28 2015

Publication series

Name	Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015

Other

Other	IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015
Country/Territory	France
City	Paris
Period	8/25/15 → 8/28/15

ASJC Scopus subject areas

Computer Science Applications
Computer Networks and Communications

Access to Document

10.1145/2808797.2809358

Cite this

Guo, R., Shaabani, E., Bhatnagar, A., & Shakarian, P. (2015). Toward order-of-magnitude cascade prediction. In J. Pei, J. Tang, & F. Silvestri (Eds.), Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015 (pp. 1610-1613). (Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015). Association for Computing Machinery, Inc. https://doi.org/10.1145/2808797.2809358

Toward order-of-magnitude cascade prediction. / Guo, Ruocheng; Shaabani, Elham; Bhatnagar, Abhinav et al.
Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015. ed. / Jian Pei; Jie Tang; Fabrizio Silvestri. Association for Computing Machinery, Inc, 2015. p. 1610-1613 (Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Guo, R, Shaabani, E, Bhatnagar, A & Shakarian, P 2015, Toward order-of-magnitude cascade prediction. in J Pei, J Tang & F Silvestri (eds), Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015. Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015, Association for Computing Machinery, Inc, pp. 1610-1613, IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015, Paris, France, 8/25/15. https://doi.org/10.1145/2808797.2809358

Guo R, Shaabani E, Bhatnagar A, Shakarian P. Toward order-of-magnitude cascade prediction. In Pei J, Tang J, Silvestri F, editors, Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015. Association for Computing Machinery, Inc. 2015. p. 1610-1613. (Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015). doi: 10.1145/2808797.2809358

Guo, Ruocheng ; Shaabani, Elham ; Bhatnagar, Abhinav et al. / Toward order-of-magnitude cascade prediction. Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015. editor / Jian Pei ; Jie Tang ; Fabrizio Silvestri. Association for Computing Machinery, Inc, 2015. pp. 1610-1613 (Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015).

@inproceedings{f701ca20599b4cde800325bb09f0aa7d,

title = "Toward order-of-magnitude cascade prediction",

abstract = "When a piece of information (microblog, photograph, video, link, etc.) starts to spread in a social network, an important question arises: will it spread to {"}viral{"} proportions - where {"}viral{"} is defined as an order-of-magnitude increase. However, several previous studies have established that cascade size and frequency are related through a power-law - which leads to a severe imbalance in this classification problem. In this paper, we devise a suite of measurements based on {"}structural diversity{"} - the variety of social contexts (communities) in which individuals partaking in a given cascade engage. We demonstrate these measures are able to distinguish viral from non-viral cascades, despite the severe imbalance of the data for this problem. Further, we leverage these measurements as features in a classification approach, successfully predicting microblogs that grow from 50 to 500 reposts with precision of 0.69 and recall of 0.52 for the viral class - despite this class comprising under 2% of samples. This significantly outperforms our baseline approach as well as the current state-of-the-art. Our work also demonstrates how we can tradeoff between precision and recall.",

author = "Ruocheng Guo and Elham Shaabani and Abhinav Bhatnagar and Paulo Shakarian",

note = "Funding Information: IV. ACKNOWLEDGMENT This work is supported through the AFOSR Young Investigator Program (YIP), grant number FA9550-15-1-0159.; IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015 ; Conference date: 25-08-2015 Through 28-08-2015",

year = "2015",

month = aug,

day = "25",

doi = "10.1145/2808797.2809358",

language = "English (US)",

series = "Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015",

publisher = "Association for Computing Machinery, Inc",

pages = "1610--1613",

editor = "Jian Pei and Jie Tang and Fabrizio Silvestri",

booktitle = "Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015",

}

TY - GEN

T1 - Toward order-of-magnitude cascade prediction

AU - Guo, Ruocheng

AU - Shaabani, Elham

AU - Bhatnagar, Abhinav

AU - Shakarian, Paulo

N1 - Funding Information: IV. ACKNOWLEDGMENT This work is supported through the AFOSR Young Investigator Program (YIP), grant number FA9550-15-1-0159.

PY - 2015/8/25

Y1 - 2015/8/25

N2 - When a piece of information (microblog, photograph, video, link, etc.) starts to spread in a social network, an important question arises: will it spread to "viral" proportions - where "viral" is defined as an order-of-magnitude increase. However, several previous studies have established that cascade size and frequency are related through a power-law - which leads to a severe imbalance in this classification problem. In this paper, we devise a suite of measurements based on "structural diversity" - the variety of social contexts (communities) in which individuals partaking in a given cascade engage. We demonstrate these measures are able to distinguish viral from non-viral cascades, despite the severe imbalance of the data for this problem. Further, we leverage these measurements as features in a classification approach, successfully predicting microblogs that grow from 50 to 500 reposts with precision of 0.69 and recall of 0.52 for the viral class - despite this class comprising under 2% of samples. This significantly outperforms our baseline approach as well as the current state-of-the-art. Our work also demonstrates how we can tradeoff between precision and recall.

AB - When a piece of information (microblog, photograph, video, link, etc.) starts to spread in a social network, an important question arises: will it spread to "viral" proportions - where "viral" is defined as an order-of-magnitude increase. However, several previous studies have established that cascade size and frequency are related through a power-law - which leads to a severe imbalance in this classification problem. In this paper, we devise a suite of measurements based on "structural diversity" - the variety of social contexts (communities) in which individuals partaking in a given cascade engage. We demonstrate these measures are able to distinguish viral from non-viral cascades, despite the severe imbalance of the data for this problem. Further, we leverage these measurements as features in a classification approach, successfully predicting microblogs that grow from 50 to 500 reposts with precision of 0.69 and recall of 0.52 for the viral class - despite this class comprising under 2% of samples. This significantly outperforms our baseline approach as well as the current state-of-the-art. Our work also demonstrates how we can tradeoff between precision and recall.

UR - http://www.scopus.com/inward/record.url?scp=84962582467&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84962582467&partnerID=8YFLogxK

U2 - 10.1145/2808797.2809358

DO - 10.1145/2808797.2809358

M3 - Conference contribution

AN - SCOPUS:84962582467

T3 - Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015

SP - 1610

EP - 1613

BT - Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015

A2 - Pei, Jian

A2 - Tang, Jie

A2 - Silvestri, Fabrizio

PB - Association for Computing Machinery, Inc

T2 - IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015

Y2 - 25 August 2015 through 28 August 2015

ER -

Toward order-of-magnitude cascade prediction

Abstract

Publication series

Other

ASJC Scopus subject areas

Access to Document

Other files and links

Cite this