TY - GEN
T1 - Scalable learning of collective behavior based on sparse social dimensions
AU - Tang, Lei
AU - Liu, Huan
PY - 2009
Y1 - 2009
N2 - The study of collective behavior is to understand how individuals behave in a social network environment. Oceans of data generated by social media like Facebook, Twitter, Flickr and YouTube present opportunities and challenges to studying collective behavior in a large scale. In this work, we aim to learn to predict collective behavior in social media. In particular, given information about some individuals, how can we infer the behavior of unobserved individuals in the same network? A social-dimension based approach is adopted to address the heterogeneity of connections presented in social media. However, the networks in social media are normally of colossal size, involving hundreds of thousands or even millions of actors. The scale of networks entails scalable learning of models for collective behavior prediction. To address the scalability issue, we propose an edge-centric clustering scheme to extract sparse social dimensions. With sparse social dimensions, the social-dimension based approach can efficiently handle networks of millions of actors while demonstrating comparable prediction performance as other non-scalable methods.
AB - The study of collective behavior is to understand how individuals behave in a social network environment. Oceans of data generated by social media like Facebook, Twitter, Flickr and YouTube present opportunities and challenges to studying collective behavior in a large scale. In this work, we aim to learn to predict collective behavior in social media. In particular, given information about some individuals, how can we infer the behavior of unobserved individuals in the same network? A social-dimension based approach is adopted to address the heterogeneity of connections presented in social media. However, the networks in social media are normally of colossal size, involving hundreds of thousands or even millions of actors. The scale of networks entails scalable learning of models for collective behavior prediction. To address the scalability issue, we propose an edge-centric clustering scheme to extract sparse social dimensions. With sparse social dimensions, the social-dimension based approach can efficiently handle networks of millions of actors while demonstrating comparable prediction performance as other non-scalable methods.
KW - Behavior prediction
KW - Edge-centric clustering
KW - Relational learning
KW - Social dimensions
KW - Social media
UR - http://www.scopus.com/inward/record.url?scp=74549120273&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=74549120273&partnerID=8YFLogxK
U2 - 10.1145/1645953.1646094
DO - 10.1145/1645953.1646094
M3 - Conference contribution
AN - SCOPUS:74549120273
SN - 9781605585123
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 1107
EP - 1116
BT - ACM 18th International Conference on Information and Knowledge Management, CIKM 2009
T2 - ACM 18th International Conference on Information and Knowledge Management, CIKM 2009
Y2 - 2 November 2009 through 6 November 2009
ER -