18 Citations (Scopus)

Abstract

Community detection is an unsupervised learning task that discovers groups such that group members share more similarities or interact more frequently among themselves than with people outside groups. In social media, link information can reveal heterogeneous relationships of various strengths, but often can be noisy. Since different sources of data in social media can provide complementary information, e.g., bookmarking and tagging data indicates user interests, frequency of commenting suggests the strength of ties, etc., we propose to integrate social media data of multiple types for improving the performance of community detection. We present a joint optimization framework to integrate multiple data sources for community detection. Empirical evaluation on both synthetic data and real-world social media data shows significant performance improvement of the proposed approach. This work elaborates the need for and challenges of multi-source integration of heterogeneous data types, and provides a principled way of multi-source community detection.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages1-20
Number of pages20
Volume7472 LNAI
DOIs
StatePublished - 2012
Event2nd International Workshop on Modeling and Mining Ubiquitous Social Media, MSM 2011, MUSE 2011 - Boston, MA, United States
Duration: Oct 9 2011Oct 9 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7472 LNAI
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other2nd International Workshop on Modeling and Mining Ubiquitous Social Media, MSM 2011, MUSE 2011
CountryUnited States
CityBoston, MA
Period10/9/1110/9/11

Fingerprint

Community Detection
Unsupervised learning
Social Media
Integrate
Unsupervised Learning
Tagging
Tie
Synthetic Data
Optimization
Evaluation

Keywords

  • Community Detection
  • Multi-source Integration
  • Social Media Data

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Tang, J., Wang, X., & Liu, H. (2012). Integrating social media data for community detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7472 LNAI, pp. 1-20). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7472 LNAI). https://doi.org/10.1007/978-3-642-33684-3_1

Integrating social media data for community detection. / Tang, Jiliang; Wang, Xufei; Liu, Huan.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7472 LNAI 2012. p. 1-20 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7472 LNAI).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Tang, J, Wang, X & Liu, H 2012, Integrating social media data for community detection. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 7472 LNAI, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 7472 LNAI, pp. 1-20, 2nd International Workshop on Modeling and Mining Ubiquitous Social Media, MSM 2011, MUSE 2011, Boston, MA, United States, 10/9/11. https://doi.org/10.1007/978-3-642-33684-3_1
Tang J, Wang X, Liu H. Integrating social media data for community detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7472 LNAI. 2012. p. 1-20. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-33684-3_1
Tang, Jiliang ; Wang, Xufei ; Liu, Huan. / Integrating social media data for community detection. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7472 LNAI 2012. pp. 1-20 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{414cac56dcec4686a7da870f0e989a42,
title = "Integrating social media data for community detection",
abstract = "Community detection is an unsupervised learning task that discovers groups such that group members share more similarities or interact more frequently among themselves than with people outside groups. In social media, link information can reveal heterogeneous relationships of various strengths, but often can be noisy. Since different sources of data in social media can provide complementary information, e.g., bookmarking and tagging data indicates user interests, frequency of commenting suggests the strength of ties, etc., we propose to integrate social media data of multiple types for improving the performance of community detection. We present a joint optimization framework to integrate multiple data sources for community detection. Empirical evaluation on both synthetic data and real-world social media data shows significant performance improvement of the proposed approach. This work elaborates the need for and challenges of multi-source integration of heterogeneous data types, and provides a principled way of multi-source community detection.",
keywords = "Community Detection, Multi-source Integration, Social Media Data",
author = "Jiliang Tang and Xufei Wang and Huan Liu",
year = "2012",
doi = "10.1007/978-3-642-33684-3_1",
language = "English (US)",
isbn = "9783642336836",
volume = "7472 LNAI",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "1--20",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Integrating social media data for community detection

AU - Tang, Jiliang

AU - Wang, Xufei

AU - Liu, Huan

PY - 2012

Y1 - 2012

N2 - Community detection is an unsupervised learning task that discovers groups such that group members share more similarities or interact more frequently among themselves than with people outside groups. In social media, link information can reveal heterogeneous relationships of various strengths, but often can be noisy. Since different sources of data in social media can provide complementary information, e.g., bookmarking and tagging data indicates user interests, frequency of commenting suggests the strength of ties, etc., we propose to integrate social media data of multiple types for improving the performance of community detection. We present a joint optimization framework to integrate multiple data sources for community detection. Empirical evaluation on both synthetic data and real-world social media data shows significant performance improvement of the proposed approach. This work elaborates the need for and challenges of multi-source integration of heterogeneous data types, and provides a principled way of multi-source community detection.

AB - Community detection is an unsupervised learning task that discovers groups such that group members share more similarities or interact more frequently among themselves than with people outside groups. In social media, link information can reveal heterogeneous relationships of various strengths, but often can be noisy. Since different sources of data in social media can provide complementary information, e.g., bookmarking and tagging data indicates user interests, frequency of commenting suggests the strength of ties, etc., we propose to integrate social media data of multiple types for improving the performance of community detection. We present a joint optimization framework to integrate multiple data sources for community detection. Empirical evaluation on both synthetic data and real-world social media data shows significant performance improvement of the proposed approach. This work elaborates the need for and challenges of multi-source integration of heterogeneous data types, and provides a principled way of multi-source community detection.

KW - Community Detection

KW - Multi-source Integration

KW - Social Media Data

UR - http://www.scopus.com/inward/record.url?scp=84867631568&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84867631568&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-33684-3_1

DO - 10.1007/978-3-642-33684-3_1

M3 - Conference contribution

AN - SCOPUS:84867631568

SN - 9783642336836

VL - 7472 LNAI

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 1

EP - 20

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -