Towards Understanding Community Interests with Topic Modeling

Feng Wang, Kenneth Orton, Paul Wagenseller, Kuai Xu

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

Community plays an important role in shaping a network. Quantitatively interpreting a community is necessary for graph generalization which is used for privacy preserving, summarization, and dimensionality reduction in social network mining. However, there are few works in community detection focusing on making sense of the identified communities. In this paper, we study communities in the social media context and investigate structure-based communities from the perspective of community topical homophily. We train Latent Dirichlet Allocation topic model to capture the topics in the aggregated tweets of each user in a community and propose new distance metrics to quantify the topic similarity of individual users, cliques, and communities. By building a Twitter topic modeling system to interpret the communities identified by two community detection algorithms in a large scale Twitter topology, we discover evidence that Twitter users in a community show common interests in general. The major contributions of this paper lie in that it verifies the topical homophily in structure-based social media communities and proposes new metrics to quantitatively label the degree of the homophily and describe the theme of the communities.

Original languageEnglish (US)
Pages (from-to)24660-24668
Number of pages9
JournalIEEE Access
Volume6
DOIs
StatePublished - Mar 13 2018

Fingerprint

Labels
Topology

Keywords

  • Community detection
  • LDA
  • social media
  • topic modeling
  • topical homophily

ASJC Scopus subject areas

  • Computer Science(all)
  • Materials Science(all)
  • Engineering(all)

Cite this

Towards Understanding Community Interests with Topic Modeling. / Wang, Feng; Orton, Kenneth; Wagenseller, Paul; Xu, Kuai.

In: IEEE Access, Vol. 6, 13.03.2018, p. 24660-24668.

Research output: Contribution to journalArticle

Wang, Feng ; Orton, Kenneth ; Wagenseller, Paul ; Xu, Kuai. / Towards Understanding Community Interests with Topic Modeling. In: IEEE Access. 2018 ; Vol. 6. pp. 24660-24668.
@article{2b1f500aba064009b750785f4aaf19a7,
title = "Towards Understanding Community Interests with Topic Modeling",
abstract = "Community plays an important role in shaping a network. Quantitatively interpreting a community is necessary for graph generalization which is used for privacy preserving, summarization, and dimensionality reduction in social network mining. However, there are few works in community detection focusing on making sense of the identified communities. In this paper, we study communities in the social media context and investigate structure-based communities from the perspective of community topical homophily. We train Latent Dirichlet Allocation topic model to capture the topics in the aggregated tweets of each user in a community and propose new distance metrics to quantify the topic similarity of individual users, cliques, and communities. By building a Twitter topic modeling system to interpret the communities identified by two community detection algorithms in a large scale Twitter topology, we discover evidence that Twitter users in a community show common interests in general. The major contributions of this paper lie in that it verifies the topical homophily in structure-based social media communities and proposes new metrics to quantitatively label the degree of the homophily and describe the theme of the communities.",
keywords = "Community detection, LDA, social media, topic modeling, topical homophily",
author = "Feng Wang and Kenneth Orton and Paul Wagenseller and Kuai Xu",
year = "2018",
month = "3",
day = "13",
doi = "10.1109/ACCESS.2018.2815904",
language = "English (US)",
volume = "6",
pages = "24660--24668",
journal = "IEEE Access",
issn = "2169-3536",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Towards Understanding Community Interests with Topic Modeling

AU - Wang, Feng

AU - Orton, Kenneth

AU - Wagenseller, Paul

AU - Xu, Kuai

PY - 2018/3/13

Y1 - 2018/3/13

N2 - Community plays an important role in shaping a network. Quantitatively interpreting a community is necessary for graph generalization which is used for privacy preserving, summarization, and dimensionality reduction in social network mining. However, there are few works in community detection focusing on making sense of the identified communities. In this paper, we study communities in the social media context and investigate structure-based communities from the perspective of community topical homophily. We train Latent Dirichlet Allocation topic model to capture the topics in the aggregated tweets of each user in a community and propose new distance metrics to quantify the topic similarity of individual users, cliques, and communities. By building a Twitter topic modeling system to interpret the communities identified by two community detection algorithms in a large scale Twitter topology, we discover evidence that Twitter users in a community show common interests in general. The major contributions of this paper lie in that it verifies the topical homophily in structure-based social media communities and proposes new metrics to quantitatively label the degree of the homophily and describe the theme of the communities.

AB - Community plays an important role in shaping a network. Quantitatively interpreting a community is necessary for graph generalization which is used for privacy preserving, summarization, and dimensionality reduction in social network mining. However, there are few works in community detection focusing on making sense of the identified communities. In this paper, we study communities in the social media context and investigate structure-based communities from the perspective of community topical homophily. We train Latent Dirichlet Allocation topic model to capture the topics in the aggregated tweets of each user in a community and propose new distance metrics to quantify the topic similarity of individual users, cliques, and communities. By building a Twitter topic modeling system to interpret the communities identified by two community detection algorithms in a large scale Twitter topology, we discover evidence that Twitter users in a community show common interests in general. The major contributions of this paper lie in that it verifies the topical homophily in structure-based social media communities and proposes new metrics to quantitatively label the degree of the homophily and describe the theme of the communities.

KW - Community detection

KW - LDA

KW - social media

KW - topic modeling

KW - topical homophily

UR - http://www.scopus.com/inward/record.url?scp=85043767816&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85043767816&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2018.2815904

DO - 10.1109/ACCESS.2018.2815904

M3 - Article

VL - 6

SP - 24660

EP - 24668

JO - IEEE Access

JF - IEEE Access

SN - 2169-3536

ER -