TY - CHAP
T1 - Proximity tracking on dynamic bipartite graphs
T2 - Problem definitions and fast solutions
AU - Tong, Hanghang
AU - Papadimitriou, Spiros
AU - Yu, Philip S.
AU - Faloutsos, Christos
N1 - Publisher Copyright:
© Springer Science+Business Media, LLC 2010. All rights reserved.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2010
Y1 - 2010
N2 - Large bipartite graphs which evolve and grow over time (e.g., new links arrive, old links die out, or link weights change) arise in many settings, such as social networks, co-citations, market-basket analysis, and collaborative filtering. Our goal is to monitor (i) the centrality of an individual node (e.g., who are the most important authors?) and (ii) the proximity of two nodes or sets of nodes (e.g., who are the most important authors with respect to a particular conference?). Moreover, we want to do this efficiently and incrementally and to provide any-time answers. In this chapter we propose pTrack, which is based on random walks with restart, together with some important modifications to adapt these measures to a dynamic, evolving setting. Additionally, we develop techniques for fast, incremental updates of these measures that allow us to track them continuously, as link updates arrive. In addition, we discuss variants of our method that can handle batch updates, as well as place more emphasis on recent links. Based on proximity tracking, we further proposed cTrack, which enables us to track the centrality of the nodes over time. We demonstrate the effectiveness and efficiency of our methods on several real data sets.
AB - Large bipartite graphs which evolve and grow over time (e.g., new links arrive, old links die out, or link weights change) arise in many settings, such as social networks, co-citations, market-basket analysis, and collaborative filtering. Our goal is to monitor (i) the centrality of an individual node (e.g., who are the most important authors?) and (ii) the proximity of two nodes or sets of nodes (e.g., who are the most important authors with respect to a particular conference?). Moreover, we want to do this efficiently and incrementally and to provide any-time answers. In this chapter we propose pTrack, which is based on random walks with restart, together with some important modifications to adapt these measures to a dynamic, evolving setting. Additionally, we develop techniques for fast, incremental updates of these measures that allow us to track them continuously, as link updates arrive. In addition, we discuss variants of our method that can handle batch updates, as well as place more emphasis on recent links. Based on proximity tracking, we further proposed cTrack, which enables us to track the centrality of the nodes over time. We demonstrate the effectiveness and efficiency of our methods on several real data sets.
UR - http://www.scopus.com/inward/record.url?scp=84919871810&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84919871810&partnerID=8YFLogxK
U2 - 10.1007/978-1-4419-6515-8_8
DO - 10.1007/978-1-4419-6515-8_8
M3 - Chapter
AN - SCOPUS:84919871810
SN - 9781441965141
VL - 9781441965158
SP - 211
EP - 236
BT - Link Mining
PB - Springer New York
ER -