Diversified ranking on large graphs: An optimization viewpoint

Hanghang Tong, Jingrui He, Zhen Wen, Ravi Konuru, Ching Yung Lin

Research output: Chapter in Book/Report/Conference proceedingConference contribution

58 Scopus citations

Abstract

Diversified ranking on graphs is a fundamental mining task and has a variety of high-impact applications. There are two important open questions here. The first challenge is the measure - how to quantify the goodness of a given top-k ranking list that captures both the relevance and the diversity? The second challenge lies in the algorithmic aspect - how to find an optimal, or near-optimal, top-k ranking list that maximizes the measure we defined in a scalable way? In this paper, we address these challenges from an optimization point of view. Firstly, we propose a goodness measure for a given top-k ranking list. The proposed goodness measure intuitively captures both (a) the relevance between each individual node in the ranking list and the query; and (b) the diversity among different nodes in the ranking list. Moreover, we propose a scalable algorithm (linear wrt the size of the graph) that generates a provably near-optimal solution. The experimental evaluations on real graphs demonstrate its effectiveness and efficiency.

Original languageEnglish (US)
Title of host publicationProceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD'11
PublisherAssociation for Computing Machinery
Pages1028-1036
Number of pages9
ISBN (Print)9781450308137
DOIs
StatePublished - 2011
Externally publishedYes
Event17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011 - San Diego, United States
Duration: Aug 21 2011Aug 24 2011

Publication series

NameProceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Conference

Conference17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2011
Country/TerritoryUnited States
CitySan Diego
Period8/21/118/24/11

Keywords

  • Diversity
  • Graph mining
  • Ranking
  • Scalability

ASJC Scopus subject areas

  • Software
  • Information Systems

Cite this