Scuba diver: Subspace clustering of web search results

Fatih Gelgi, Srinivas Vadrevu, Hasan Davulcu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Current search engines present their search results as a ranked list of Web pages. However, as the number of pages on the Web increases exponentially, so does the number of search results for any given query. We present a novel subspace clustering based algorithm to organize keyword search results by simultaneously clustering and identifying distinguishing terms for each cluster. Our system, named Scuba Diver, enables users to better interpret the coverage of millions of search results and to refine their search queries through a keyword guided interface. We present experimental results illustrating the effectiveness of our algorithm by measuring purity, entropy and F-measure of generated clusters based on Open Directory Project (ODP).

Original languageEnglish (US)
Title of host publicationWebist 2007 - 3rd International Conference on Web Information Systems and Technologies, Proceedings
Pages334-339
Number of pages6
VolumeWIA
StatePublished - 2007
Event3rd International Conference on Web Information Systems and Technologies, Webist 2007 - Barcelona, Spain
Duration: Mar 3 2007Mar 6 2007

Other

Other3rd International Conference on Web Information Systems and Technologies, Webist 2007
CountrySpain
CityBarcelona
Period3/3/073/6/07

Keywords

  • Guided search
  • Subspace clustering
  • Web search

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Hardware and Architecture
  • Information Systems
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Scuba diver: Subspace clustering of web search results'. Together they form a unique fingerprint.

Cite this