Supporting the automatic construction of entity aware search engines

Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)

11 Scopus citations

Abstract

Several web sites deliver a large number of pages, each publishing data about one instance of some real world entity, such as an athlete, a stock quote, or a book. Although it is easy for a human reader to recognize these instances, current search engines are unaware of them. Technologies for the Semantic Web aim at achieving this goal; however, so far they have been of little help in this respect, as semantic publishing is very limited. We have developed a method to automatically search the web for pages that publish data representing an instance of a certain conceptual entity. Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the same entity. We have implemented our method in a system prototype, which has been used to conduct several experiments that have produced interesting results.

Original language: English (US)
Title of host publication: Proceedings of the 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08
Pages: 149-156
Number of pages: 8
State: Published - 2008
Externally published: Yes
Event: 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08 - Napa Valley, CA, United States
Duration: Oct 26 2008 to Oct 30 2008

Publication series

Name: International Conference on Information and Knowledge Management, Proceedings

Other

Other: 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08
Country/Territory: United States
City: Napa Valley, CA
Period: 10/26/08 to 10/30/08

Keywords

  • Entity aware search engines
  • Resource discovery
  • Web exploration

ASJC Scopus subject areas

  • General Decision Sciences
  • General Business, Management and Accounting
