Supporting the automatic construction of entity aware search engines

Lorenzo Blanco, Valter Crescenzi, Paolo Merialdo, Paolo Papotti

Research output: Chapter in Book/Report/Conference proceedingConference contribution

11 Citations (Scopus)

Abstract

Several web sites deliver a large number of pages, each publishing data about one instance of some real world entity, such as an athlete, a stock quote, a book. Although it is easy for a human reader to recognize these instances, current search engines are unaware of them. Technologies for the Semantic Web aim at achieving this goal; however, so far they have been of little help in this respect, as semantic publishing is very limited. We have developed a method to automatically search on the web for pages that publish data representing an instance of a certain conceptual entity. Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the same entity. We have implemented our method in a system prototype, which has been used to conduct several experiments that have produced interesting results.

Original languageEnglish (US)
Title of host publicationProceedings of the 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08
Pages149-156
Number of pages8
DOIs
StatePublished - 2008
Externally publishedYes
Event10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08 - Napa Valley, CA, United States
Duration: Oct 26 2008Oct 30 2008

Other

Other10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08
CountryUnited States
CityNapa Valley, CA
Period10/26/0810/30/08

Fingerprint

Search engine
World Wide Web
Prototype
Semantic web
Web sites
Experiment

Keywords

  • Entity aware search engines
  • Resource discovery
  • Web exploration

ASJC Scopus subject areas

  • Business, Management and Accounting(all)
  • Decision Sciences(all)

Cite this

Blanco, L., Crescenzi, V., Merialdo, P., & Papotti, P. (2008). Supporting the automatic construction of entity aware search engines. In Proceedings of the 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08 (pp. 149-156) https://doi.org/10.1145/1458502.1458526

Supporting the automatic construction of entity aware search engines. / Blanco, Lorenzo; Crescenzi, Valter; Merialdo, Paolo; Papotti, Paolo.

Proceedings of the 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08. 2008. p. 149-156.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Blanco, L, Crescenzi, V, Merialdo, P & Papotti, P 2008, Supporting the automatic construction of entity aware search engines. in Proceedings of the 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08. pp. 149-156, 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08, Napa Valley, CA, United States, 10/26/08. https://doi.org/10.1145/1458502.1458526
Blanco L, Crescenzi V, Merialdo P, Papotti P. Supporting the automatic construction of entity aware search engines. In Proceedings of the 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08. 2008. p. 149-156 https://doi.org/10.1145/1458502.1458526
Blanco, Lorenzo ; Crescenzi, Valter ; Merialdo, Paolo ; Papotti, Paolo. / Supporting the automatic construction of entity aware search engines. Proceedings of the 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08. 2008. pp. 149-156
@inproceedings{37a4aec422324f6a9637d401a7efa08e,
title = "Supporting the automatic construction of entity aware search engines",
abstract = "Several web sites deliver a large number of pages, each publishing data about one instance of some real world entity, such as an athlete, a stock quote, a book. Although it is easy for a human reader to recognize these instances, current search engines are unaware of them. Technologies for the Semantic Web aim at achieving this goal; however, so far they have been of little help in this respect, as semantic publishing is very limited. We have developed a method to automatically search on the web for pages that publish data representing an instance of a certain conceptual entity. Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the same entity. We have implemented our method in a system prototype, which has been used to conduct several experiments that have produced interesting results.",
keywords = "Entity aware search engines, Resource discovery, Web exploration",
author = "Lorenzo Blanco and Valter Crescenzi and Paolo Merialdo and Paolo Papotti",
year = "2008",
doi = "10.1145/1458502.1458526",
language = "English (US)",
isbn = "9781605582603",
pages = "149--156",
booktitle = "Proceedings of the 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08",

}

TY - GEN

T1 - Supporting the automatic construction of entity aware search engines

AU - Blanco, Lorenzo

AU - Crescenzi, Valter

AU - Merialdo, Paolo

AU - Papotti, Paolo

PY - 2008

Y1 - 2008

N2 - Several web sites deliver a large number of pages, each publishing data about one instance of some real world entity, such as an athlete, a stock quote, a book. Although it is easy for a human reader to recognize these instances, current search engines are unaware of them. Technologies for the Semantic Web aim at achieving this goal; however, so far they have been of little help in this respect, as semantic publishing is very limited. We have developed a method to automatically search on the web for pages that publish data representing an instance of a certain conceptual entity. Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the same entity. We have implemented our method in a system prototype, which has been used to conduct several experiments that have produced interesting results.

AB - Several web sites deliver a large number of pages, each publishing data about one instance of some real world entity, such as an athlete, a stock quote, a book. Although it is easy for a human reader to recognize these instances, current search engines are unaware of them. Technologies for the Semantic Web aim at achieving this goal; however, so far they have been of little help in this respect, as semantic publishing is very limited. We have developed a method to automatically search on the web for pages that publish data representing an instance of a certain conceptual entity. Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the same entity. We have implemented our method in a system prototype, which has been used to conduct several experiments that have produced interesting results.

KW - Entity aware search engines

KW - Resource discovery

KW - Web exploration

UR - http://www.scopus.com/inward/record.url?scp=77951136761&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77951136761&partnerID=8YFLogxK

U2 - 10.1145/1458502.1458526

DO - 10.1145/1458502.1458526

M3 - Conference contribution

AN - SCOPUS:77951136761

SN - 9781605582603

SP - 149

EP - 156

BT - Proceedings of the 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08

ER -