TY - GEN
T1 - Supporting the automatic construction of entity aware search engines
AU - Blanco, Lorenzo
AU - Crescenzi, Valter
AU - Merialdo, Paolo
AU - Papotti, Paolo
N1 - Copyright 2010 Elsevier B.V., All rights reserved.
PY - 2008
Y1 - 2008
N2 - Several web sites deliver a large number of pages, each publishing data about one instance of some real world entity, such as an athlete, a stock quote, or a book. Although it is easy for a human reader to recognize these instances, current search engines are unaware of them. Technologies for the Semantic Web aim at achieving this goal; however, so far they have been of little help in this respect, as semantic publishing is very limited. We have developed a method to automatically search the web for pages that publish data representing an instance of a certain conceptual entity. Our method takes as input a small set of sample pages: it automatically infers a description of the underlying conceptual entity and then searches the web for other pages containing data representing the same entity. We have implemented our method in a system prototype, which has been used to conduct several experiments that have produced interesting results.
KW - Entity aware search engines
KW - Resource discovery
KW - Web exploration
UR - http://www.scopus.com/inward/record.url?scp=77951136761&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77951136761&partnerID=8YFLogxK
U2 - 10.1145/1458502.1458526
DO - 10.1145/1458502.1458526
M3 - Conference contribution
AN - SCOPUS:77951136761
SN - 9781605582603
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 149
EP - 156
BT - Proceedings of the 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08
T2 - 10th ACM Workshop on Web Information and Data Management, WIDM '08, Co-located with the ACM 17th Conference on Information and Knowledge Management, CIKM '08
Y2 - 26 October 2008 through 30 October 2008
ER -