TY - GEN
T1 - Leveraging structural knowledge for hierarchically-informed keyword weight propagation in the web
AU - Kim, Jong Wook
AU - Candan, Kasim
PY - 2007
Y1 - 2007
N2 - Although web navigation hierarchies, such as Yahoo.com and the Open Directory Project, enable effective browsing, their individual nodes cannot be indexed for search independently. This is because the contents of the individual nodes in a hierarchy are related to the contents of their neighbors, ancestors, and descendants in the structure. In this paper, we show that significant improvements in precision can be obtained by leveraging knowledge about the structure of hierarchical web content. In particular, we propose a novel keyword weight propagation technique to properly enrich the data nodes in web hierarchies. Our approach relies on leveraging the context provided by neighbor entries in a given structure. We leverage this information for developing relative-content preserving keyword propagation schemes. We compare the results obtained through the proposed hierarchically-informed keyword weight (pre-) propagation schemes to existing state-of-the-art score and keyword propagation techniques and show that our approach significantly improves precision.
AB - Although web navigation hierarchies, such as Yahoo.com and the Open Directory Project, enable effective browsing, their individual nodes cannot be indexed for search independently. This is because the contents of the individual nodes in a hierarchy are related to the contents of their neighbors, ancestors, and descendants in the structure. In this paper, we show that significant improvements in precision can be obtained by leveraging knowledge about the structure of hierarchical web content. In particular, we propose a novel keyword weight propagation technique to properly enrich the data nodes in web hierarchies. Our approach relies on leveraging the context provided by neighbor entries in a given structure. We leverage this information for developing relative-content preserving keyword propagation schemes. We compare the results obtained through the proposed hierarchically-informed keyword weight (pre-) propagation schemes to existing state-of-the-art score and keyword propagation techniques and show that our approach significantly improves precision.
UR - http://www.scopus.com/inward/record.url?scp=38549091013&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38549091013&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-77485-3_5
DO - 10.1007/978-3-540-77485-3_5
M3 - Conference contribution
AN - SCOPUS:38549091013
SN - 354077484X
SN - 9783540774846
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 72
EP - 91
BT - Advances in Web Mining and Web Usage Analysis - 8th International Workshop on Knowledge Discovery on the Web, WebKDD 2006, Revised Papers
PB - Springer Verlag
T2 - 8th International Workshop on Knowledge Discovery on the Web, WebKDD 2006
Y2 - 20 August 2006 through 20 August 2006
ER -