Extracting relevant snippets for web navigation

Qing Li, Kasim Candan, Qi Yan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

Search engines present fix-length passages from documents ranked by relevance against the query. In this paper, we present and compare novel, language-model based methods for extracting variable length document snippets by real-time processing of documents using the query issued by the user. With this extra level of information, the returned snippets are considerably more informative. Unlike previous work on passage retrieval which relies on searching relevant segments for filtering of preoccupied passages, we focus on query-informed segmentation to extract context-aware relevant snippets with variable length. In particular, we show that, when informed through an appropriate relevance language model, curvature analysis and Hidden Markov model (HMM) based content segmentation techniques can facilitate to extract relevant document snippets.

Original languageEnglish (US)
Title of host publicationProceedings of the National Conference on Artificial Intelligence
Pages1195-1200
Number of pages6
Volume2
StatePublished - 2008
Event23rd AAAI Conference on Artificial Intelligence and the 20th Innovative Applications of Artificial Intelligence Conference, AAAI-08/IAAI-08 - Chicago, IL, United States
Duration: Jul 13 2008Jul 17 2008

Other

Other23rd AAAI Conference on Artificial Intelligence and the 20th Innovative Applications of Artificial Intelligence Conference, AAAI-08/IAAI-08
CountryUnited States
CityChicago, IL
Period7/13/087/17/08

Fingerprint

World Wide Web
Navigation
Hidden Markov models
Search engines
Processing

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence

Cite this

Li, Q., Candan, K., & Yan, Q. (2008). Extracting relevant snippets for web navigation. In Proceedings of the National Conference on Artificial Intelligence (Vol. 2, pp. 1195-1200)

Extracting relevant snippets for web navigation. / Li, Qing; Candan, Kasim; Yan, Qi.

Proceedings of the National Conference on Artificial Intelligence. Vol. 2 2008. p. 1195-1200.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Li, Q, Candan, K & Yan, Q 2008, Extracting relevant snippets for web navigation. in Proceedings of the National Conference on Artificial Intelligence. vol. 2, pp. 1195-1200, 23rd AAAI Conference on Artificial Intelligence and the 20th Innovative Applications of Artificial Intelligence Conference, AAAI-08/IAAI-08, Chicago, IL, United States, 7/13/08.
Li Q, Candan K, Yan Q. Extracting relevant snippets for web navigation. In Proceedings of the National Conference on Artificial Intelligence. Vol. 2. 2008. p. 1195-1200
Li, Qing ; Candan, Kasim ; Yan, Qi. / Extracting relevant snippets for web navigation. Proceedings of the National Conference on Artificial Intelligence. Vol. 2 2008. pp. 1195-1200
@inproceedings{3e8873b789bd4cb8ba10c653980d46af,
title = "Extracting relevant snippets for web navigation",
abstract = "Search engines present fix-length passages from documents ranked by relevance against the query. In this paper, we present and compare novel, language-model based methods for extracting variable length document snippets by real-time processing of documents using the query issued by the user. With this extra level of information, the returned snippets are considerably more informative. Unlike previous work on passage retrieval which relies on searching relevant segments for filtering of preoccupied passages, we focus on query-informed segmentation to extract context-aware relevant snippets with variable length. In particular, we show that, when informed through an appropriate relevance language model, curvature analysis and Hidden Markov model (HMM) based content segmentation techniques can facilitate to extract relevant document snippets.",
author = "Qing Li and Kasim Candan and Qi Yan",
year = "2008",
language = "English (US)",
isbn = "9781577353683",
volume = "2",
pages = "1195--1200",
booktitle = "Proceedings of the National Conference on Artificial Intelligence",

}

TY - GEN

T1 - Extracting relevant snippets for web navigation

AU - Li, Qing

AU - Candan, Kasim

AU - Yan, Qi

PY - 2008

Y1 - 2008

N2 - Search engines present fix-length passages from documents ranked by relevance against the query. In this paper, we present and compare novel, language-model based methods for extracting variable length document snippets by real-time processing of documents using the query issued by the user. With this extra level of information, the returned snippets are considerably more informative. Unlike previous work on passage retrieval which relies on searching relevant segments for filtering of preoccupied passages, we focus on query-informed segmentation to extract context-aware relevant snippets with variable length. In particular, we show that, when informed through an appropriate relevance language model, curvature analysis and Hidden Markov model (HMM) based content segmentation techniques can facilitate to extract relevant document snippets.

AB - Search engines present fix-length passages from documents ranked by relevance against the query. In this paper, we present and compare novel, language-model based methods for extracting variable length document snippets by real-time processing of documents using the query issued by the user. With this extra level of information, the returned snippets are considerably more informative. Unlike previous work on passage retrieval which relies on searching relevant segments for filtering of preoccupied passages, we focus on query-informed segmentation to extract context-aware relevant snippets with variable length. In particular, we show that, when informed through an appropriate relevance language model, curvature analysis and Hidden Markov model (HMM) based content segmentation techniques can facilitate to extract relevant document snippets.

UR - http://www.scopus.com/inward/record.url?scp=57749189559&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=57749189559&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:57749189559

SN - 9781577353683

VL - 2

SP - 1195

EP - 1200

BT - Proceedings of the National Conference on Artificial Intelligence

ER -