Abstract
In this paper we describe the approach used by the Arizona State University BioAI group for the ad-hoc retrieval task of the TREC Genomics Track 2005. We pre-process each TREC query expression by adding synonyms of genes, diseases, bio-processes, and functions of organs, and by selectively adding stemmed verbs, nouns, and MeSH heading categories. The pre-processed queries are used to perform an initial search on the TREC Genomics collection of MEDLINE abstracts with Apache Lucene, producing a set of target abstracts. Tagging, anaphora resolution, and fact extraction are then performed on the target abstracts to refine the search results in terms of relevance. Finally, we rank the target abstracts according to the extracted facts, the distance between terms, and the terms that appear in the query.
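The query pre-processing step can be illustrated with a minimal sketch. It is not the authors' implementation: the synonym table, the function names `quote` and `expand_query`, and the choice to OR synonyms within a concept and AND across concepts are all assumptions made for illustration. The output uses the standard Lucene query-parser syntax (quoted phrases, `AND`, `OR`, parentheses).

```python
# Hypothetical synonym table for illustration only; the abstract does not
# say which lexical resources the authors actually used.
SYNONYMS = {
    "apc": ["adenomatous polyposis coli"],
    "colon cancer": ["colorectal cancer", "colonic neoplasms"],
}


def quote(term: str) -> str:
    """Wrap multi-word terms in quotes so they are parsed as phrases."""
    return f'"{term}"' if " " in term else term


def expand_query(concepts: list[str], synonyms: dict[str, list[str]]) -> str:
    """OR each concept with its synonyms, then AND the concept groups.

    This grouping is an assumed strategy, not one stated in the abstract.
    """
    groups = []
    for concept in concepts:
        variants = [concept] + synonyms.get(concept.lower(), [])
        groups.append("(" + " OR ".join(quote(v) for v in variants) + ")")
    return " AND ".join(groups)


if __name__ == "__main__":
    print(expand_query(["APC", "colon cancer"], SYNONYMS))
```

A string built this way could be passed to a Lucene query parser for the initial search; the later tagging, fact extraction, and ranking stages are outside the scope of this sketch.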
Original language | English (US) |
---|---|
Title of host publication | NIST Special Publication |
State | Published - 2005 |
Event | 14th Text REtrieval Conference, TREC 2005 - Gaithersburg, MD, United States. Duration: Nov 15 2005 → Nov 18 2005 |
ASJC Scopus subject areas
- Engineering(all)