TY - GEN
T1 - Exploiting agent and database technologies for biological data collection
AU - Davulcu, Hasan
AU - Lacroix, Zoé
AU - Parekh, Kaushal
AU - Ramakrishnan, I. V.
AU - Julasana, Nikeeta
PY - 2004/1/1
Y1 - 2004/1/1
N2 - Web data sources constitute an important resource for biological research. A simple tool that can retrieve information from different Web sites through a single interface and store the extracted data in a standardized format for efficient future use is critical to scientific discovery. In this paper, we discuss an approach that combines agent and database technologies for biological data integration. To illustrate this, we employ two software tools: WinAgent, for building agents, and dbXML, for XML data management. WinAgent learns from its users by recording a browsing session on Web sites and successive data extraction from regions of interest on retrieved Web pages. The results are stored in an XML document and can be managed, queried, and updated using a native XML database system such as dbXML. This approach is currently being evaluated at the Brain Tumor Cancer Unit of the Translational Genomics Research Institute (TGen), Phoenix, Arizona.
AB - Web data sources constitute an important resource for biological research. A simple tool that can retrieve information from different Web sites through a single interface and store the extracted data in a standardized format for efficient future use is critical to scientific discovery. In this paper, we discuss an approach that combines agent and database technologies for biological data integration. To illustrate this, we employ two software tools: WinAgent, for building agents, and dbXML, for XML data management. WinAgent learns from its users by recording a browsing session on Web sites and successive data extraction from regions of interest on retrieved Web pages. The results are stored in an XML document and can be managed, queried, and updated using a native XML database system such as dbXML. This approach is currently being evaluated at the Brain Tumor Cancer Unit of the Translational Genomics Research Institute (TGen), Phoenix, Arizona.
UR - http://www.scopus.com/inward/record.url?scp=10044281872&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=10044281872&partnerID=8YFLogxK
U2 - 10.1109/dexa.2004.1333503
DO - 10.1109/dexa.2004.1333503
M3 - Conference contribution
AN - SCOPUS:10044281872
SN - 0769521959
SN - 9780769521954
T3 - International Conference on Database and Expert Systems Applications - DEXA
SP - 376
EP - 381
BT - Proceedings - 15th International Workshop on Database and Expert Systems Applications
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - Proceedings - 15th International Workshop on Database and Expert Systems Applications
Y2 - 30 August 2004 through 3 September 2004
ER -