IntEx: A syntactic role driven protein-protein interaction extractor for bio-medical text

Syed Toufeeq Ahmed, Deepthi Chidambaram, Hasan Davulcu, Chitta Baral

Research output: Contribution to conference › Paper › peer-review

29 Scopus citations

Abstract

In this paper, we present a fully automated extraction system, named IntEx, to identify gene and protein interactions in biomedical text. Our approach first splits complex sentences into simple clausal structures made up of syntactic roles, then tags biological entities with the help of biomedical and linguistic ontologies, and finally extracts complete interactions by analyzing the matching contents of syntactic roles and their linguistically significant combinations. Our extraction system handles complex sentences and extracts multiple and nested interactions specified in a sentence. Experimental evaluations against two other state-of-the-art extraction systems indicate that IntEx achieves better performance without requiring labor-intensive pattern engineering.
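The three stages named in the abstract (clause splitting, entity tagging, role-based extraction) can be illustrated with a toy sketch. This is not the IntEx implementation: the lexicons, the protein names ProtA/ProtB/ProtC, the regex-based clause splitter, and all function names are hypothetical stand-ins for the parser and the biomedical and linguistic ontologies the abstract refers to.

```python
import re

# Hypothetical toy lexicons (assumption): stand-ins for the ontologies
# mentioned in the abstract. The protein names are placeholders.
PROTEINS = {"PROTA", "PROTB", "PROTC"}
INTERACTION_VERBS = {"binds", "phosphorylates", "activates", "inhibits"}


def split_clauses(sentence):
    """Naively split a sentence into simple clauses on 'and' / 'which'."""
    parts = re.split(r",?\s+(?:and|which)\s+", sentence.rstrip("."))
    return [p.strip() for p in parts if p.strip()]


def extract_roles(clause):
    """Return (subject, verb, object) around the first interaction verb, or None."""
    tokens = clause.split()
    for i, tok in enumerate(tokens):
        if tok.lower() in INTERACTION_VERBS:
            return " ".join(tokens[:i]), tok.lower(), " ".join(tokens[i + 1:])
    return None


def tag_entities(text):
    """Return the tokens of `text` that appear in the toy protein lexicon."""
    return [t for t in re.findall(r"[A-Za-z0-9]+", text) if t.upper() in PROTEINS]


def extract_interactions(sentence):
    """Combine the three stages into (agent, verb, target) triples."""
    interactions = []
    last_subject = ""
    for clause in split_clauses(sentence):
        roles = extract_roles(clause)
        if roles is None:
            continue
        subject, verb, obj = roles
        # A clause with an elided subject reuses the previous clause's subject.
        subject = subject or last_subject
        last_subject = subject
        for agent in tag_entities(subject):
            for target in tag_entities(obj):
                interactions.append((agent, verb, target))
    return interactions


if __name__ == "__main__":
    print(extract_interactions("ProtA phosphorylates ProtB and activates ProtC."))
```

In this sketch a single sentence yields two interactions because the second clause inherits its subject from the first, which is a simplified picture of extracting multiple interactions from one sentence. A real system would rely on a syntactic parser and curated ontologies instead of regular expressions and word lists.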

Original language: English (US)
Pages: 54-61
Number of pages: 8
State: Published - 2005
Event: 2005 ACL-ISMB Workshop on Linking Biological Literature, Ontologies and Databases: Mining Biological Semantics, ACL-ISMB 2005 - Detroit, United States
Duration: Jun 24 2005 → …

Conference

Conference: 2005 ACL-ISMB Workshop on Linking Biological Literature, Ontologies and Databases: Mining Biological Semantics, ACL-ISMB 2005
Country/Territory: United States
City: Detroit
Period: 6/24/05 → …

ASJC Scopus subject areas

  • General Biochemistry, Genetics and Molecular Biology
  • Artificial Intelligence
  • Information Systems
