Leveraging Contextual Information in Extracting Long Distance Relations from Clinical Notes

Hong Guan; Murthy Devarakonda

Leveraging Contextual Information in Extracting Long Distance Relations from Clinical Notes

Hong Guan, Murthy Devarakonda

Health Solutions, College of (CHS)

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Relation extraction from biomedical text is important for clinical decision support applications. In post-marketing pharmacovigilance, for example, Adverse Drug Events (ADE) relate medical problems to the drugs that caused them and were the focus of two recent shared challenges. While good results were reported, there was a room for improvement. Here, we studied two new improved methods for relation extraction: (1) State-of-the-art deep learning contextual representation model called BERT, Bidirectional Encoder Representations from Transformers; (2) Selection of negative training samples based on the "near-miss" hypothesis (the Edge sampling). We used the datasets from MADE and N2C2 Task-2 for performance evaluation. BERT and Edge together improved performance of ADE and Reason (indication) relations extraction by 6.4-6.7 absolute percentage (and error rate reduction of 24%-28%). ADE and Reason relations contained longer text between the entities, which BERT and Edge were able to leverage to achieve the performance improvement. While the performance improvement for medication attribute relations was smaller in absolute percentages, error rate reduction was still considerable.

Original language	English (US)
Pages (from-to)	1051-1060
Number of pages	10
Journal	AMIA ... Annual Symposium proceedings. AMIA Symposium
Volume	2019
State	Published - 2019

ASJC Scopus subject areas

General Medicine

Cite this

@article{00de027d0e564ffd80420322330dc557,

title = "Leveraging Contextual Information in Extracting Long Distance Relations from Clinical Notes",

abstract = "Relation extraction from biomedical text is important for clinical decision support applications. In post-marketing pharmacovigilance, for example, Adverse Drug Events (ADE) relate medical problems to the drugs that caused them and were the focus of two recent shared challenges. While good results were reported, there was a room for improvement. Here, we studied two new improved methods for relation extraction: (1) State-of-the-art deep learning contextual representation model called BERT, Bidirectional Encoder Representations from Transformers; (2) Selection of negative training samples based on the {"}near-miss{"} hypothesis (the Edge sampling). We used the datasets from MADE and N2C2 Task-2 for performance evaluation. BERT and Edge together improved performance of ADE and Reason (indication) relations extraction by 6.4-6.7 absolute percentage (and error rate reduction of 24%-28%). ADE and Reason relations contained longer text between the entities, which BERT and Edge were able to leverage to achieve the performance improvement. While the performance improvement for medication attribute relations was smaller in absolute percentages, error rate reduction was still considerable.",

author = "Hong Guan and Murthy Devarakonda",

year = "2019",

language = "English (US)",

volume = "2019",

pages = "1051--1060",

journal = "AMIA ... Annual Symposium proceedings. AMIA Symposium",

issn = "1559-4076",

publisher = "American Medical Informatics Association",

}

TY - JOUR

T1 - Leveraging Contextual Information in Extracting Long Distance Relations from Clinical Notes

AU - Guan, Hong

AU - Devarakonda, Murthy

PY - 2019

Y1 - 2019

N2 - Relation extraction from biomedical text is important for clinical decision support applications. In post-marketing pharmacovigilance, for example, Adverse Drug Events (ADE) relate medical problems to the drugs that caused them and were the focus of two recent shared challenges. While good results were reported, there was a room for improvement. Here, we studied two new improved methods for relation extraction: (1) State-of-the-art deep learning contextual representation model called BERT, Bidirectional Encoder Representations from Transformers; (2) Selection of negative training samples based on the "near-miss" hypothesis (the Edge sampling). We used the datasets from MADE and N2C2 Task-2 for performance evaluation. BERT and Edge together improved performance of ADE and Reason (indication) relations extraction by 6.4-6.7 absolute percentage (and error rate reduction of 24%-28%). ADE and Reason relations contained longer text between the entities, which BERT and Edge were able to leverage to achieve the performance improvement. While the performance improvement for medication attribute relations was smaller in absolute percentages, error rate reduction was still considerable.

AB - Relation extraction from biomedical text is important for clinical decision support applications. In post-marketing pharmacovigilance, for example, Adverse Drug Events (ADE) relate medical problems to the drugs that caused them and were the focus of two recent shared challenges. While good results were reported, there was a room for improvement. Here, we studied two new improved methods for relation extraction: (1) State-of-the-art deep learning contextual representation model called BERT, Bidirectional Encoder Representations from Transformers; (2) Selection of negative training samples based on the "near-miss" hypothesis (the Edge sampling). We used the datasets from MADE and N2C2 Task-2 for performance evaluation. BERT and Edge together improved performance of ADE and Reason (indication) relations extraction by 6.4-6.7 absolute percentage (and error rate reduction of 24%-28%). ADE and Reason relations contained longer text between the entities, which BERT and Edge were able to leverage to achieve the performance improvement. While the performance improvement for medication attribute relations was smaller in absolute percentages, error rate reduction was still considerable.

UR - http://www.scopus.com/inward/record.url?scp=85083755272&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85083755272&partnerID=8YFLogxK

M3 - Article

C2 - 32308902

AN - SCOPUS:85083755272

SN - 1559-4076

VL - 2019

SP - 1051

EP - 1060

JO - AMIA ... Annual Symposium proceedings. AMIA Symposium

JF - AMIA ... Annual Symposium proceedings. AMIA Symposium

ER -

Leveraging Contextual Information in Extracting Long Distance Relations from Clinical Notes

Abstract

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this