Enhancing natural language inference using new and expanded training data sets and new learning models

Arindam Mitra; Ishan Shrivastava; Chitta Baral

Computing and Augmented Intelligence, School of (IAFSE-SCAI)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

8 Scopus citations

Abstract

Natural Language Inference (NLI) plays an important role in many natural language processing tasks such as question answering. However, existing NLI modules that are trained on existing NLI datasets have several drawbacks. For example, they do not capture the notions of entity and role well and often make mistakes such as concluding that “Peter signed a deal” can be inferred from “John signed a deal”. As part of this work, we have developed two datasets that help mitigate such issues and make the systems better at understanding the notions of “entities” and “roles”. After training the existing models on the new dataset, we observe that they do not perform well on one of the new benchmarks. We then propose a modification to the “word-to-word” attention function, which has been uniformly reused across several popular NLI architectures. The resulting models perform as well as their unmodified counterparts on the existing benchmarks and perform significantly better on the new benchmarks that emphasize “roles” and “entities”.
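
The abstract does not spell out the proposed modification, so the following Python sketch only illustrates an unmodified baseline: a dot-product word-to-word attention of the kind commonly used in NLI architectures. The toy embeddings, the function names, and the choice of plain dot-product scoring are assumptions made for illustration, not the authors' implementation. It suggests how an attention function driven purely by embedding similarity could align “Peter” with “John”, which is the kind of entity confusion the abstract describes.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def word_to_word_attention(premise_vecs, hypothesis_vecs):
    """For each hypothesis word, return attention weights over premise words.

    Scores are plain dot products between word vectors (an assumed baseline).
    """
    weights = []
    for h in hypothesis_vecs:
        scores = [sum(hk * pk for hk, pk in zip(h, p)) for p in premise_vecs]
        weights.append(softmax(scores))
    return weights


# Hypothetical 3-dimensional embeddings, invented for illustration only.
# The two names are deliberately close, as person names often are.
TOY_EMBEDDINGS = {
    "John": [0.9, 0.1, 0.0],
    "Peter": [0.8, 0.2, 0.0],
    "signed": [0.0, 1.0, 0.1],
    "a": [0.1, 0.1, 0.1],
    "deal": [0.1, 0.0, 1.0],
}

premise = ["John", "signed", "a", "deal"]
hypothesis = ["Peter", "signed", "a", "deal"]

attn = word_to_word_attention(
    [TOY_EMBEDDINGS[w] for w in premise],
    [TOY_EMBEDDINGS[w] for w in hypothesis],
)

# Index of the premise word that "Peter" attends to most strongly.
best_for_peter = max(range(len(premise)), key=lambda i: attn[0][i])
print(hypothesis[0], "aligns most strongly with", premise[best_for_peter])
```

With these invented vectors, the hypothesis word “Peter” gives its largest attention weight to the premise word “John”, because the two name embeddings are close. Nothing in this baseline scoring distinguishes one entity from another.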

Original language	English (US)
Title of host publication	AAAI 2020 - 34th AAAI Conference on Artificial Intelligence
Publisher	AAAI press
Pages	8504-8511
Number of pages	8
ISBN (Electronic)	9781577358350
State	Published - 2020
Event	34th AAAI Conference on Artificial Intelligence, AAAI 2020 - New York, United States
Duration	Feb 7 2020 → Feb 12 2020

Publication series

Name	AAAI 2020 - 34th AAAI Conference on Artificial Intelligence

Conference

Conference	34th AAAI Conference on Artificial Intelligence, AAAI 2020
Country/Territory	United States
City	New York
Period	2/7/20 → 2/12/20

ASJC Scopus subject areas

Artificial Intelligence

Cite this

Enhancing natural language inference using new and expanded training data sets and new learning models. / Mitra, Arindam; Shrivastava, Ishan; Baral, Chitta.
AAAI 2020 - 34th AAAI Conference on Artificial Intelligence. AAAI press, 2020. p. 8504-8511 (AAAI 2020 - 34th AAAI Conference on Artificial Intelligence).

Mitra, A, Shrivastava, I & Baral, C 2020, Enhancing natural language inference using new and expanded training data sets and new learning models. in AAAI 2020 - 34th AAAI Conference on Artificial Intelligence. AAAI 2020 - 34th AAAI Conference on Artificial Intelligence, AAAI press, pp. 8504-8511, 34th AAAI Conference on Artificial Intelligence, AAAI 2020, New York, United States, 2/7/20.

@inproceedings{dc5853c7304f4f9e8517ac6fdc1314d8,

title = "Enhancing natural language inference using new and expanded training data sets and new learning models",

abstract = "Natural Language Inference (NLI) plays an important role in many natural language processing tasks such as question answering. However, existing NLI modules that are trained on existing NLI datasets have several drawbacks. For example, they do not capture the notions of entity and role well and often make mistakes such as concluding that “Peter signed a deal” can be inferred from “John signed a deal”. As part of this work, we have developed two datasets that help mitigate such issues and make the systems better at understanding the notions of “entities” and “roles”. After training the existing models on the new dataset, we observe that they do not perform well on one of the new benchmarks. We then propose a modification to the “word-to-word” attention function, which has been uniformly reused across several popular NLI architectures. The resulting models perform as well as their unmodified counterparts on the existing benchmarks and perform significantly better on the new benchmarks that emphasize “roles” and “entities”.",

author = "Arindam Mitra and Ishan Shrivastava and Chitta Baral",

note = "Funding Information: Support from DARPA, and NSF grant 1816039 is acknowledged. Publisher Copyright: Copyright {\textcopyright} 2020, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.; 34th AAAI Conference on Artificial Intelligence, AAAI 2020 ; Conference date: 07-02-2020 Through 12-02-2020",

year = "2020",

language = "English (US)",

series = "AAAI 2020 - 34th AAAI Conference on Artificial Intelligence",

publisher = "AAAI press",

pages = "8504--8511",

booktitle = "AAAI 2020 - 34th AAAI Conference on Artificial Intelligence",

}

TY - GEN

T1 - Enhancing natural language inference using new and expanded training data sets and new learning models

AU - Mitra, Arindam

AU - Shrivastava, Ishan

AU - Baral, Chitta

N1 - Funding Information: Support from DARPA, and NSF grant 1816039 is acknowledged. Publisher Copyright: Copyright © 2020, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

PY - 2020

Y1 - 2020

AB - Natural Language Inference (NLI) plays an important role in many natural language processing tasks such as question answering. However, existing NLI modules that are trained on existing NLI datasets have several drawbacks. For example, they do not capture the notions of entity and role well and often make mistakes such as concluding that “Peter signed a deal” can be inferred from “John signed a deal”. As part of this work, we have developed two datasets that help mitigate such issues and make the systems better at understanding the notions of “entities” and “roles”. After training the existing models on the new dataset, we observe that they do not perform well on one of the new benchmarks. We then propose a modification to the “word-to-word” attention function, which has been uniformly reused across several popular NLI architectures. The resulting models perform as well as their unmodified counterparts on the existing benchmarks and perform significantly better on the new benchmarks that emphasize “roles” and “entities”.

UR - http://www.scopus.com/inward/record.url?scp=85103077616&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85103077616&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85103077616

T3 - AAAI 2020 - 34th AAAI Conference on Artificial Intelligence

SP - 8504

EP - 8511

BT - AAAI 2020 - 34th AAAI Conference on Artificial Intelligence

PB - AAAI press

T2 - 34th AAAI Conference on Artificial Intelligence, AAAI 2020

Y2 - 7 February 2020 through 12 February 2020

ER -
