Abstract

Unpacking causal relationships is essential for developing solutions for managing climate risks that threaten sociopolitical stability. However, the automatic discovery of complex causal chains among interlinked events and their participating actors within large corpora is not well studied. Previous studies on extracting causal relationships from text were based on laborious and incomplete hand developed lists of causal verbs, such as 'causes' and 'results in'. Such approaches result in limited recall because standard causal verbs may not generalize well to accommodate surface variations in texts when different keywords and phrases are used to express similar causal effects. This paper presents a Snowball system to generalize <Subject, Verb, Object> triplets extracted from corpora of online news articles, and cluster them into higher-level concepts without drift. We start with a seed set of causal verbs and apply a concept generalization technique to extract causal chains and their participating actors. Our novel algorithms overcome surface variations in written expressions of causal relationships and discover the domino effects between climate events and human security. Unlike prior studies, our semi-supervised approach alleviates the need for labor intensive keyword list development and annotated datasets. Experimental evaluations by domain experts achieve an average precision of 82%, a significant improvement from prior work. Qualitative assessments of causal chains show that results are consistent with the 2014 IPCC report illuminating causal mechanisms underlying the linkages between climatic stresses and social instability.

Original languageEnglish (US)
Title of host publicationProceedings - 2018 1st International Conference on Data Intelligence and Security, ICDIS 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages234-241
Number of pages8
ISBN (Electronic)9781538657621
DOIs
StatePublished - May 25 2018
Event1st International Conference on Data Intelligence and Security, ICDIS 2018 - South Padre Island, United States
Duration: Apr 8 2018Apr 10 2018

Other

Other1st International Conference on Data Intelligence and Security, ICDIS 2018
CountryUnited States
CitySouth Padre Island
Period4/8/184/10/18

Fingerprint

Climate change
Seed
Personnel

Keywords

  • Causal Chains
  • Causal Relations
  • Climate Change
  • Information Extraction
  • Natural Language Processing
  • Text Mining

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Safety, Risk, Reliability and Quality

Cite this

Alashri, S., Tsai, J. Y., Koppela, A. R., & Davulcu, H. (2018). Snowball: Extracting causal chains from climate change text corpora. In Proceedings - 2018 1st International Conference on Data Intelligence and Security, ICDIS 2018 (pp. 234-241). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICDIS.2018.00045

Snowball : Extracting causal chains from climate change text corpora. / Alashri, Saud; Tsai, Jiun Yi; Koppela, Anvesh Reddy; Davulcu, Hasan.

Proceedings - 2018 1st International Conference on Data Intelligence and Security, ICDIS 2018. Institute of Electrical and Electronics Engineers Inc., 2018. p. 234-241.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Alashri, S, Tsai, JY, Koppela, AR & Davulcu, H 2018, Snowball: Extracting causal chains from climate change text corpora. in Proceedings - 2018 1st International Conference on Data Intelligence and Security, ICDIS 2018. Institute of Electrical and Electronics Engineers Inc., pp. 234-241, 1st International Conference on Data Intelligence and Security, ICDIS 2018, South Padre Island, United States, 4/8/18. https://doi.org/10.1109/ICDIS.2018.00045
Alashri S, Tsai JY, Koppela AR, Davulcu H. Snowball: Extracting causal chains from climate change text corpora. In Proceedings - 2018 1st International Conference on Data Intelligence and Security, ICDIS 2018. Institute of Electrical and Electronics Engineers Inc. 2018. p. 234-241 https://doi.org/10.1109/ICDIS.2018.00045
Alashri, Saud ; Tsai, Jiun Yi ; Koppela, Anvesh Reddy ; Davulcu, Hasan. / Snowball : Extracting causal chains from climate change text corpora. Proceedings - 2018 1st International Conference on Data Intelligence and Security, ICDIS 2018. Institute of Electrical and Electronics Engineers Inc., 2018. pp. 234-241
@inproceedings{d378995e250b47c78c77fc0d433a9d11,
title = "Snowball: Extracting causal chains from climate change text corpora",
abstract = "Unpacking causal relationships is essential for developing solutions for managing climate risks that threaten sociopolitical stability. However, the automatic discovery of complex causal chains among interlinked events and their participating actors within large corpora is not well studied. Previous studies on extracting causal relationships from text were based on laborious and incomplete hand developed lists of causal verbs, such as 'causes' and 'results in'. Such approaches result in limited recall because standard causal verbs may not generalize well to accommodate surface variations in texts when different keywords and phrases are used to express similar causal effects. This paper presents a Snowball system to generalize <Subject, Verb, Object> triplets extracted from corpora of online news articles, and cluster them into higher-level concepts without drift. We start with a seed set of causal verbs and apply a concept generalization technique to extract causal chains and their participating actors. Our novel algorithms overcome surface variations in written expressions of causal relationships and discover the domino effects between climate events and human security. Unlike prior studies, our semi-supervised approach alleviates the need for labor intensive keyword list development and annotated datasets. Experimental evaluations by domain experts achieve an average precision of 82{\%}, a significant improvement from prior work. Qualitative assessments of causal chains show that results are consistent with the 2014 IPCC report illuminating causal mechanisms underlying the linkages between climatic stresses and social instability.",
keywords = "Causal Chains, Causal Relations, Climate Change, Information Extraction, Natural Language Processing, Text Mining",
author = "Saud Alashri and Tsai, {Jiun Yi} and Koppela, {Anvesh Reddy} and Hasan Davulcu",
year = "2018",
month = "5",
day = "25",
doi = "10.1109/ICDIS.2018.00045",
language = "English (US)",
pages = "234--241",
booktitle = "Proceedings - 2018 1st International Conference on Data Intelligence and Security, ICDIS 2018",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Snowball

T2 - Extracting causal chains from climate change text corpora

AU - Alashri, Saud

AU - Tsai, Jiun Yi

AU - Koppela, Anvesh Reddy

AU - Davulcu, Hasan

PY - 2018/5/25

Y1 - 2018/5/25

N2 - Unpacking causal relationships is essential for developing solutions for managing climate risks that threaten sociopolitical stability. However, the automatic discovery of complex causal chains among interlinked events and their participating actors within large corpora is not well studied. Previous studies on extracting causal relationships from text were based on laborious and incomplete hand developed lists of causal verbs, such as 'causes' and 'results in'. Such approaches result in limited recall because standard causal verbs may not generalize well to accommodate surface variations in texts when different keywords and phrases are used to express similar causal effects. This paper presents a Snowball system to generalize <Subject, Verb, Object> triplets extracted from corpora of online news articles, and cluster them into higher-level concepts without drift. We start with a seed set of causal verbs and apply a concept generalization technique to extract causal chains and their participating actors. Our novel algorithms overcome surface variations in written expressions of causal relationships and discover the domino effects between climate events and human security. Unlike prior studies, our semi-supervised approach alleviates the need for labor intensive keyword list development and annotated datasets. Experimental evaluations by domain experts achieve an average precision of 82%, a significant improvement from prior work. Qualitative assessments of causal chains show that results are consistent with the 2014 IPCC report illuminating causal mechanisms underlying the linkages between climatic stresses and social instability.

AB - Unpacking causal relationships is essential for developing solutions for managing climate risks that threaten sociopolitical stability. However, the automatic discovery of complex causal chains among interlinked events and their participating actors within large corpora is not well studied. Previous studies on extracting causal relationships from text were based on laborious and incomplete hand developed lists of causal verbs, such as 'causes' and 'results in'. Such approaches result in limited recall because standard causal verbs may not generalize well to accommodate surface variations in texts when different keywords and phrases are used to express similar causal effects. This paper presents a Snowball system to generalize <Subject, Verb, Object> triplets extracted from corpora of online news articles, and cluster them into higher-level concepts without drift. We start with a seed set of causal verbs and apply a concept generalization technique to extract causal chains and their participating actors. Our novel algorithms overcome surface variations in written expressions of causal relationships and discover the domino effects between climate events and human security. Unlike prior studies, our semi-supervised approach alleviates the need for labor intensive keyword list development and annotated datasets. Experimental evaluations by domain experts achieve an average precision of 82%, a significant improvement from prior work. Qualitative assessments of causal chains show that results are consistent with the 2014 IPCC report illuminating causal mechanisms underlying the linkages between climatic stresses and social instability.

KW - Causal Chains

KW - Causal Relations

KW - Climate Change

KW - Information Extraction

KW - Natural Language Processing

KW - Text Mining

UR - http://www.scopus.com/inward/record.url?scp=85048523770&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85048523770&partnerID=8YFLogxK

U2 - 10.1109/ICDIS.2018.00045

DO - 10.1109/ICDIS.2018.00045

M3 - Conference contribution

SP - 234

EP - 241

BT - Proceedings - 2018 1st International Conference on Data Intelligence and Security, ICDIS 2018

PB - Institute of Electrical and Electronics Engineers Inc.

ER -