Evaluation Methods and Measures for Causal Learning Algorithms

Lu Cheng; Ruocheng Guo; Raha Moraffah; Paras Sheth; K. Selcuk Candan; Huan Liu

doi:10.1109/TAI.2022.3150264

Evaluation Methods and Measures for Causal Learning Algorithms

Lu Cheng, Ruocheng Guo, Raha Moraffah, Paras Sheth, K. Selcuk Candan, Huan Liu

Research output: Contribution to journal › Article › peer-review

14 Scopus citations

Abstract

The convenient access to copious multifaceted data has encouraged machine learning researchers to reconsider correlation-based learning and embrace the opportunity of causality-based learning, i.e., causal machine learning (causal learning). Recent years have, therefore, witnessed great effort in developing causal learning algorithms aiming to help artificial intelligence (AI) achieve human-level intelligence. Due to the lack of ground-truth data, one of the biggest challenges in current causal learning research is algorithm evaluations. This largely impedes the cross-pollination of AI and causal inference and hinders the two fields to benefit from the advances of the other. To bridge from conventional causal inference (i.e., based on statistical methods) to causal learning with Big Data (i.e., the intersection of causal inference and machine learning), in this survey, we review commonly used datasets, evaluation methods, and measures for causal learning using an evaluation pipeline similar to conventional machine learning. We focus on the two fundamental causal inference tasks and causality-aware machine learning tasks. Limitations of current evaluation procedures are also discussed. We, then, examine popular causal inference tools/packages and conclude with primary challenges and opportunities for benchmarking causal learning algorithms in the era of Big Data. The survey seeks to bring to the forefront the urgency of developing publicly available benchmarks and consensus-building standards for causal learning evaluation with observational data. In doing so, we hope to broaden the discussions and facilitate collaboration to advance the innovation and application of causal learning.

Original language	English (US)
Pages (from-to)	924-943
Number of pages	20
Journal	IEEE Transactions on Artificial Intelligence
Volume	3
Issue number	6
DOIs	https://doi.org/10.1109/TAI.2022.3150264
State	Published - Dec 1 2022

Keywords

Benchmarking
Big Data
causal inference
causal learning
evaluation

ASJC Scopus subject areas

Artificial Intelligence
Computer Science Applications

Access to Document

10.1109/TAI.2022.3150264

Cite this

@article{9d1b5f758f1048cf939a4be8253f3916,

title = "Evaluation Methods and Measures for Causal Learning Algorithms",

abstract = "The convenient access to copious multifaceted data has encouraged machine learning researchers to reconsider correlation-based learning and embrace the opportunity of causality-based learning, i.e., causal machine learning (causal learning). Recent years have, therefore, witnessed great effort in developing causal learning algorithms aiming to help artificial intelligence (AI) achieve human-level intelligence. Due to the lack of ground-truth data, one of the biggest challenges in current causal learning research is algorithm evaluations. This largely impedes the cross-pollination of AI and causal inference and hinders the two fields to benefit from the advances of the other. To bridge from conventional causal inference (i.e., based on statistical methods) to causal learning with Big Data (i.e., the intersection of causal inference and machine learning), in this survey, we review commonly used datasets, evaluation methods, and measures for causal learning using an evaluation pipeline similar to conventional machine learning. We focus on the two fundamental causal inference tasks and causality-aware machine learning tasks. Limitations of current evaluation procedures are also discussed. We, then, examine popular causal inference tools/packages and conclude with primary challenges and opportunities for benchmarking causal learning algorithms in the era of Big Data. The survey seeks to bring to the forefront the urgency of developing publicly available benchmarks and consensus-building standards for causal learning evaluation with observational data. In doing so, we hope to broaden the discussions and facilitate collaboration to advance the innovation and application of causal learning.",

keywords = "Benchmarking, Big Data, causal inference, causal learning, evaluation",

author = "Lu Cheng and Ruocheng Guo and Raha Moraffah and Paras Sheth and Candan, {K. Selcuk} and Huan Liu",

note = "Funding Information: This work was supported in part by the National Science Foundation under Grant 1909555, Grant 2029044, Grant 2125246, Grant 1633381, and Grant 1610282, in part by the Association for Research Libraries under Grant W911NF2020124, and in part by the U.S. Army Materiel Command under Grant W911NF2110030. Publisher Copyright: {\textcopyright} 2020 IEEE.",

year = "2022",

month = dec,

day = "1",

doi = "10.1109/TAI.2022.3150264",

language = "English (US)",

volume = "3",

pages = "924--943",

journal = "IEEE Transactions on Artificial Intelligence",

issn = "2691-4581",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "6",

}

TY - JOUR

T1 - Evaluation Methods and Measures for Causal Learning Algorithms

AU - Cheng, Lu

AU - Guo, Ruocheng

AU - Moraffah, Raha

AU - Sheth, Paras

AU - Candan, K. Selcuk

AU - Liu, Huan

N1 - Funding Information: This work was supported in part by the National Science Foundation under Grant 1909555, Grant 2029044, Grant 2125246, Grant 1633381, and Grant 1610282, in part by the Association for Research Libraries under Grant W911NF2020124, and in part by the U.S. Army Materiel Command under Grant W911NF2110030. Publisher Copyright: © 2020 IEEE.

PY - 2022/12/1

Y1 - 2022/12/1

N2 - The convenient access to copious multifaceted data has encouraged machine learning researchers to reconsider correlation-based learning and embrace the opportunity of causality-based learning, i.e., causal machine learning (causal learning). Recent years have, therefore, witnessed great effort in developing causal learning algorithms aiming to help artificial intelligence (AI) achieve human-level intelligence. Due to the lack of ground-truth data, one of the biggest challenges in current causal learning research is algorithm evaluations. This largely impedes the cross-pollination of AI and causal inference and hinders the two fields to benefit from the advances of the other. To bridge from conventional causal inference (i.e., based on statistical methods) to causal learning with Big Data (i.e., the intersection of causal inference and machine learning), in this survey, we review commonly used datasets, evaluation methods, and measures for causal learning using an evaluation pipeline similar to conventional machine learning. We focus on the two fundamental causal inference tasks and causality-aware machine learning tasks. Limitations of current evaluation procedures are also discussed. We, then, examine popular causal inference tools/packages and conclude with primary challenges and opportunities for benchmarking causal learning algorithms in the era of Big Data. The survey seeks to bring to the forefront the urgency of developing publicly available benchmarks and consensus-building standards for causal learning evaluation with observational data. In doing so, we hope to broaden the discussions and facilitate collaboration to advance the innovation and application of causal learning.

AB - The convenient access to copious multifaceted data has encouraged machine learning researchers to reconsider correlation-based learning and embrace the opportunity of causality-based learning, i.e., causal machine learning (causal learning). Recent years have, therefore, witnessed great effort in developing causal learning algorithms aiming to help artificial intelligence (AI) achieve human-level intelligence. Due to the lack of ground-truth data, one of the biggest challenges in current causal learning research is algorithm evaluations. This largely impedes the cross-pollination of AI and causal inference and hinders the two fields to benefit from the advances of the other. To bridge from conventional causal inference (i.e., based on statistical methods) to causal learning with Big Data (i.e., the intersection of causal inference and machine learning), in this survey, we review commonly used datasets, evaluation methods, and measures for causal learning using an evaluation pipeline similar to conventional machine learning. We focus on the two fundamental causal inference tasks and causality-aware machine learning tasks. Limitations of current evaluation procedures are also discussed. We, then, examine popular causal inference tools/packages and conclude with primary challenges and opportunities for benchmarking causal learning algorithms in the era of Big Data. The survey seeks to bring to the forefront the urgency of developing publicly available benchmarks and consensus-building standards for causal learning evaluation with observational data. In doing so, we hope to broaden the discussions and facilitate collaboration to advance the innovation and application of causal learning.

KW - Benchmarking

KW - Big Data

KW - causal inference

KW - causal learning

KW - evaluation

UR - http://www.scopus.com/inward/record.url?scp=85130431240&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85130431240&partnerID=8YFLogxK

U2 - 10.1109/TAI.2022.3150264

DO - 10.1109/TAI.2022.3150264

M3 - Article

AN - SCOPUS:85130431240

SN - 2691-4581

VL - 3

SP - 924

EP - 943

JO - IEEE Transactions on Artificial Intelligence

JF - IEEE Transactions on Artificial Intelligence

IS - 6

ER -

Evaluation Methods and Measures for Causal Learning Algorithms

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this