TY - GEN
T1 - An Analytical Framework for Security-Tuning of Artificial Intelligence Applications under Attack
AU - Sadeghi, Koosha
AU - Banerjee, Ayan
AU - Gupta, Sandeep K.S.
N1 - Funding Information:
This work has been partly funded by CNS grant #1218505, IIS grant #1116385, and NIH grant #EB019202.
Funding Information:
∗This work has been partly funded by CNS grant #1218505, IIS grant #1116385, and NIH grant #EB019202.
PY - 2019/5/17
Y1 - 2019/5/17
N2 - Machine Learning (ML) algorithms, as the core technology in Artificial Intelligence (AI) applications, such as self-driving vehicles, make important decisions by performing a variety of data classification or prediction tasks. Attacks on data or algorithms in AI applications can lead to misclassification or misprediction, which can fail the applications. For each dataset separately, the parameters of ML algorithms should be tuned to reach a desirable classification or prediction accuracy. Typically, ML experts tune the parameters empirically, which can be time consuming and does not guarantee the optimal result. To this end, some research suggests an analytical approach to tune the ML parameters for maximum accuracy. However, none of the works consider the ML performance under attack in their tuning process. This paper proposes an analytical framework for tuning the ML parameters to be secure against attacks, while keeping its accuracy high. The framework finds the optimal set of parameters by defining a novel objective function, which takes into account the test results of both ML accuracy and its security against attacks. For validating the framework, an AI application is implemented to recognize whether a subject's eyes are open or closed, by applying k-Nearest Neighbors (kNN) algorithm on her Electroencephalogram (EEG) signals. In this application, the number of neighbors (k) and the distance metric type, as the two main parameters of kNN, are chosen for tuning. The input data perturbation attack, as one of the most common attacks on ML algorithms, is used for testing the security of the application. Exhaustive search approach is used to solve the optimization problem. The experiment results show k = 43 and cosine distance metric is the optimal configuration of kNN for the EEG dataset, which leads to 83.75% classification accuracy and reduces the attack success rate to 5.21%.
AB - Machine Learning (ML) algorithms, as the core technology in Artificial Intelligence (AI) applications, such as self-driving vehicles, make important decisions by performing a variety of data classification or prediction tasks. Attacks on data or algorithms in AI applications can lead to misclassification or misprediction, which can fail the applications. For each dataset separately, the parameters of ML algorithms should be tuned to reach a desirable classification or prediction accuracy. Typically, ML experts tune the parameters empirically, which can be time consuming and does not guarantee the optimal result. To this end, some research suggests an analytical approach to tune the ML parameters for maximum accuracy. However, none of the works consider the ML performance under attack in their tuning process. This paper proposes an analytical framework for tuning the ML parameters to be secure against attacks, while keeping its accuracy high. The framework finds the optimal set of parameters by defining a novel objective function, which takes into account the test results of both ML accuracy and its security against attacks. For validating the framework, an AI application is implemented to recognize whether a subject's eyes are open or closed, by applying k-Nearest Neighbors (kNN) algorithm on her Electroencephalogram (EEG) signals. In this application, the number of neighbors (k) and the distance metric type, as the two main parameters of kNN, are chosen for tuning. The input data perturbation attack, as one of the most common attacks on ML algorithms, is used for testing the security of the application. Exhaustive search approach is used to solve the optimization problem. The experiment results show k = 43 and cosine distance metric is the optimal configuration of kNN for the EEG dataset, which leads to 83.75% classification accuracy and reduces the attack success rate to 5.21%.
KW - Artificial intelligence
KW - Machine learning
KW - Optimization
KW - Parameters tuning
KW - Perturbation attack
KW - Security
UR - http://www.scopus.com/inward/record.url?scp=85067090103&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85067090103&partnerID=8YFLogxK
U2 - 10.1109/AITest.2019.00012
DO - 10.1109/AITest.2019.00012
M3 - Conference contribution
AN - SCOPUS:85067090103
T3 - Proceedings - 2019 IEEE International Conference on Artificial Intelligence Testing, AITest 2019
SP - 111
EP - 118
BT - Proceedings - 2019 IEEE International Conference on Artificial Intelligence Testing, AITest 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 1st IEEE International Conference on Artificial Intelligence Testing, AITest 2019
Y2 - 4 April 2019 through 9 April 2019
ER -