Patch before exploited: An approach to identify targeted software vulnerabilities

Mohammed Almukaynizi; Eric Nunes; Krishna Dharaiya; Manoj Senguttuvan; Jana Shakarian; Paulo Shakarian

doi:10.1007/978-3-319-98842-9_4

Patch before exploited: An approach to identify targeted software vulnerabilities

Mohammed Almukaynizi, Eric Nunes, Krishna Dharaiya, Manoj Senguttuvan, Jana Shakarian, Paulo Shakarian

Research output: Chapter in Book/Report/Conference proceeding › Chapter

10 Scopus citations

Abstract

The number of software vulnerabilities discovered and publicly disclosed is increasing every year; however, only a small fraction of these vulnerabilities are exploited in real-world attacks. With limitations on time and skilled resources, organizations often look at ways to identify threatened vulnerabilities for patch prioritization. In this chapter, an exploit prediction model is presented, which predicts whether a vulnerability will likely be exploited. Our proposed model leverages data from a variety of online data sources (white hat community, vulnerability research community, and dark web/deep web (DW) websites) with vulnerability mentions. Compared to the standard scoring system (CVSS base score) and a benchmark model that leverages Twitter data in exploit prediction, our model outperforms the baseline models with an F1 measure of 0.40 on the minority class (266% improvement over CVSS base score) and also achieves high true positive rate and low false positive rate (90%, 13%, respectively), making it highly effective as an early predictor of exploits that could appear in the wild. A qualitative and a quantitative study are also conducted to investigate whether the likelihood of exploitation increases if a vulnerability is mentioned in each of the examined data sources. The proposed model is proven to be much more robust than adversarial examples—postings authored by adversaries in the attempt to induce the model to produce incorrect predictions. A discussion on the viability of the model is provided, showing cases where the classifier achieves high performance, and other cases where the classifier performs less efficiently.

Original language	English (US)
Title of host publication	Intelligent Systems Reference Library
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	81-113
Number of pages	33
DOIs	https://doi.org/10.1007/978-3-319-98842-9_4
State	Published - 2019

Publication series

Name	Intelligent Systems Reference Library
Volume	151
ISSN (Print)	1868-4394
ISSN (Electronic)	1868-4408

ASJC Scopus subject areas

General Computer Science
Information Systems and Management
Library and Information Sciences

Access to Document

10.1007/978-3-319-98842-9_4

Cite this

Almukaynizi, M., Nunes, E., Dharaiya, K., Senguttuvan, M., Shakarian, J., & Shakarian, P. (2019). Patch before exploited: An approach to identify targeted software vulnerabilities. In Intelligent Systems Reference Library (pp. 81-113). (Intelligent Systems Reference Library; Vol. 151). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-319-98842-9_4

Patch before exploited: An approach to identify targeted software vulnerabilities. / Almukaynizi, Mohammed; Nunes, Eric; Dharaiya, Krishna et al.
Intelligent Systems Reference Library. Springer Science and Business Media Deutschland GmbH, 2019. p. 81-113 (Intelligent Systems Reference Library; Vol. 151).

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Almukaynizi, M, Nunes, E, Dharaiya, K, Senguttuvan, M, Shakarian, J & Shakarian, P 2019, Patch before exploited: An approach to identify targeted software vulnerabilities. in Intelligent Systems Reference Library. Intelligent Systems Reference Library, vol. 151, Springer Science and Business Media Deutschland GmbH, pp. 81-113. https://doi.org/10.1007/978-3-319-98842-9_4

@inbook{d077ad48ed9e417997d1467c3774289f,

title = "Patch before exploited: An approach to identify targeted software vulnerabilities",

abstract = "The number of software vulnerabilities discovered and publicly disclosed is increasing every year; however, only a small fraction of these vulnerabilities are exploited in real-world attacks. With limitations on time and skilled resources, organizations often look at ways to identify threatened vulnerabilities for patch prioritization. In this chapter, an exploit prediction model is presented, which predicts whether a vulnerability will likely be exploited. Our proposed model leverages data from a variety of online data sources (white hat community, vulnerability research community, and dark web/deep web (DW) websites) with vulnerability mentions. Compared to the standard scoring system (CVSS base score) and a benchmark model that leverages Twitter data in exploit prediction, our model outperforms the baseline models with an F1 measure of 0.40 on the minority class (266% improvement over CVSS base score) and also achieves high true positive rate and low false positive rate (90%, 13%, respectively), making it highly effective as an early predictor of exploits that could appear in the wild. A qualitative and a quantitative study are also conducted to investigate whether the likelihood of exploitation increases if a vulnerability is mentioned in each of the examined data sources. The proposed model is proven to be much more robust than adversarial examples—postings authored by adversaries in the attempt to induce the model to produce incorrect predictions. A discussion on the viability of the model is provided, showing cases where the classifier achieves high performance, and other cases where the classifier performs less efficiently.",

author = "Mohammed Almukaynizi and Eric Nunes and Krishna Dharaiya and Manoj Senguttuvan and Jana Shakarian and Paulo Shakarian",

note = "Funding Information: Acknowledgements Some of the authors were supported by the Office of Naval Research (ONR) contract N00014-15-1-2742, the Office of Naval Research (ONR) Neptune program and the ASU Global Security Initiative (GSI). Paulo Shakarian and Jana Shakarian are supported by the Office of the Director of National Intelligence (ODNI) and the Intelligence Advanced Research Projects Activity (IARPA) via the Air Force Research Laboratory (AFRL) contract number FA8750-16-C-0112. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon. Disclaimer: The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of ODNI, IARPA, AFRL, or the U.S. Government. Publisher Copyright: {\textcopyright} Springer Nature Switzerland AG 2019.",

year = "2019",

doi = "10.1007/978-3-319-98842-9_4",

language = "English (US)",

series = "Intelligent Systems Reference Library",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "81--113",

booktitle = "Intelligent Systems Reference Library",

address = "Germany",

}

TY - CHAP

T1 - Patch before exploited

T2 - An approach to identify targeted software vulnerabilities

AU - Almukaynizi, Mohammed

AU - Nunes, Eric

AU - Dharaiya, Krishna

AU - Senguttuvan, Manoj

AU - Shakarian, Jana

AU - Shakarian, Paulo

N1 - Funding Information: Acknowledgements Some of the authors were supported by the Office of Naval Research (ONR) contract N00014-15-1-2742, the Office of Naval Research (ONR) Neptune program and the ASU Global Security Initiative (GSI). Paulo Shakarian and Jana Shakarian are supported by the Office of the Director of National Intelligence (ODNI) and the Intelligence Advanced Research Projects Activity (IARPA) via the Air Force Research Laboratory (AFRL) contract number FA8750-16-C-0112. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon. Disclaimer: The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of ODNI, IARPA, AFRL, or the U.S. Government. Publisher Copyright: © Springer Nature Switzerland AG 2019.

PY - 2019

Y1 - 2019

N2 - The number of software vulnerabilities discovered and publicly disclosed is increasing every year; however, only a small fraction of these vulnerabilities are exploited in real-world attacks. With limitations on time and skilled resources, organizations often look at ways to identify threatened vulnerabilities for patch prioritization. In this chapter, an exploit prediction model is presented, which predicts whether a vulnerability will likely be exploited. Our proposed model leverages data from a variety of online data sources (white hat community, vulnerability research community, and dark web/deep web (DW) websites) with vulnerability mentions. Compared to the standard scoring system (CVSS base score) and a benchmark model that leverages Twitter data in exploit prediction, our model outperforms the baseline models with an F1 measure of 0.40 on the minority class (266% improvement over CVSS base score) and also achieves high true positive rate and low false positive rate (90%, 13%, respectively), making it highly effective as an early predictor of exploits that could appear in the wild. A qualitative and a quantitative study are also conducted to investigate whether the likelihood of exploitation increases if a vulnerability is mentioned in each of the examined data sources. The proposed model is proven to be much more robust than adversarial examples—postings authored by adversaries in the attempt to induce the model to produce incorrect predictions. A discussion on the viability of the model is provided, showing cases where the classifier achieves high performance, and other cases where the classifier performs less efficiently.

AB - The number of software vulnerabilities discovered and publicly disclosed is increasing every year; however, only a small fraction of these vulnerabilities are exploited in real-world attacks. With limitations on time and skilled resources, organizations often look at ways to identify threatened vulnerabilities for patch prioritization. In this chapter, an exploit prediction model is presented, which predicts whether a vulnerability will likely be exploited. Our proposed model leverages data from a variety of online data sources (white hat community, vulnerability research community, and dark web/deep web (DW) websites) with vulnerability mentions. Compared to the standard scoring system (CVSS base score) and a benchmark model that leverages Twitter data in exploit prediction, our model outperforms the baseline models with an F1 measure of 0.40 on the minority class (266% improvement over CVSS base score) and also achieves high true positive rate and low false positive rate (90%, 13%, respectively), making it highly effective as an early predictor of exploits that could appear in the wild. A qualitative and a quantitative study are also conducted to investigate whether the likelihood of exploitation increases if a vulnerability is mentioned in each of the examined data sources. The proposed model is proven to be much more robust than adversarial examples—postings authored by adversaries in the attempt to induce the model to produce incorrect predictions. A discussion on the viability of the model is provided, showing cases where the classifier achieves high performance, and other cases where the classifier performs less efficiently.

UR - http://www.scopus.com/inward/record.url?scp=85053673724&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85053673724&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-98842-9_4

DO - 10.1007/978-3-319-98842-9_4

M3 - Chapter

AN - SCOPUS:85053673724

T3 - Intelligent Systems Reference Library

SP - 81

EP - 113

BT - Intelligent Systems Reference Library

PB - Springer Science and Business Media Deutschland GmbH

ER -

Patch before exploited: An approach to identify targeted software vulnerabilities

Abstract

Publication series

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this