TY - GEN
T1 - COVID-19 Detection using Audio Spectral Features and Machine Learning
AU - Esposito, Michael
AU - Rao, Sunil
AU - Narayanaswamy, Vivek
AU - Spanias, Andreas
N1 - Funding Information:
We began this REU study in the spring of 2020 by first providing “bootcamp” training in research protocols, digital signal processing (DSP), and machine learning (ML) basics. The REU engagement was virtual due to pandemic restrictions. Online lectures and hands-on programming activities covered introductory content in DSP, speech processing, and machine learning. Lecture topics included the basics of fast Fourier transforms, analog-to-digital conversion and sampling theory, basic properties of speech and audio signals, and fundamentals of machine learning. Initial hands-on DSP activities were supported by the lab book content [11] and simulations in the Java-DSP (J-DSP) [12] object-oriented environment. Specific preparation for audio included understanding and identifying harmonics, formants, time- and frequency-domain representations of audio [13], spectrograms, and Mel-frequency cepstral coefficients (MFCCs) [14]. The initial interactive machine learning training activities used specific J-DSP ML functions for k-means clustering and sound recognition from the spectrum [15], followed by an introduction to MATLAB. MATLAB was used to implement the k-means algorithm [16] and to understand the basics of neural network classification. While simple ML concepts and algorithms were introduced in J-DSP and MATLAB [17], it became critical for the REU student to gain skills in Python programming. Sample ML Python code was provided by the SenSIP center labs, and a formal online course in the basics of Python [18] was completed as part of this training. The course covered Python syntax with quizzes and short projects. In addition to this formal course [18], simulations were run using Python ML packages [19] to provide hands-on experience.
Once pre-training was completed, specific tasks included: a) a literature review of recent audio-based efforts toward COVID-19 detection, b) building knowledge of feature extraction, and c) understanding ML and, more specifically, neural network algorithms. The REU student worked with two PhD students and a faculty member at ASU and held weekly meetings with informal progress presentations. Feedback was provided at every step, and the student documented his progress in a working document. The student gave quarterly formal presentations to faculty and industry members of the ASU SenSIP industry-university center to sharpen his research presentation skills. The REU student also gave a formal presentation at an undergraduate research conference, NCUR 2021, which was held as a virtual meeting that year [20].
Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - In this research and education REU project, we use audio waveform signatures of coughing to investigate whether COVID-19 can be diagnosed from cough sounds. More specifically, we extract spectral features from cough audio and use neural network architectures to develop diagnostics for COVID-19. The non-invasive, rapid, and remote nature of this approach, relative to existing nasal swab, saliva, and blood tests, makes it attractive for deployment on smartphones. Challenges include distorted or low-quality audio samples, limited availability of reliably labeled data, confusability with other respiratory diseases, and the lack of baseline (healthy) audio recordings for comparison. We studied, compared, tuned, and implemented an array of convolutional neural network architectures in Python. Results using a unique parallel machine learning architecture with a fusion unit are presented.
AB - In this research and education REU project, we use audio waveform signatures of coughing to investigate whether COVID-19 can be diagnosed from cough sounds. More specifically, we extract spectral features from cough audio and use neural network architectures to develop diagnostics for COVID-19. The non-invasive, rapid, and remote nature of this approach, relative to existing nasal swab, saliva, and blood tests, makes it attractive for deployment on smartphones. Challenges include distorted or low-quality audio samples, limited availability of reliably labeled data, confusability with other respiratory diseases, and the lack of baseline (healthy) audio recordings for comparison. We studied, compared, tuned, and implemented an array of convolutional neural network architectures in Python. Results using a unique parallel machine learning architecture with a fusion unit are presented.
KW - COVID-19
KW - cough audio
KW - machine learning
KW - neural networks
KW - spectral features
KW - tachypnea
UR - http://www.scopus.com/inward/record.url?scp=85117492843&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85117492843&partnerID=8YFLogxK
U2 - 10.1109/IEEECONF53345.2021.9723323
DO - 10.1109/IEEECONF53345.2021.9723323
M3 - Conference contribution
AN - SCOPUS:85117492843
T3 - Conference Record - Asilomar Conference on Signals, Systems and Computers
SP - 1146
EP - 1150
BT - 55th Asilomar Conference on Signals, Systems and Computers, ACSSC 2021
A2 - Matthews, Michael B.
PB - IEEE Computer Society
T2 - 55th Asilomar Conference on Signals, Systems and Computers, ACSSC 2021
Y2 - 31 October 2021 through 3 November 2021
ER -