An auditory-domain based speech enhancement algorithm

Harish Krishnamoorthi; Andreas Spanias; Visar Berisha; Homin Kwon; Harvey Thornburg

doi:10.1109/ICASSP.2010.5495147

An auditory-domain based speech enhancement algorithm

Harish Krishnamoorthi, Andreas Spanias, Visar Berisha, Homin Kwon, Harvey Thornburg

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citations

Abstract

Typically, speech enhancement algorithms minimize a suitable error criterion in the spectral or time domain. Although the error criterions have included perceptual properties such as masking thresholds, non-uniform frequency resolution and sensitivity of the auditory system, these are only done heuristically and the error criterion does not explicitly include an auditory model in their formulation. In this paper, we propose an auditory-domain based speech enhancement algorithm that minimizes the distortion between the auditory representation of the estimated and desired signal. Simulation results indicate that the proposed algorithm performs effectively under different noise conditions and also results in a lower average loudness error.

Original language	English (US)
Title of host publication	2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	4786-4789
Number of pages	4
ISBN (Print)	9781424442966
DOIs	https://doi.org/10.1109/ICASSP.2010.5495147
State	Published - 2010
Event	2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Dallas, TX, United States Duration: Mar 14 2010 → Mar 19 2010

Publication series

Name	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)	1520-6149

Other

Other	2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010
Country/Territory	United States
City	Dallas, TX
Period	3/14/10 → 3/19/10

Keywords

Auditory representation
Loudness
Psychoacoustics
Speech enhancement

ASJC Scopus subject areas

Software
Signal Processing
Electrical and Electronic Engineering

Access to Document

10.1109/ICASSP.2010.5495147

Cite this

Krishnamoorthi, H., Spanias, A., Berisha, V., Kwon, H., & Thornburg, H. (2010). An auditory-domain based speech enhancement algorithm. In 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings (pp. 4786-4789). Article 5495147 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICASSP.2010.5495147

An auditory-domain based speech enhancement algorithm. / Krishnamoorthi, Harish; Spanias, Andreas ; Berisha, Visar et al.
2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2010. p. 4786-4789 5495147 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Krishnamoorthi, H, Spanias, A , Berisha, V, Kwon, H & Thornburg, H 2010, An auditory-domain based speech enhancement algorithm. in 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings., 5495147, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Institute of Electrical and Electronics Engineers Inc., pp. 4786-4789, 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010, Dallas, TX, United States, 3/14/10. https://doi.org/10.1109/ICASSP.2010.5495147

Krishnamoorthi H, Spanias A , Berisha V, Kwon H, Thornburg H. An auditory-domain based speech enhancement algorithm. In 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2010. p. 4786-4789. 5495147. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). doi: 10.1109/ICASSP.2010.5495147

Krishnamoorthi, Harish ; Spanias, Andreas ; Berisha, Visar et al. / An auditory-domain based speech enhancement algorithm. 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2010. pp. 4786-4789 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

@inproceedings{471aaad83b064f409f4f55865908926d,

title = "An auditory-domain based speech enhancement algorithm",

abstract = "Typically, speech enhancement algorithms minimize a suitable error criterion in the spectral or time domain. Although the error criterions have included perceptual properties such as masking thresholds, non-uniform frequency resolution and sensitivity of the auditory system, these are only done heuristically and the error criterion does not explicitly include an auditory model in their formulation. In this paper, we propose an auditory-domain based speech enhancement algorithm that minimizes the distortion between the auditory representation of the estimated and desired signal. Simulation results indicate that the proposed algorithm performs effectively under different noise conditions and also results in a lower average loudness error.",

keywords = "Auditory representation, Loudness, Psychoacoustics, Speech enhancement",

author = "Harish Krishnamoorthi and Andreas Spanias and Visar Berisha and Homin Kwon and Harvey Thornburg",

year = "2010",

doi = "10.1109/ICASSP.2010.5495147",

language = "English (US)",

isbn = "9781424442966",

series = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "4786--4789",

booktitle = "2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings",

note = "2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 ; Conference date: 14-03-2010 Through 19-03-2010",

}

TY - GEN

T1 - An auditory-domain based speech enhancement algorithm

AU - Krishnamoorthi, Harish

AU - Spanias, Andreas

AU - Berisha, Visar

AU - Kwon, Homin

AU - Thornburg, Harvey

PY - 2010

Y1 - 2010

N2 - Typically, speech enhancement algorithms minimize a suitable error criterion in the spectral or time domain. Although the error criterions have included perceptual properties such as masking thresholds, non-uniform frequency resolution and sensitivity of the auditory system, these are only done heuristically and the error criterion does not explicitly include an auditory model in their formulation. In this paper, we propose an auditory-domain based speech enhancement algorithm that minimizes the distortion between the auditory representation of the estimated and desired signal. Simulation results indicate that the proposed algorithm performs effectively under different noise conditions and also results in a lower average loudness error.

AB - Typically, speech enhancement algorithms minimize a suitable error criterion in the spectral or time domain. Although the error criterions have included perceptual properties such as masking thresholds, non-uniform frequency resolution and sensitivity of the auditory system, these are only done heuristically and the error criterion does not explicitly include an auditory model in their formulation. In this paper, we propose an auditory-domain based speech enhancement algorithm that minimizes the distortion between the auditory representation of the estimated and desired signal. Simulation results indicate that the proposed algorithm performs effectively under different noise conditions and also results in a lower average loudness error.

KW - Auditory representation

KW - Loudness

KW - Psychoacoustics

KW - Speech enhancement

UR - http://www.scopus.com/inward/record.url?scp=78049357435&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78049357435&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2010.5495147

DO - 10.1109/ICASSP.2010.5495147

M3 - Conference contribution

AN - SCOPUS:78049357435

SN - 9781424442966

T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

SP - 4786

EP - 4789

BT - 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010

Y2 - 14 March 2010 through 19 March 2010

ER -

An auditory-domain based speech enhancement algorithm

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this