Speech enhancement using a state-based transform model

Michael E. Deisher; Andreas Spanias

doi:10.1109/ACSSC.1994.471657

Speech enhancement using a state-based transform model

Michael E. Deisher, Andreas Spanias

Electrical Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

An analysis/synthesis technique based on harmonic sinusoidal modeling of speech is used to develop a new hidden Markov model (HMM) based speech enhancement algorithm. State sequence estimation is done using a standard HMM-based approach. State-based enhancement is carried out by assuming a harmonic model for speech, i.e., by representing each block of speech as a sum of sine waves in terms of a set of amplitudes, phases, and harmonically related frequencies. Given the maximum a-posteriori probability (MAP) state sequence, the amplitudes, phases, voicing, and fundamental frequency are estimated. Simulation results are presented, comparing the performance of the proposed algorithm to that of a standard HMM-based approach. The proposed method was found to reduce the structured residual noise normally associated with HMM-based algorithms.

Original language	English (US)
Title of host publication	Conference Record - 28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994
Publisher	IEEE Computer Society
Pages	1242-1246
Number of pages	5
ISBN (Electronic)	0818664053
DOIs	https://doi.org/10.1109/ACSSC.1994.471657
State	Published - 1994
Event	28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994 - Pacific Grove, United States Duration: Oct 31 1994 → Nov 2 1994

Publication series

Name	Conference Record - Asilomar Conference on Signals, Systems and Computers
Volume	2
ISSN (Print)	1058-6393

Conference

Conference	28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994
Country/Territory	United States
City	Pacific Grove
Period	10/31/94 → 11/2/94

ASJC Scopus subject areas

Signal Processing
Computer Networks and Communications

Access to Document

10.1109/ACSSC.1994.471657

Cite this

Speech enhancement using a state-based transform model. / Deisher, Michael E.; Spanias, Andreas.
Conference Record - 28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994. IEEE Computer Society, 1994. p. 1242-1246 471657 (Conference Record - Asilomar Conference on Signals, Systems and Computers; Vol. 2).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Deisher, ME & Spanias, A 1994, Speech enhancement using a state-based transform model. in Conference Record - 28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994., 471657, Conference Record - Asilomar Conference on Signals, Systems and Computers, vol. 2, IEEE Computer Society, pp. 1242-1246, 28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994, Pacific Grove, United States, 10/31/94. https://doi.org/10.1109/ACSSC.1994.471657

@inproceedings{d43a01aef6494402810e77131a538c1b,

title = "Speech enhancement using a state-based transform model",

abstract = "An analysis/synthesis technique based on harmonic sinusoidal modeling of speech is used to develop a new hidden Markov model (HMM) based speech enhancement algorithm. State sequence estimation is done using a standard HMM-based approach. State-based enhancement is carried out by assuming a harmonic model for speech, i.e., by representing each block of speech as a sum of sine waves in terms of a set of amplitudes, phases, and harmonically related frequencies. Given the maximum a-posteriori probability (MAP) state sequence, the amplitudes, phases, voicing, and fundamental frequency are estimated. Simulation results are presented, comparing the performance of the proposed algorithm to that of a standard HMM-based approach. The proposed method was found to reduce the structured residual noise normally associated with HMM-based algorithms.",

author = "Deisher, {Michael E.} and Andreas Spanias",

note = "Publisher Copyright: {\textcopyright} 1995 IEEE.; 28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994 ; Conference date: 31-10-1994 Through 02-11-1994",

year = "1994",

doi = "10.1109/ACSSC.1994.471657",

language = "English (US)",

series = "Conference Record - Asilomar Conference on Signals, Systems and Computers",

publisher = "IEEE Computer Society",

pages = "1242--1246",

booktitle = "Conference Record - 28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994",

}

TY - GEN

T1 - Speech enhancement using a state-based transform model

AU - Deisher, Michael E.

AU - Spanias, Andreas

PY - 1994

Y1 - 1994

N2 - An analysis/synthesis technique based on harmonic sinusoidal modeling of speech is used to develop a new hidden Markov model (HMM) based speech enhancement algorithm. State sequence estimation is done using a standard HMM-based approach. State-based enhancement is carried out by assuming a harmonic model for speech, i.e., by representing each block of speech as a sum of sine waves in terms of a set of amplitudes, phases, and harmonically related frequencies. Given the maximum a-posteriori probability (MAP) state sequence, the amplitudes, phases, voicing, and fundamental frequency are estimated. Simulation results are presented, comparing the performance of the proposed algorithm to that of a standard HMM-based approach. The proposed method was found to reduce the structured residual noise normally associated with HMM-based algorithms.

AB - An analysis/synthesis technique based on harmonic sinusoidal modeling of speech is used to develop a new hidden Markov model (HMM) based speech enhancement algorithm. State sequence estimation is done using a standard HMM-based approach. State-based enhancement is carried out by assuming a harmonic model for speech, i.e., by representing each block of speech as a sum of sine waves in terms of a set of amplitudes, phases, and harmonically related frequencies. Given the maximum a-posteriori probability (MAP) state sequence, the amplitudes, phases, voicing, and fundamental frequency are estimated. Simulation results are presented, comparing the performance of the proposed algorithm to that of a standard HMM-based approach. The proposed method was found to reduce the structured residual noise normally associated with HMM-based algorithms.

UR - http://www.scopus.com/inward/record.url?scp=85063502939&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85063502939&partnerID=8YFLogxK

U2 - 10.1109/ACSSC.1994.471657

DO - 10.1109/ACSSC.1994.471657

M3 - Conference contribution

AN - SCOPUS:85063502939

T3 - Conference Record - Asilomar Conference on Signals, Systems and Computers

SP - 1242

EP - 1246

BT - Conference Record - 28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994

PB - IEEE Computer Society

T2 - 28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994

Y2 - 31 October 1994 through 2 November 1994

ER -

Speech enhancement using a state-based transform model

Abstract

Publication series

Conference

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this