Analysis/synthesis of speech using the short-time Fourier transform and a time-varying ARMA process

Andreas Spanias; Philipos Loizou; Gim Lim; Ye Chen; Gen Hu

Analysis/synthesis of speech using the short-time Fourier transform and a time-varying ARMA process

Andreas Spanias, Philipos Loizou, Gim Lim, Ye Chen, Gen Hu

Electrical Engineering

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

A speech analysis/synthesis system that relies on a time-varying Auto Regressive Moving Average (ARMA) process and the Short-Time Fourier Transform (STFT) is proposed. The narrowband components in speech are represented in the frequency domain by a set of harmonic components, while the broadband random components are represented by a time-varying ARMA process. The time-varying ARMA model has a dual function, namely, it creates a spectral envelope that fits accurately the harmonic STFT components, and provides for the spectral representation of the broadband components of speech. The proposed model essentially combines the features of waveform coders by employing the STFT and the features of traditional vocoders by incorporating an appropriately shaped noise sequence.

Original language	English (US)
Pages (from-to)	645-652
Number of pages	8
Journal	IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Volume	E76-A
Issue number	4
State	Published - Apr 1993

ASJC Scopus subject areas

Signal Processing
Computer Graphics and Computer-Aided Design
Electrical and Electronic Engineering
Applied Mathematics

Cite this

@article{26a74f3938c34fef805222f219c0f6cd,

title = "Analysis/synthesis of speech using the short-time Fourier transform and a time-varying ARMA process",

abstract = "A speech analysis/synthesis system that relies on a time-varying Auto Regressive Moving Average (ARMA) process and the Short-Time Fourier Transform (STFT) is proposed. The narrowband components in speech are represented in the frequency domain by a set of harmonic components, while the broadband random components are represented by a time-varying ARMA process. The time-varying ARMA model has a dual function, namely, it creates a spectral envelope that fits accurately the harmonic STFT components, and provides for the spectral representation of the broadband components of speech. The proposed model essentially combines the features of waveform coders by employing the STFT and the features of traditional vocoders by incorporating an appropriately shaped noise sequence.",

author = "Andreas Spanias and Philipos Loizou and Gim Lim and Ye Chen and Gen Hu",

year = "1993",

month = apr,

language = "English (US)",

volume = "E76-A",

pages = "645--652",

journal = "IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences",

issn = "0916-8508",

publisher = "Maruzen Co., Ltd/Maruzen Kabushikikaisha",

number = "4",

}

TY - JOUR

T1 - Analysis/synthesis of speech using the short-time Fourier transform and a time-varying ARMA process

AU - Spanias, Andreas

AU - Loizou, Philipos

AU - Lim, Gim

AU - Chen, Ye

AU - Hu, Gen

PY - 1993/4

Y1 - 1993/4

N2 - A speech analysis/synthesis system that relies on a time-varying Auto Regressive Moving Average (ARMA) process and the Short-Time Fourier Transform (STFT) is proposed. The narrowband components in speech are represented in the frequency domain by a set of harmonic components, while the broadband random components are represented by a time-varying ARMA process. The time-varying ARMA model has a dual function, namely, it creates a spectral envelope that fits accurately the harmonic STFT components, and provides for the spectral representation of the broadband components of speech. The proposed model essentially combines the features of waveform coders by employing the STFT and the features of traditional vocoders by incorporating an appropriately shaped noise sequence.

AB - A speech analysis/synthesis system that relies on a time-varying Auto Regressive Moving Average (ARMA) process and the Short-Time Fourier Transform (STFT) is proposed. The narrowband components in speech are represented in the frequency domain by a set of harmonic components, while the broadband random components are represented by a time-varying ARMA process. The time-varying ARMA model has a dual function, namely, it creates a spectral envelope that fits accurately the harmonic STFT components, and provides for the spectral representation of the broadband components of speech. The proposed model essentially combines the features of waveform coders by employing the STFT and the features of traditional vocoders by incorporating an appropriately shaped noise sequence.

UR - http://www.scopus.com/inward/record.url?scp=0027585471&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0027585471&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:0027585471

SN - 0916-8508

VL - E76-A

SP - 645

EP - 652

JO - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

JF - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

IS - 4

ER -

Analysis/synthesis of speech using the short-time Fourier transform and a time-varying ARMA process

Abstract

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this