Analysis/synthesis of speech using the short-time Fourier transform and a time-varying ARMA process

Andreas Spanias, Philipos Loizou, Gim Lim, Ye Chen, Gen Hu

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

A speech analysis/synthesis system that relies on a time-varying Auto Regressive Moving Average (ARMA) process and the Short-Time Fourier Transform (STFT) is proposed. The narrowband components in speech are represented in the frequency domain by a set of harmonic components, while the broadband random components are represented by a time-varying ARMA process. The time-varying ARMA model has a dual function, namely, it creates a spectral envelope that fits accurately the harmonic STFT components, and provides for the spectral representation of the broadband components of speech. The proposed model essentially combines the features of waveform coders by employing the STFT and the features of traditional vocoders by incorporating an appropriately shaped noise sequence.

Original languageEnglish (US)
Pages (from-to)645-652
Number of pages8
JournalIEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
VolumeE76-A
Issue number4
StatePublished - Apr 1993

Fingerprint

Short-time Fourier Transform
Moving Average Process
Autoregressive Moving Average
Autoregressive Process
Time-varying
Fourier transforms
Synthesis
Vocoders
Speech analysis
Broadband
Harmonic
Speech Analysis
Autoregressive Moving Average Model
Spectral Representation
Waveform
Envelope
Frequency Domain
Speech

ASJC Scopus subject areas

  • Hardware and Architecture
  • Information Systems
  • Electrical and Electronic Engineering

Cite this

Analysis/synthesis of speech using the short-time Fourier transform and a time-varying ARMA process. / Spanias, Andreas; Loizou, Philipos; Lim, Gim; Chen, Ye; Hu, Gen.

In: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, Vol. E76-A, No. 4, 04.1993, p. 645-652.

Research output: Contribution to journalArticle

@article{26a74f3938c34fef805222f219c0f6cd,
title = "Analysis/synthesis of speech using the short-time Fourier transform and a time-varying ARMA process",
abstract = "A speech analysis/synthesis system that relies on a time-varying Auto Regressive Moving Average (ARMA) process and the Short-Time Fourier Transform (STFT) is proposed. The narrowband components in speech are represented in the frequency domain by a set of harmonic components, while the broadband random components are represented by a time-varying ARMA process. The time-varying ARMA model has a dual function, namely, it creates a spectral envelope that fits accurately the harmonic STFT components, and provides for the spectral representation of the broadband components of speech. The proposed model essentially combines the features of waveform coders by employing the STFT and the features of traditional vocoders by incorporating an appropriately shaped noise sequence.",
author = "Andreas Spanias and Philipos Loizou and Gim Lim and Ye Chen and Gen Hu",
year = "1993",
month = "4",
language = "English (US)",
volume = "E76-A",
pages = "645--652",
journal = "IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences",
issn = "0916-8508",
publisher = "Maruzen Co., Ltd/Maruzen Kabushikikaisha",
number = "4",

}

TY - JOUR

T1 - Analysis/synthesis of speech using the short-time Fourier transform and a time-varying ARMA process

AU - Spanias, Andreas

AU - Loizou, Philipos

AU - Lim, Gim

AU - Chen, Ye

AU - Hu, Gen

PY - 1993/4

Y1 - 1993/4

N2 - A speech analysis/synthesis system that relies on a time-varying Auto Regressive Moving Average (ARMA) process and the Short-Time Fourier Transform (STFT) is proposed. The narrowband components in speech are represented in the frequency domain by a set of harmonic components, while the broadband random components are represented by a time-varying ARMA process. The time-varying ARMA model has a dual function, namely, it creates a spectral envelope that fits accurately the harmonic STFT components, and provides for the spectral representation of the broadband components of speech. The proposed model essentially combines the features of waveform coders by employing the STFT and the features of traditional vocoders by incorporating an appropriately shaped noise sequence.

AB - A speech analysis/synthesis system that relies on a time-varying Auto Regressive Moving Average (ARMA) process and the Short-Time Fourier Transform (STFT) is proposed. The narrowband components in speech are represented in the frequency domain by a set of harmonic components, while the broadband random components are represented by a time-varying ARMA process. The time-varying ARMA model has a dual function, namely, it creates a spectral envelope that fits accurately the harmonic STFT components, and provides for the spectral representation of the broadband components of speech. The proposed model essentially combines the features of waveform coders by employing the STFT and the features of traditional vocoders by incorporating an appropriately shaped noise sequence.

UR - http://www.scopus.com/inward/record.url?scp=0027585471&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0027585471&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:0027585471

VL - E76-A

SP - 645

EP - 652

JO - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

JF - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

SN - 0916-8508

IS - 4

ER -