A new phase model for sinusoidal transform coding of speech

Sassan Ahmadi, Andreas Spanias

Research output: Contribution to journalArticle

12 Citations (Scopus)

Abstract

A phase modeling algorithm for sinusoidal analysis-synthesis of speech is presented, where short-time sinusoidal phases are approximated using a combination of linear prediction, spectral sampling, delay compensation, and phase correction techniques. The algorithm is different than phase compensation methods proposed for source-system LPC in that it has been tailored to sinusoidal representation of speech. Performance analysis on a large speech data base reveals improvement in temporal and spectral signal matching, as well as in the subjective quality of reconstructed speech. The method can be applied to enhance phase matching in low bit rate sinusoidal coders, where underlying sine wave amplitudes are extracted from an all-pole model. Preliminary subjective results are presented for a 2.4 kb/s sinusoidal coder.

Original languageEnglish (US)
Pages (from-to)495-501
Number of pages7
JournalIEEE Transactions on Speech and Audio Processing
Volume6
Issue number5
DOIs
StatePublished - 1998

Fingerprint

coding
coders
linear prediction
Phase matching
sine waves
data bases
phase matching
Poles
poles
sampling
Sampling
synthesis
Compensation and Redress

Keywords

  • Linear prediction
  • Linear predictive coding
  • Phase modeling
  • Sinusoidal representation
  • Speech coding

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Acoustics and Ultrasonics

Cite this

A new phase model for sinusoidal transform coding of speech. / Ahmadi, Sassan; Spanias, Andreas.

In: IEEE Transactions on Speech and Audio Processing, Vol. 6, No. 5, 1998, p. 495-501.

Research output: Contribution to journalArticle

@article{8b7857602d8f4e9892e42f09dfb84928,
title = "A new phase model for sinusoidal transform coding of speech",
abstract = "A phase modeling algorithm for sinusoidal analysis-synthesis of speech is presented, where short-time sinusoidal phases are approximated using a combination of linear prediction, spectral sampling, delay compensation, and phase correction techniques. The algorithm is different than phase compensation methods proposed for source-system LPC in that it has been tailored to sinusoidal representation of speech. Performance analysis on a large speech data base reveals improvement in temporal and spectral signal matching, as well as in the subjective quality of reconstructed speech. The method can be applied to enhance phase matching in low bit rate sinusoidal coders, where underlying sine wave amplitudes are extracted from an all-pole model. Preliminary subjective results are presented for a 2.4 kb/s sinusoidal coder.",
keywords = "Linear prediction, Linear predictive coding, Phase modeling, Sinusoidal representation, Speech coding",
author = "Sassan Ahmadi and Andreas Spanias",
year = "1998",
doi = "10.1109/89.709675",
language = "English (US)",
volume = "6",
pages = "495--501",
journal = "IEEE Transactions on Speech and Audio Processing",
issn = "1558-7916",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "5",

}

TY - JOUR

T1 - A new phase model for sinusoidal transform coding of speech

AU - Ahmadi, Sassan

AU - Spanias, Andreas

PY - 1998

Y1 - 1998

N2 - A phase modeling algorithm for sinusoidal analysis-synthesis of speech is presented, where short-time sinusoidal phases are approximated using a combination of linear prediction, spectral sampling, delay compensation, and phase correction techniques. The algorithm is different than phase compensation methods proposed for source-system LPC in that it has been tailored to sinusoidal representation of speech. Performance analysis on a large speech data base reveals improvement in temporal and spectral signal matching, as well as in the subjective quality of reconstructed speech. The method can be applied to enhance phase matching in low bit rate sinusoidal coders, where underlying sine wave amplitudes are extracted from an all-pole model. Preliminary subjective results are presented for a 2.4 kb/s sinusoidal coder.

AB - A phase modeling algorithm for sinusoidal analysis-synthesis of speech is presented, where short-time sinusoidal phases are approximated using a combination of linear prediction, spectral sampling, delay compensation, and phase correction techniques. The algorithm is different than phase compensation methods proposed for source-system LPC in that it has been tailored to sinusoidal representation of speech. Performance analysis on a large speech data base reveals improvement in temporal and spectral signal matching, as well as in the subjective quality of reconstructed speech. The method can be applied to enhance phase matching in low bit rate sinusoidal coders, where underlying sine wave amplitudes are extracted from an all-pole model. Preliminary subjective results are presented for a 2.4 kb/s sinusoidal coder.

KW - Linear prediction

KW - Linear predictive coding

KW - Phase modeling

KW - Sinusoidal representation

KW - Speech coding

UR - http://www.scopus.com/inward/record.url?scp=0032165792&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0032165792&partnerID=8YFLogxK

U2 - 10.1109/89.709675

DO - 10.1109/89.709675

M3 - Article

AN - SCOPUS:0032165792

VL - 6

SP - 495

EP - 501

JO - IEEE Transactions on Speech and Audio Processing

JF - IEEE Transactions on Speech and Audio Processing

SN - 1558-7916

IS - 5

ER -