Low rate speech representation by vector quantizing transform components

Philipos C. Loizou, Andreas Spanias

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

A transform-based system is described for vector quantizing speech at 4800 BPS. Multiple codebooks were used to quantize groups of harmonic components of speech, corresponding to different frequency bands. To ensure that the distortion caused by the vector quantizer at low frequencies was kept at a minimum, low-dimensionality codebooks were used. Furthermore, to reduce the errors of the FFT resolution, transform components, within the baseband and adjacent to the pitch-harmonic components, were also encoded and quantized by low-dimensionality codebooks. This was found to improve the quality of speech. Speech produced at 4800 BPS using this method was found to be of good quality, particularly for female speakers. This is because the number of harmonic components required for female (high-pitch) speakers is generally smaller than that required for male (low-pitch) speakers.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE International Symposium on Circuits and Systems
PublisherPubl by IEEE
Pages320-323
Number of pages4
Volume1
StatePublished - 1991
Event1991 IEEE International Symposium on Circuits and Systems Part 1 (of 5) - Singapore, Singapore
Duration: Jun 11 1991Jun 14 1991

Other

Other1991 IEEE International Symposium on Circuits and Systems Part 1 (of 5)
CitySingapore, Singapore
Period6/11/916/14/91

Fingerprint

Fast Fourier transforms
Frequency bands
Mathematical transformations

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Electronic, Optical and Magnetic Materials

Cite this

Loizou, P. C., & Spanias, A. (1991). Low rate speech representation by vector quantizing transform components. In Proceedings - IEEE International Symposium on Circuits and Systems (Vol. 1, pp. 320-323). Publ by IEEE.

Low rate speech representation by vector quantizing transform components. / Loizou, Philipos C.; Spanias, Andreas.

Proceedings - IEEE International Symposium on Circuits and Systems. Vol. 1 Publ by IEEE, 1991. p. 320-323.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Loizou, PC & Spanias, A 1991, Low rate speech representation by vector quantizing transform components. in Proceedings - IEEE International Symposium on Circuits and Systems. vol. 1, Publ by IEEE, pp. 320-323, 1991 IEEE International Symposium on Circuits and Systems Part 1 (of 5), Singapore, Singapore, 6/11/91.
Loizou PC, Spanias A. Low rate speech representation by vector quantizing transform components. In Proceedings - IEEE International Symposium on Circuits and Systems. Vol. 1. Publ by IEEE. 1991. p. 320-323
Loizou, Philipos C. ; Spanias, Andreas. / Low rate speech representation by vector quantizing transform components. Proceedings - IEEE International Symposium on Circuits and Systems. Vol. 1 Publ by IEEE, 1991. pp. 320-323
@inproceedings{1c63437112be4240851dfa106e76113f,
title = "Low rate speech representation by vector quantizing transform components",
abstract = "A transform-based system is described for vector quantizing speech at 4800 BPS. Multiple codebooks were used to quantize groups of harmonic components of speech, corresponding to different frequency bands. To ensure that the distortion caused by the vector quantizer at low frequencies was kept at a minimum, low-dimensionality codebooks were used. Furthermore, to reduce the errors of the FFT resolution, transform components, within the baseband and adjacent to the pitch-harmonic components, were also encoded and quantized by low-dimensionality codebooks. This was found to improve the quality of speech. Speech produced at 4800 BPS using this method was found to be of good quality, particularly for female speakers. This is because the number of harmonic components required for female (high-pitch) speakers is generally smaller than that required for male (low-pitch) speakers.",
author = "Loizou, {Philipos C.} and Andreas Spanias",
year = "1991",
language = "English (US)",
volume = "1",
pages = "320--323",
booktitle = "Proceedings - IEEE International Symposium on Circuits and Systems",
publisher = "Publ by IEEE",

}

TY - GEN

T1 - Low rate speech representation by vector quantizing transform components

AU - Loizou, Philipos C.

AU - Spanias, Andreas

PY - 1991

Y1 - 1991

N2 - A transform-based system is described for vector quantizing speech at 4800 BPS. Multiple codebooks were used to quantize groups of harmonic components of speech, corresponding to different frequency bands. To ensure that the distortion caused by the vector quantizer at low frequencies was kept at a minimum, low-dimensionality codebooks were used. Furthermore, to reduce the errors of the FFT resolution, transform components, within the baseband and adjacent to the pitch-harmonic components, were also encoded and quantized by low-dimensionality codebooks. This was found to improve the quality of speech. Speech produced at 4800 BPS using this method was found to be of good quality, particularly for female speakers. This is because the number of harmonic components required for female (high-pitch) speakers is generally smaller than that required for male (low-pitch) speakers.

AB - A transform-based system is described for vector quantizing speech at 4800 BPS. Multiple codebooks were used to quantize groups of harmonic components of speech, corresponding to different frequency bands. To ensure that the distortion caused by the vector quantizer at low frequencies was kept at a minimum, low-dimensionality codebooks were used. Furthermore, to reduce the errors of the FFT resolution, transform components, within the baseband and adjacent to the pitch-harmonic components, were also encoded and quantized by low-dimensionality codebooks. This was found to improve the quality of speech. Speech produced at 4800 BPS using this method was found to be of good quality, particularly for female speakers. This is because the number of harmonic components required for female (high-pitch) speakers is generally smaller than that required for male (low-pitch) speakers.

UR - http://www.scopus.com/inward/record.url?scp=0026371719&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0026371719&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0026371719

VL - 1

SP - 320

EP - 323

BT - Proceedings - IEEE International Symposium on Circuits and Systems

PB - Publ by IEEE

ER -