Low rate speech representation by vector quantizing transform components

Philipos C. Loizou; Andreas Spanias

Low rate speech representation by vector quantizing transform components

Electrical Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

A transform-based system is described for vector quantizing speech at 4800 BPS. Multiple codebooks were used to quantize groups of harmonic components of speech, corresponding to different frequency bands. To ensure that the distortion caused by the vector quantizer at low frequencies was kept at a minimum, low-dimensionality codebooks were used. Furthermore, to reduce the errors of the FFT resolution, transform components, within the baseband and adjacent to the pitch-harmonic components, were also encoded and quantized by low-dimensionality codebooks. This was found to improve the quality of speech. Speech produced at 4800 BPS using this method was found to be of good quality, particularly for female speakers. This is because the number of harmonic components required for female (high-pitch) speakers is generally smaller than that required for male (low-pitch) speakers.

Original language	English (US)
Title of host publication	Proceedings - IEEE International Symposium on Circuits and Systems
Publisher	Publ by IEEE
Pages	320-323
Number of pages	4
Volume	1
State	Published - 1991
Event	1991 IEEE International Symposium on Circuits and Systems Part 1 (of 5) - Singapore, Singapore Duration: Jun 11 1991 → Jun 14 1991

Other

Other	1991 IEEE International Symposium on Circuits and Systems Part 1 (of 5)
City	Singapore, Singapore
Period	6/11/91 → 6/14/91

ASJC Scopus subject areas

Electrical and Electronic Engineering
Electronic, Optical and Magnetic Materials

Cite this

@inproceedings{1c63437112be4240851dfa106e76113f,

title = "Low rate speech representation by vector quantizing transform components",

abstract = "A transform-based system is described for vector quantizing speech at 4800 BPS. Multiple codebooks were used to quantize groups of harmonic components of speech, corresponding to different frequency bands. To ensure that the distortion caused by the vector quantizer at low frequencies was kept at a minimum, low-dimensionality codebooks were used. Furthermore, to reduce the errors of the FFT resolution, transform components, within the baseband and adjacent to the pitch-harmonic components, were also encoded and quantized by low-dimensionality codebooks. This was found to improve the quality of speech. Speech produced at 4800 BPS using this method was found to be of good quality, particularly for female speakers. This is because the number of harmonic components required for female (high-pitch) speakers is generally smaller than that required for male (low-pitch) speakers.",

author = "Loizou, {Philipos C.} and Andreas Spanias",

year = "1991",

language = "English (US)",

volume = "1",

pages = "320--323",

booktitle = "Proceedings - IEEE International Symposium on Circuits and Systems",

publisher = "Publ by IEEE",

note = "1991 IEEE International Symposium on Circuits and Systems Part 1 (of 5) ; Conference date: 11-06-1991 Through 14-06-1991",

}

TY - GEN

T1 - Low rate speech representation by vector quantizing transform components

AU - Loizou, Philipos C.

AU - Spanias, Andreas

PY - 1991

Y1 - 1991

N2 - A transform-based system is described for vector quantizing speech at 4800 BPS. Multiple codebooks were used to quantize groups of harmonic components of speech, corresponding to different frequency bands. To ensure that the distortion caused by the vector quantizer at low frequencies was kept at a minimum, low-dimensionality codebooks were used. Furthermore, to reduce the errors of the FFT resolution, transform components, within the baseband and adjacent to the pitch-harmonic components, were also encoded and quantized by low-dimensionality codebooks. This was found to improve the quality of speech. Speech produced at 4800 BPS using this method was found to be of good quality, particularly for female speakers. This is because the number of harmonic components required for female (high-pitch) speakers is generally smaller than that required for male (low-pitch) speakers.

AB - A transform-based system is described for vector quantizing speech at 4800 BPS. Multiple codebooks were used to quantize groups of harmonic components of speech, corresponding to different frequency bands. To ensure that the distortion caused by the vector quantizer at low frequencies was kept at a minimum, low-dimensionality codebooks were used. Furthermore, to reduce the errors of the FFT resolution, transform components, within the baseband and adjacent to the pitch-harmonic components, were also encoded and quantized by low-dimensionality codebooks. This was found to improve the quality of speech. Speech produced at 4800 BPS using this method was found to be of good quality, particularly for female speakers. This is because the number of harmonic components required for female (high-pitch) speakers is generally smaller than that required for male (low-pitch) speakers.

UR - http://www.scopus.com/inward/record.url?scp=0026371719&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0026371719&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0026371719

VL - 1

SP - 320

EP - 323

BT - Proceedings - IEEE International Symposium on Circuits and Systems

PB - Publ by IEEE

T2 - 1991 IEEE International Symposium on Circuits and Systems Part 1 (of 5)

Y2 - 11 June 1991 through 14 June 1991

ER -

Low rate speech representation by vector quantizing transform components

Abstract

Other

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this