Speech Coding: A Tutorial Review

Andreas Spanias

doi:10.1109/5.326413

Speech Coding: A Tutorial Review

Andreas Spanias

Electrical Engineering

Research output: Contribution to journal › Article › peer-review

283 Scopus citations

Abstract

The past decade has witnessed substantial progress towards the application of low-rate speech coders to civilian and military communications as well as computer-related voice applications. Central to this progress has been the development of new speech coders capable of producing high-quality speech at low data rates. Most of these coders incorporate mechanisms to: Represent the spectral properties of speech, provide for speech waveform matching, and “optimize” the coder's performance for the human ear. A number of these coders have already been adopted in national and international cellular telephony standards. The objective of this paper is to provide a tutorial overview of speech coding methodologies with emphasis on those algorithms that are part of the recent low-rate standards for cellular communications. Although the emphasis is on the new low-rate coders, we attempt to provide a comprehensive survey by covering some of the traditional methodologies as well. We feel that this approach will not only point out key references but will also provide valuable background to the beginner. The paper starts with a historical perspective and continues with a brief discussion on the speech properties and performance measures. We then proceed with descriptions of waveform coders, sinusoidal transform coders, linear predictive vocoders, and analysis-by-synthesis linear predictive coders. Finally, we present concluding remarks followed by a discussion of opportunities for future research.

Original language	English (US)
Pages (from-to)	1541-1582
Number of pages	42
Journal	Proceedings of the IEEE
Volume	82
Issue number	10
DOIs	https://doi.org/10.1109/5.326413
State	Published - Oct 1994

ASJC Scopus subject areas

General Computer Science
Electrical and Electronic Engineering

Access to Document

10.1109/5.326413

Cite this

@article{fcb472de134e4ac5bf5cd4096a4f82f1,

title = "Speech Coding: A Tutorial Review",

abstract = "The past decade has witnessed substantial progress towards the application of low-rate speech coders to civilian and military communications as well as computer-related voice applications. Central to this progress has been the development of new speech coders capable of producing high-quality speech at low data rates. Most of these coders incorporate mechanisms to: Represent the spectral properties of speech, provide for speech waveform matching, and “optimize” the coder's performance for the human ear. A number of these coders have already been adopted in national and international cellular telephony standards. The objective of this paper is to provide a tutorial overview of speech coding methodologies with emphasis on those algorithms that are part of the recent low-rate standards for cellular communications. Although the emphasis is on the new low-rate coders, we attempt to provide a comprehensive survey by covering some of the traditional methodologies as well. We feel that this approach will not only point out key references but will also provide valuable background to the beginner. The paper starts with a historical perspective and continues with a brief discussion on the speech properties and performance measures. We then proceed with descriptions of waveform coders, sinusoidal transform coders, linear predictive vocoders, and analysis-by-synthesis linear predictive coders. Finally, we present concluding remarks followed by a discussion of opportunities for future research.",

author = "Andreas Spanias",

note = "Funding Information: Manuscript received July 6, 1993; revised March 4, 1994. Portions of this work have been supported by Intel Corporation. The author is with the Department of Electrical Engineering, Telecommunications Research Center, Arizona State University, Tempe, AZ 85287-5706 USA. IEEE Log Number 94015 I 1.",

year = "1994",

month = oct,

doi = "10.1109/5.326413",

language = "English (US)",

volume = "82",

pages = "1541--1582",

journal = "Proceedings of the IEEE",

issn = "0018-9219",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "10",

}

TY - JOUR

T1 - Speech Coding

T2 - A Tutorial Review

AU - Spanias, Andreas

N1 - Funding Information: Manuscript received July 6, 1993; revised March 4, 1994. Portions of this work have been supported by Intel Corporation. The author is with the Department of Electrical Engineering, Telecommunications Research Center, Arizona State University, Tempe, AZ 85287-5706 USA. IEEE Log Number 94015 I 1.

PY - 1994/10

Y1 - 1994/10

N2 - The past decade has witnessed substantial progress towards the application of low-rate speech coders to civilian and military communications as well as computer-related voice applications. Central to this progress has been the development of new speech coders capable of producing high-quality speech at low data rates. Most of these coders incorporate mechanisms to: Represent the spectral properties of speech, provide for speech waveform matching, and “optimize” the coder's performance for the human ear. A number of these coders have already been adopted in national and international cellular telephony standards. The objective of this paper is to provide a tutorial overview of speech coding methodologies with emphasis on those algorithms that are part of the recent low-rate standards for cellular communications. Although the emphasis is on the new low-rate coders, we attempt to provide a comprehensive survey by covering some of the traditional methodologies as well. We feel that this approach will not only point out key references but will also provide valuable background to the beginner. The paper starts with a historical perspective and continues with a brief discussion on the speech properties and performance measures. We then proceed with descriptions of waveform coders, sinusoidal transform coders, linear predictive vocoders, and analysis-by-synthesis linear predictive coders. Finally, we present concluding remarks followed by a discussion of opportunities for future research.

AB - The past decade has witnessed substantial progress towards the application of low-rate speech coders to civilian and military communications as well as computer-related voice applications. Central to this progress has been the development of new speech coders capable of producing high-quality speech at low data rates. Most of these coders incorporate mechanisms to: Represent the spectral properties of speech, provide for speech waveform matching, and “optimize” the coder's performance for the human ear. A number of these coders have already been adopted in national and international cellular telephony standards. The objective of this paper is to provide a tutorial overview of speech coding methodologies with emphasis on those algorithms that are part of the recent low-rate standards for cellular communications. Although the emphasis is on the new low-rate coders, we attempt to provide a comprehensive survey by covering some of the traditional methodologies as well. We feel that this approach will not only point out key references but will also provide valuable background to the beginner. The paper starts with a historical perspective and continues with a brief discussion on the speech properties and performance measures. We then proceed with descriptions of waveform coders, sinusoidal transform coders, linear predictive vocoders, and analysis-by-synthesis linear predictive coders. Finally, we present concluding remarks followed by a discussion of opportunities for future research.

UR - http://www.scopus.com/inward/record.url?scp=0028515948&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0028515948&partnerID=8YFLogxK

U2 - 10.1109/5.326413

DO - 10.1109/5.326413

M3 - Article

AN - SCOPUS:0028515948

SN - 0018-9219

VL - 82

SP - 1541

EP - 1582

JO - Proceedings of the IEEE

JF - Proceedings of the IEEE

IS - 10

ER -

Speech Coding: A Tutorial Review

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this