Abstract
The sinusoidal transform coding (STC) is a frequency-domain speech compression technique, in which finite duration segments of speech signal are represented by linear combination of sinusoids with time-varying amplitudes, phases, and frequencies. The STC is known to reconstructed speech of high quality at data rats below 10 kbps. It can be shown that if the measured sine wave frequencies are replaced by a harmonic set, then reconstructed speech of good quality can still be obtained. The methods that are discussed in this paper have been exploited in the development of the STC coders at data rates from 9.6 to 2.4 kbps and resulted in reconstructed speech of high quality and intelligibility. Accurate pitch detection algorithm, perception-based split vector quantization, improved overlap/add and frame interpolation algorithms, minimum variance phase estimation, and finally computational efficiency are the basic features that discriminate our implementations from other implementations of sinusoidal coders. This paper focuses on the development of a fully quantized sinusoidal coder at 2.4 kbps.
Original language | English (US) |
---|---|
Title of host publication | Conference Record of the Asilomar Conference on Signals, Systems and Computers |
Publisher | IEEE |
Pages | 770-774 |
Number of pages | 5 |
Volume | 1 |
State | Published - 1997 |
Event | Proceedings of the 1996 30th Asilomar Conference on Signals, Systems & Computers. Part 2 (of 2) - Pacific Grove, CA, USA Duration: Nov 3 1996 → Nov 6 1996 |
Other
Other | Proceedings of the 1996 30th Asilomar Conference on Signals, Systems & Computers. Part 2 (of 2) |
---|---|
City | Pacific Grove, CA, USA |
Period | 11/3/96 → 11/6/96 |
ASJC Scopus subject areas
- Hardware and Architecture
- Signal Processing
- Electrical and Electronic Engineering