New techniques for sinusoidal coding of speech at 2400 bps

Sassan Ahmadi; Andreas Spanias

Electrical Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Scopus citations

Abstract

Sinusoidal transform coding (STC) is a frequency-domain speech compression technique in which finite-duration segments of the speech signal are represented by a linear combination of sinusoids with time-varying amplitudes, phases, and frequencies. STC is known to produce reconstructed speech of high quality at data rates below 10 kbps. It can be shown that if the measured sine-wave frequencies are replaced by a harmonic set, then reconstructed speech of good quality can still be obtained. The methods discussed in this paper have been exploited in the development of STC coders at data rates from 9.6 to 2.4 kbps and have resulted in reconstructed speech of high quality and intelligibility. An accurate pitch detection algorithm, perception-based split vector quantization, improved overlap/add and frame interpolation algorithms, minimum variance phase estimation, and computational efficiency are the basic features that distinguish our implementations from other implementations of sinusoidal coders. This paper focuses on the development of a fully quantized sinusoidal coder at 2.4 kbps.
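
As a rough illustration of the harmonic model mentioned in the abstract (measured sine-wave frequencies replaced by integer multiples of a pitch frequency), the following Python sketch synthesizes a single frame as a sum of harmonics. It is not the authors' coder: the function name, the 8 kHz sampling rate, the 160-sample frame length, and the use of constant amplitudes and phases within the frame are assumptions made purely for illustration. Quantization, overlap/add, and frame interpolation are omitted.

```python
import math


def synthesize_harmonic_frame(f0_hz, amplitudes, phases,
                              fs_hz=8000.0, frame_len=160):
    """Synthesize one frame as a sum of harmonics of f0_hz.

    amplitudes[k-1] and phases[k-1] belong to the k-th harmonic (k = 1, 2, ...).
    All parameter names and default values are illustrative assumptions.
    """
    if len(amplitudes) != len(phases):
        raise ValueError("amplitudes and phases must have the same length")

    frame = []
    for n in range(frame_len):
        sample = 0.0
        for k, (amp, phi) in enumerate(zip(amplitudes, phases), start=1):
            # k-th harmonic: frequency k * f0, constant amplitude and phase offset
            sample += amp * math.cos(2.0 * math.pi * k * f0_hz * n / fs_hz + phi)
        frame.append(sample)
    return frame


if __name__ == "__main__":
    # Two harmonics of a 100 Hz pitch, zero phase offsets.
    frame = synthesize_harmonic_frame(100.0, [1.0, 0.5], [0.0, 0.0])
    print(len(frame), frame[0])
```

With a 100 Hz pitch at the assumed 8 kHz sampling rate, the synthesized waveform repeats every 80 samples, which is the property that lets a harmonic coder transmit a single pitch value instead of every measured sine-wave frequency.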

Original language	English (US)
Title of host publication	Conference Record of the Asilomar Conference on Signals, Systems and Computers
Publisher	IEEE
Pages	770-774
Number of pages	5
Volume	1
State	Published - 1997
Event	Proceedings of the 1996 30th Asilomar Conference on Signals, Systems & Computers. Part 2 (of 2) - Pacific Grove, CA, USA Duration: Nov 3 1996 → Nov 6 1996

ASJC Scopus subject areas

Hardware and Architecture
Signal Processing
Electrical and Electronic Engineering

Cite this

@inproceedings{e28cd15469dc4bbebfa1aedd1382f4fe,

title = "New techniques for sinusoidal coding of speech at 2400 bps",

abstract = "Sinusoidal transform coding (STC) is a frequency-domain speech compression technique in which finite-duration segments of the speech signal are represented by a linear combination of sinusoids with time-varying amplitudes, phases, and frequencies. STC is known to produce reconstructed speech of high quality at data rates below 10 kbps. It can be shown that if the measured sine-wave frequencies are replaced by a harmonic set, then reconstructed speech of good quality can still be obtained. The methods discussed in this paper have been exploited in the development of STC coders at data rates from 9.6 to 2.4 kbps and have resulted in reconstructed speech of high quality and intelligibility. An accurate pitch detection algorithm, perception-based split vector quantization, improved overlap/add and frame interpolation algorithms, minimum variance phase estimation, and computational efficiency are the basic features that distinguish our implementations from other implementations of sinusoidal coders. This paper focuses on the development of a fully quantized sinusoidal coder at 2.4 kbps.",

author = "Sassan Ahmadi and Andreas Spanias",

year = "1997",

language = "English (US)",

volume = "1",

pages = "770--774",

booktitle = "Conference Record of the Asilomar Conference on Signals, Systems and Computers",

publisher = "IEEE",

note = "Proceedings of the 1996 30th Asilomar Conference on Signals, Systems \& Computers. Part 2 (of 2) ; Conference date: 03-11-1996 Through 06-11-1996",

}

TY - GEN

T1 - New techniques for sinusoidal coding of speech at 2400 bps

AU - Ahmadi, Sassan

AU - Spanias, Andreas

PY - 1997

Y1 - 1997

N2 - Sinusoidal transform coding (STC) is a frequency-domain speech compression technique in which finite-duration segments of the speech signal are represented by a linear combination of sinusoids with time-varying amplitudes, phases, and frequencies. STC is known to produce reconstructed speech of high quality at data rates below 10 kbps. It can be shown that if the measured sine-wave frequencies are replaced by a harmonic set, then reconstructed speech of good quality can still be obtained. The methods discussed in this paper have been exploited in the development of STC coders at data rates from 9.6 to 2.4 kbps and have resulted in reconstructed speech of high quality and intelligibility. An accurate pitch detection algorithm, perception-based split vector quantization, improved overlap/add and frame interpolation algorithms, minimum variance phase estimation, and computational efficiency are the basic features that distinguish our implementations from other implementations of sinusoidal coders. This paper focuses on the development of a fully quantized sinusoidal coder at 2.4 kbps.

AB - Sinusoidal transform coding (STC) is a frequency-domain speech compression technique in which finite-duration segments of the speech signal are represented by a linear combination of sinusoids with time-varying amplitudes, phases, and frequencies. STC is known to produce reconstructed speech of high quality at data rates below 10 kbps. It can be shown that if the measured sine-wave frequencies are replaced by a harmonic set, then reconstructed speech of good quality can still be obtained. The methods discussed in this paper have been exploited in the development of STC coders at data rates from 9.6 to 2.4 kbps and have resulted in reconstructed speech of high quality and intelligibility. An accurate pitch detection algorithm, perception-based split vector quantization, improved overlap/add and frame interpolation algorithms, minimum variance phase estimation, and computational efficiency are the basic features that distinguish our implementations from other implementations of sinusoidal coders. This paper focuses on the development of a fully quantized sinusoidal coder at 2.4 kbps.

UR - http://www.scopus.com/inward/record.url?scp=0030647540&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0030647540&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0030647540

VL - 1

SP - 770

EP - 774

BT - Conference Record of the Asilomar Conference on Signals, Systems and Computers

PB - IEEE

T2 - Proceedings of the 1996 30th Asilomar Conference on Signals, Systems & Computers. Part 2 (of 2)

Y2 - 3 November 1996 through 6 November 1996

ER -
