Abstract
An improved cepstrum-based voicing detection and pitch determination algorithm is presented. Voicing decisions are made using a multifeature voiced/unvoiced classification algorithm based on statistical analysis of cepstral peak, zero-crossing rate, and energy of short-time segments of the speech signal. Pitch frequency information is extracted by a modified cepstrum-based method and then carefully refined using pitch tracking, correction, and smoothing algorithms. Performance analysis on a large database indicates considerable improvement relative to the conventional cepstrum method. The proposed algorithm is also shown to be robust to additive noise.
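For orientation, the three per-frame features named in the abstract (cepstral peak, zero-crossing rate, and short-time energy) can be computed roughly as follows. This is a minimal sketch of the conventional cepstrum approach, not the paper's modified method. The function names, the 60-400 Hz search range, and the thresholds in `is_voiced` are illustrative assumptions. The paper bases its voicing decision on statistical analysis of these features, and the fixed-threshold rule here only stands in for that.

```python
import numpy as np

def frame_features(frame, fs, fmin=60.0, fmax=400.0):
    """Return (cepstral_peak, zcr, energy, pitch_hz) for one speech frame.

    fmin/fmax bound the pitch search range; the defaults are assumed
    values, not taken from the paper.
    """
    frame = np.asarray(frame, dtype=float)
    windowed = frame * np.hamming(len(frame))

    # Real cepstrum: inverse FFT of the log magnitude spectrum.
    log_mag = np.log(np.abs(np.fft.rfft(windowed)) + 1e-12)
    cepstrum = np.fft.irfft(log_mag)

    # Search for the largest cepstral value in the plausible pitch-period range.
    qmin = int(fs / fmax)
    qmax = int(fs / fmin)
    q = qmin + int(np.argmax(cepstrum[qmin:qmax + 1]))
    peak = float(cepstrum[q])

    # Zero-crossing rate: fraction of adjacent sample pairs that change sign.
    signs = np.signbit(frame)
    zcr = float(np.mean(signs[1:] != signs[:-1]))

    # Short-time energy: mean squared amplitude.
    energy = float(np.mean(frame ** 2))

    return peak, zcr, energy, fs / q

def is_voiced(peak, zcr, energy,
              peak_thresh=0.1, zcr_thresh=0.3, energy_thresh=1e-4):
    """Hypothetical fixed-threshold rule; all thresholds are placeholders."""
    return bool(peak > peak_thresh and zcr < zcr_thresh and energy > energy_thresh)
```

In a full system, a function like `frame_features` would be applied to overlapping frames, and the resulting pitch contour would then pass through the tracking, correction, and smoothing stages the abstract mentions, which are not sketched here.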
Original language | English (US) |
---|---|
Pages (from-to) | 333-338 |
Number of pages | 6 |
Journal | IEEE Transactions on Speech and Audio Processing |
Volume | 7 |
Issue number | 3 |
State | Published - 1999 |
Externally published | Yes |
ASJC Scopus subject areas
- Software
- Acoustics and Ultrasonics
- Computer Vision and Pattern Recognition
- Electrical and Electronic Engineering