Abstract
This paper presents two fundamental enhancements to a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric to optimal time segmentation for transient analysis. In particular, Moore and Glasberg's model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second, and perhaps more significant, STN enhancement is a new methodology for ranking and selecting the most perceptually relevant sinusoids. A systematic procedure is developed for selecting a compact set of sinusoids, and comparative results are given to demonstrate the merit of this method.
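To make the idea of ranking sinusoids by perceptual relevance concrete, the following is a minimal sketch and not the procedure developed in the paper. It assumes a much cruder stand-in metric: each sinusoid is scored by its level above Terhardt's approximation of the threshold in quiet, rather than by the modified Moore and Glasberg partial loudness model. The function names, the `(freq_hz, level_db_spl)` representation, and the candidate values are all hypothetical.

```python
import math


def threshold_in_quiet_db(freq_hz):
    """Terhardt's approximation of the absolute threshold of hearing (dB SPL).

    Used here only as a simple stand-in for a perceptual metric; it is NOT
    the partial loudness model referred to in the abstract.
    """
    khz = freq_hz / 1000.0
    return (3.64 * khz ** -0.8
            - 6.5 * math.exp(-0.6 * (khz - 3.3) ** 2)
            + 1e-3 * khz ** 4)


def rank_sinusoids(sinusoids, keep):
    """Return up to `keep` sinusoids with the largest level above threshold.

    `sinusoids` is a list of (freq_hz, level_db_spl) pairs (assumed format).
    """
    ranked = sorted(
        sinusoids,
        key=lambda s: s[1] - threshold_in_quiet_db(s[0]),
        reverse=True,  # most audible first
    )
    return ranked[:keep]


# Hypothetical candidate sinusoids: (frequency in Hz, level in dB SPL).
candidates = [(100.0, 40.0), (1000.0, 35.0), (3500.0, 30.0), (15000.0, 40.0)]
selected = rank_sinusoids(candidates, keep=2)
print(selected)
```

In this toy example the 15 kHz component is ranked last despite having the highest level, because the threshold in quiet rises steeply at high frequencies. A loudness-based metric such as the one described in the abstract would additionally account for masking by the other signal components, which this sketch ignores.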
Original language | English (US) |
---|---|
Pages (from-to) | 149-161 |
Number of pages | 13 |
Journal | IEEE Transactions on Speech and Audio Processing |
Volume | 13 |
Issue number | 2 |
State | Published - Mar 2005 |
Keywords
- Audio coding
- Psychoacoustics
- Segmentation
- Sinusoidal models
ASJC Scopus subject areas
- Software
- Acoustics and Ultrasonics
- Computer Vision and Pattern Recognition
- Electrical and Electronic Engineering