Perceptual segmentation and component selection in compact sinusoidal representations of audio

T. Painter, Andreas Spanias

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Scopus citations

Abstract

This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg's model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids.

Original languageEnglish (US)
Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Pages3289-3292
Number of pages4
Volume5
StatePublished - 2001
Externally publishedYes
Event2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing - Salt Lake, UT, United States
Duration: May 7 2001May 11 2001

Other

Other2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing
CountryUnited States
CitySalt Lake, UT
Period5/7/015/11/01

    Fingerprint

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Signal Processing
  • Acoustics and Ultrasonics

Cite this

Painter, T., & Spanias, A. (2001). Perceptual segmentation and component selection in compact sinusoidal representations of audio. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 5, pp. 3289-3292)