Perceptual segmentation and component selection in compact sinusoidal representations of audio

T. Painter, Andreas Spanias

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg's model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids.

Original languageEnglish (US)
Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Pages3289-3292
Number of pages4
Volume5
StatePublished - 2001
Externally publishedYes
Event2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing - Salt Lake, UT, United States
Duration: May 7 2001May 11 2001

Other

Other2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing
CountryUnited States
CitySalt Lake, UT
Period5/7/015/11/01

Fingerprint

augmentation
audio signals
loudness
ranking
sine waves
methodology

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Signal Processing
  • Acoustics and Ultrasonics

Cite this

Painter, T., & Spanias, A. (2001). Perceptual segmentation and component selection in compact sinusoidal representations of audio. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 5, pp. 3289-3292)

Perceptual segmentation and component selection in compact sinusoidal representations of audio. / Painter, T.; Spanias, Andreas.

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 5 2001. p. 3289-3292.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Painter, T & Spanias, A 2001, Perceptual segmentation and component selection in compact sinusoidal representations of audio. in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. vol. 5, pp. 3289-3292, 2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing, Salt Lake, UT, United States, 5/7/01.
Painter T, Spanias A. Perceptual segmentation and component selection in compact sinusoidal representations of audio. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 5. 2001. p. 3289-3292
Painter, T. ; Spanias, Andreas. / Perceptual segmentation and component selection in compact sinusoidal representations of audio. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 5 2001. pp. 3289-3292
@inproceedings{d94fd3b417844a74876f350bfbb41420,
title = "Perceptual segmentation and component selection in compact sinusoidal representations of audio",
abstract = "This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg's model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids.",
author = "T. Painter and Andreas Spanias",
year = "2001",
language = "English (US)",
volume = "5",
pages = "3289--3292",
booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

}

TY - GEN

T1 - Perceptual segmentation and component selection in compact sinusoidal representations of audio

AU - Painter, T.

AU - Spanias, Andreas

PY - 2001

Y1 - 2001

N2 - This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg's model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids.

AB - This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg's model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids.

UR - http://www.scopus.com/inward/record.url?scp=0034848131&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034848131&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0034848131

VL - 5

SP - 3289

EP - 3292

BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

ER -