Perceptual segmentation and component selection in compact sinusoidal representations of audio

T. Painter; Andreas Spanias

Perceptual segmentation and component selection in compact sinusoidal representations of audio

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg's model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids.

Original language	English (US)
Title of host publication	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Pages	3289-3292
Number of pages	4
Volume	5
State	Published - 2001
Externally published	Yes
Event	2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing - Salt Lake, UT, United States Duration: May 7 2001 → May 11 2001

Other

Other	2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing
Country/Territory	United States
City	Salt Lake, UT
Period	5/7/01 → 5/11/01

ASJC Scopus subject areas

Electrical and Electronic Engineering
Signal Processing
Acoustics and Ultrasonics

Cite this

@inproceedings{d94fd3b417844a74876f350bfbb41420,

title = "Perceptual segmentation and component selection in compact sinusoidal representations of audio",

abstract = "This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg's model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids.",

author = "T. Painter and Andreas Spanias",

year = "2001",

language = "English (US)",

volume = "5",

pages = "3289--3292",

booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

note = "2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing ; Conference date: 07-05-2001 Through 11-05-2001",

}

TY - GEN

T1 - Perceptual segmentation and component selection in compact sinusoidal representations of audio

AU - Painter, T.

AU - Spanias, Andreas

PY - 2001

Y1 - 2001

N2 - This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg's model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids.

AB - This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg's model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids.

UR - http://www.scopus.com/inward/record.url?scp=0034848131&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034848131&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0034848131

VL - 5

SP - 3289

EP - 3292

BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

T2 - 2001 IEEE Interntional Conference on Acoustics, Speech, and Signal Processing

Y2 - 7 May 2001 through 11 May 2001

ER -

Perceptual segmentation and component selection in compact sinusoidal representations of audio

Abstract

Other

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this