Enhancing the quality of coded audio using perceptual criteria

Visar Berisha; Andreas Spanias

doi:10.1109/MMSP.2005.248613

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Scopus citations

Abstract

Code-excited linear predictive (CELP) coding standards often fail to represent non-speech signals properly because they are inherently optimized for speech. Most modern CELP coders include provisions for indirect perceptual criteria to counteract this problem; however, no direct psychoacoustic models are employed. In this paper, we present a pre- and post-processor for the vocoder that uses the MPEG-1 psychoacoustic model 1 to enhance the quality of the coded audio. A novel frequency-domain technique is proposed that attempts to shape the residual of a vocoder such that it falls below psychoacoustic thresholds.
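The idea of pushing a residual below a psychoacoustic threshold can be illustrated with a minimal sketch. This is not the authors' method, which the abstract does not detail. It assumes the residual is already available as per-bin levels in dB SPL, and it stands in for the full MPEG-1 psychoacoustic model 1 (which also accounts for tonal and noise maskers) with only the threshold in quiet, using Terhardt's well-known approximation. The helper names `threshold_in_quiet_db` and `shaping_gains_db`, and the toy residual values, are hypothetical.

```python
import math

def threshold_in_quiet_db(f_hz):
    """Terhardt's approximation of the absolute hearing threshold (dB SPL).

    Assumption: used here as a simplified stand-in for a full masking
    threshold; f_hz must be positive.
    """
    k = f_hz / 1000.0
    return (3.64 * k ** -0.8
            - 6.5 * math.exp(-0.6 * (k - 3.3) ** 2)
            + 1e-3 * k ** 4)

def shaping_gains_db(residual_db, threshold_db, margin_db=0.0):
    """Per-bin gains (dB, never positive) that bring each residual level
    to or below the threshold minus a safety margin.

    Hypothetical helper for illustration, not taken from the paper.
    """
    return [min(0.0, t - margin_db - r)
            for r, t in zip(residual_db, threshold_db)]

# Toy example: residual levels (dB SPL) at a few bin centre frequencies.
freqs_hz = [250.0, 1000.0, 3300.0, 8000.0]
residual_db = [20.0, 1.0, 5.0, -10.0]  # made-up values for illustration

threshold_db = [threshold_in_quiet_db(f) for f in freqs_hz]
gains_db = shaping_gains_db(residual_db, threshold_db)
shaped_db = [r + g for r, g in zip(residual_db, gains_db)]

for f, r, t, s in zip(freqs_hz, residual_db, threshold_db, shaped_db):
    print(f"{f:7.0f} Hz  residual {r:6.1f} dB  "
          f"threshold {t:6.1f} dB  shaped {s:6.1f} dB")
```

Bins that already sit below the threshold are left untouched (gain of 0 dB); only bins that exceed it are attenuated. A real system would derive the threshold from the signal itself, frame by frame, rather than from a fixed curve.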

Original language	English (US)
Title of host publication	2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005
Publisher	IEEE Computer Society
ISBN (Print)	0780392892, 9780780392892
DOIs	https://doi.org/10.1109/MMSP.2005.248613
State	Published - 2005
Event	2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005 - Shanghai, China Duration: Oct 30 2005 → Nov 2 2005

Publication series

Name	2005 IEEE 7th Workshop on Multimedia Signal Processing

Other

Other	2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005
Country/Territory	China
City	Shanghai
Period	10/30/05 → 11/2/05

ASJC Scopus subject areas

Signal Processing

Cite this

Berisha, V & Spanias, A 2005, Enhancing the quality of coded audio using perceptual criteria. in 2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005., 4014034, 2005 IEEE 7th Workshop on Multimedia Signal Processing, IEEE Computer Society, 2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005, Shanghai, China, 10/30/05. https://doi.org/10.1109/MMSP.2005.248613

@inproceedings{1768ab1a1e804f66a7630c1c56a24e39,
title = "Enhancing the quality of coded audio using perceptual criteria",
abstract = "Code-excited linear predictive (CELP) coding standards often fail to represent non-speech signals properly because they are inherently optimized for speech. Most modern CELP coders include provisions for indirect perceptual criteria to counteract this problem; however, no direct psychoacoustic models are employed. In this paper, we present a pre- and post-processor for the vocoder that uses the MPEG-1 psychoacoustic model 1 to enhance the quality of the coded audio. A novel frequency-domain technique is proposed that attempts to shape the residual of a vocoder such that it falls below psychoacoustic thresholds.",
author = "Visar Berisha and Andreas Spanias",
year = "2005",
doi = "10.1109/MMSP.2005.248613",
language = "English (US)",
isbn = "0780392892",
series = "2005 IEEE 7th Workshop on Multimedia Signal Processing",
publisher = "IEEE Computer Society",
booktitle = "2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005",
note = "2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005 ; Conference date: 30-10-2005 Through 02-11-2005",
}

TY - GEN
T1 - Enhancing the quality of coded audio using perceptual criteria
AU - Berisha, Visar
AU - Spanias, Andreas
PY - 2005
Y1 - 2005
AB - Code-excited linear predictive (CELP) coding standards often fail to represent non-speech signals properly because they are inherently optimized for speech. Most modern CELP coders include provisions for indirect perceptual criteria to counteract this problem; however, no direct psychoacoustic models are employed. In this paper, we present a pre- and post-processor for the vocoder that uses the MPEG-1 psychoacoustic model 1 to enhance the quality of the coded audio. A novel frequency-domain technique is proposed that attempts to shape the residual of a vocoder such that it falls below psychoacoustic thresholds.
UR - http://www.scopus.com/inward/record.url?scp=37649032342&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=37649032342&partnerID=8YFLogxK
U2 - 10.1109/MMSP.2005.248613
DO - 10.1109/MMSP.2005.248613
M3 - Conference contribution
AN - SCOPUS:37649032342
SN - 0780392892
SN - 9780780392892
T3 - 2005 IEEE 7th Workshop on Multimedia Signal Processing
BT - 2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005
PB - IEEE Computer Society
T2 - 2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005
Y2 - 30 October 2005 through 2 November 2005
ER -
