Abstract

Code excited linear predictive (CELP) coding standards often fail to properly represent non-speech signals because they are inherently optimized for speech. Most modern CELP coders include provisions for the inclusion of indirect perceptual criteria to counteract this problem, however no direct psychoacoustic models are employed. In this paper, we present a pre- and post- processor for the vocoder that makes use of the MPEG-1 psychoacoustic model 1 in order to enhance the quality of the coded audio. A novel frequency-domain technique is proposed that attempts to shape the residual of a vocoder such that it falls below psychoacoustic thresholds.

Original languageEnglish (US)
Title of host publication2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005
PublisherIEEE Computer Society
ISBN (Print)0780392892, 9780780392892
DOIs
StatePublished - Jan 1 2005
Event2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005 - Shanghai, China
Duration: Oct 30 2005Nov 2 2005

Publication series

Name2005 IEEE 7th Workshop on Multimedia Signal Processing

Other

Other2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005
CountryChina
CityShanghai
Period10/30/0511/2/05

ASJC Scopus subject areas

  • Signal Processing

Cite this

Berisha, V., & Spanias, A. (2005). Enhancing the quality of coded audio using perceptual criteria. In 2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005 [4014034] (2005 IEEE 7th Workshop on Multimedia Signal Processing). IEEE Computer Society. https://doi.org/10.1109/MMSP.2005.248613