Abstract

Code excited linear predictive (CELP) coding standards often fail to properly represent non-speech signals because they are inherently optimized for speech. Most modern CELP coders include provisions for the inclusion of indirect perceptual criteria to counteract this problem, however no direct psychoacoustic models are employed. In this paper, we present a pre- and post- processor for the vocoder that makes use of the MPEG-1 psychoacoustic model 1 in order to enhance the quality of the coded audio. A novel frequency-domain technique is proposed that attempts to shape the residual of a vocoder such that it falls below psychoacoustic thresholds.

Original languageEnglish (US)
Title of host publication2005 IEEE 7th Workshop on Multimedia Signal Processing
DOIs
StatePublished - 2006
Event2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005 - Shanghai, China
Duration: Oct 30 2005Nov 2 2005

Other

Other2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005
CountryChina
CityShanghai
Period10/30/0511/2/05

ASJC Scopus subject areas

  • Signal Processing

Cite this

Enhancing the quality of coded audio using perceptual criteria. / Berisha, Visar; Spanias, Andreas.

2005 IEEE 7th Workshop on Multimedia Signal Processing. 2006. 4014034.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Berisha, V & Spanias, A 2006, Enhancing the quality of coded audio using perceptual criteria. in 2005 IEEE 7th Workshop on Multimedia Signal Processing., 4014034, 2005 IEEE 7th Workshop on Multimedia Signal Processing, MMSP 2005, Shanghai, China, 10/30/05. https://doi.org/10.1109/MMSP.2005.248613
Berisha, Visar ; Spanias, Andreas. / Enhancing the quality of coded audio using perceptual criteria. 2005 IEEE 7th Workshop on Multimedia Signal Processing. 2006.
@inproceedings{1768ab1a1e804f66a7630c1c56a24e39,
title = "Enhancing the quality of coded audio using perceptual criteria",
abstract = "Code excited linear predictive (CELP) coding standards often fail to properly represent non-speech signals because they are inherently optimized for speech. Most modern CELP coders include provisions for the inclusion of indirect perceptual criteria to counteract this problem, however no direct psychoacoustic models are employed. In this paper, we present a pre- and post- processor for the vocoder that makes use of the MPEG-1 psychoacoustic model 1 in order to enhance the quality of the coded audio. A novel frequency-domain technique is proposed that attempts to shape the residual of a vocoder such that it falls below psychoacoustic thresholds.",
author = "Visar Berisha and Andreas Spanias",
year = "2006",
doi = "10.1109/MMSP.2005.248613",
language = "English (US)",
isbn = "0780392892",
booktitle = "2005 IEEE 7th Workshop on Multimedia Signal Processing",

}

TY - GEN

T1 - Enhancing the quality of coded audio using perceptual criteria

AU - Berisha, Visar

AU - Spanias, Andreas

PY - 2006

Y1 - 2006

N2 - Code excited linear predictive (CELP) coding standards often fail to properly represent non-speech signals because they are inherently optimized for speech. Most modern CELP coders include provisions for the inclusion of indirect perceptual criteria to counteract this problem, however no direct psychoacoustic models are employed. In this paper, we present a pre- and post- processor for the vocoder that makes use of the MPEG-1 psychoacoustic model 1 in order to enhance the quality of the coded audio. A novel frequency-domain technique is proposed that attempts to shape the residual of a vocoder such that it falls below psychoacoustic thresholds.

AB - Code excited linear predictive (CELP) coding standards often fail to properly represent non-speech signals because they are inherently optimized for speech. Most modern CELP coders include provisions for the inclusion of indirect perceptual criteria to counteract this problem, however no direct psychoacoustic models are employed. In this paper, we present a pre- and post- processor for the vocoder that makes use of the MPEG-1 psychoacoustic model 1 in order to enhance the quality of the coded audio. A novel frequency-domain technique is proposed that attempts to shape the residual of a vocoder such that it falls below psychoacoustic thresholds.

UR - http://www.scopus.com/inward/record.url?scp=37649032342&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=37649032342&partnerID=8YFLogxK

U2 - 10.1109/MMSP.2005.248613

DO - 10.1109/MMSP.2005.248613

M3 - Conference contribution

SN - 0780392892

SN - 9780780392892

BT - 2005 IEEE 7th Workshop on Multimedia Signal Processing

ER -