Rate determination based on perceptual loudness

Venkatraman Atti, Andreas Spanias

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

6 Scopus citations

Abstract

In this paper, we describe a perceptually motivated rate determination algorithm (RDA) for variable bit rate speech coding. Unlike existing rate selection strategies, which are based on a voice activity detector and energy thresholds, the proposed method employs a perceptual loudness (PL) measure. The TIA IS-127 enhanced variable rate codec (EVRC) has been chosen as the test bed for evaluating the performance of the PL-based rate selection strategy relative to three other existing methods. In particular, the comparative study includes the following rate determination algorithms: voice activity detection, speech frame energy thresholding, and phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against other rate selection techniques in terms of average bit rate and speech quality.

Original language: English (US)
Title of host publication: Proceedings - IEEE International Symposium on Circuits and Systems
Pages: 848-851
Number of pages: 4
DOIs: https://doi.org/10.1109/ISCAS.2005.1464721
State: Published - 2005
Event: IEEE International Symposium on Circuits and Systems 2005, ISCAS 2005 - Kobe, Japan
Duration: May 23 2005 → May 26 2005

Other

Other: IEEE International Symposium on Circuits and Systems 2005, ISCAS 2005
Country: Japan
City: Kobe
Period: 5/23/05 → 5/26/05

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

  • Cite this

    Atti, V., & Spanias, A. (2005). Rate determination based on perceptual loudness. In Proceedings - IEEE International Symposium on Circuits and Systems (pp. 848-851). [1464721] https://doi.org/10.1109/ISCAS.2005.1464721