Rate determination based on perceptual loudness

Venkatraman Atti, Andreas Spanias

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

6 Citations (Scopus)

Abstract

In this paper, we describe a perceptually motivated rate determination algorithm (RDA) for variable bit rate speech coding. Unlike existing rate selection strategies, which are based on a voice activity detector and energy thresholds, the proposed method employs a perceptual loudness (PL) measure. The TIA IS-127 enhanced variable rate codec (EVRC) was chosen as the test bed for evaluating the performance of the PL-based rate selection strategy relative to three existing methods: voice activity detection, speech frame energy thresholding, and phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against the other rate selection techniques in terms of average bit rate and speech quality.
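The abstract does not specify the PL measure or the decision thresholds, so the following is only a minimal sketch of what frame-wise rate selection driven by a loudness value could look like. The loudness proxy (mean-square power raised to 0.3, after Stevens' power law), the two thresholds, and all function names are assumptions made for illustration and are not taken from the paper. The frame length (160 samples, i.e. 20 ms at 8 kHz) and the three bit rates are those of EVRC.

```python
import math

# EVRC (TIA IS-127) frame: 20 ms at 8 kHz sampling = 160 samples.
FRAME_LEN = 160
# EVRC bit rates in kbit/s: full, half, and eighth rate.
RATES_KBPS = {"full": 8.55, "half": 4.0, "eighth": 0.8}


def frame_loudness(frame):
    """Crude loudness proxy: mean-square power raised to the power 0.3.

    This is a stand-in for illustration only, NOT the perceptual
    loudness measure used in the paper.
    """
    power = sum(x * x for x in frame) / len(frame)
    return power ** 0.3


def select_rate(loudness, low=0.05, high=0.3):
    """Map a loudness value to a rate using two hypothetical thresholds."""
    if loudness < low:
        return "eighth"
    if loudness < high:
        return "half"
    return "full"


def average_bitrate(signal, low=0.05, high=0.3):
    """Split the signal into whole frames, pick a rate per frame,
    and return (list of rates, average bit rate in kbit/s)."""
    rates = []
    for start in range(0, len(signal) - FRAME_LEN + 1, FRAME_LEN):
        frame = signal[start:start + FRAME_LEN]
        rates.append(select_rate(frame_loudness(frame), low, high))
    if not rates:
        raise ValueError("signal is shorter than one frame")
    avg = sum(RATES_KBPS[r] for r in rates) / len(rates)
    return rates, avg


def sine_frame(amplitude, freq_hz=200.0, fs_hz=8000.0):
    """One frame of a sine tone, used here as synthetic test input."""
    return [amplitude * math.sin(2.0 * math.pi * freq_hz * n / fs_hz)
            for n in range(FRAME_LEN)]


if __name__ == "__main__":
    # One silent, one quiet, and one loud frame, concatenated.
    signal = [0.0] * FRAME_LEN + sine_frame(0.05) + sine_frame(1.0)
    rates, avg = average_bitrate(signal)
    print(rates, round(avg, 2))
```

The average bit rate computed this way is one of the two axes on which the abstract compares rate determination algorithms; speech quality, the other axis, cannot be assessed by a sketch like this.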

Original language: English (US)
Title of host publication: Proceedings - IEEE International Symposium on Circuits and Systems
Pages: 848-851
Number of pages: 4
DOIs: https://doi.org/10.1109/ISCAS.2005.1464721
State: Published - 2005
Event: IEEE International Symposium on Circuits and Systems 2005, ISCAS 2005 - Kobe, Japan
Duration: May 23 2005 to May 26 2005

Other

Other: IEEE International Symposium on Circuits and Systems 2005, ISCAS 2005
Country: Japan
City: Kobe
Period: 5/23/05 to 5/26/05

Fingerprint

Speech coding
Speech analysis
Detectors

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Cite this

Atti, V., & Spanias, A. (2005). Rate determination based on perceptual loudness. In Proceedings - IEEE International Symposium on Circuits and Systems (pp. 848-851). [1464721] https://doi.org/10.1109/ISCAS.2005.1464721

@inproceedings{d451e6e426af434580875044f99126ad,
title = "Rate determination based on perceptual loudness",
abstract = "In this paper, we describe a perceptually motivated rate determination algorithm (RDA) for variable bit rate speech coding. Unlike existing rate selection strategies, which are based on a voice activity detector and energy thresholds, the proposed method employs a perceptual loudness (PL) measure. The TIA IS-127 enhanced variable rate codec (EVRC) was chosen as the test bed for evaluating the performance of the PL-based rate selection strategy relative to three existing methods: voice activity detection, speech frame energy thresholding, and phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against the other rate selection techniques in terms of average bit rate and speech quality.",
author = "Venkatraman Atti and Andreas Spanias",
year = "2005",
doi = "10.1109/ISCAS.2005.1464721",
language = "English (US)",
pages = "848--851",
booktitle = "Proceedings - IEEE International Symposium on Circuits and Systems",

}

TY - GEN

T1 - Rate determination based on perceptual loudness

AU - Atti, Venkatraman

AU - Spanias, Andreas

PY - 2005

Y1 - 2005

N2 - In this paper, we describe a perceptually motivated rate determination algorithm (RDA) for variable bit rate speech coding. Unlike existing rate selection strategies, which are based on a voice activity detector and energy thresholds, the proposed method employs a perceptual loudness (PL) measure. The TIA IS-127 enhanced variable rate codec (EVRC) was chosen as the test bed for evaluating the performance of the PL-based rate selection strategy relative to three existing methods: voice activity detection, speech frame energy thresholding, and phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against the other rate selection techniques in terms of average bit rate and speech quality.

AB - In this paper, we describe a perceptually motivated rate determination algorithm (RDA) for variable bit rate speech coding. Unlike existing rate selection strategies, which are based on a voice activity detector and energy thresholds, the proposed method employs a perceptual loudness (PL) measure. The TIA IS-127 enhanced variable rate codec (EVRC) was chosen as the test bed for evaluating the performance of the PL-based rate selection strategy relative to three existing methods: voice activity detection, speech frame energy thresholding, and phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against the other rate selection techniques in terms of average bit rate and speech quality.

UR - http://www.scopus.com/inward/record.url?scp=34250762443&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34250762443&partnerID=8YFLogxK

U2 - 10.1109/ISCAS.2005.1464721

DO - 10.1109/ISCAS.2005.1464721

M3 - Conference contribution

AN - SCOPUS:34250762443

SP - 848

EP - 851

BT - Proceedings - IEEE International Symposium on Circuits and Systems

ER -