Embedding perceptual metrics in rate control algorithms

Venkatraman Atti, Andreas Spanias

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we describe a perceptuallymotivated rate control algorithm for variable bit rate (VBR) speech coding. Unlike the existing rate selection strategies that are based on statistical metrics, the proposed method employs a perceptual loudness (PL) measure. The VBR speech standard TIA IS-127 enhanced variable rate codec (EVRC) has been chosen as the test-bed for evaluating the performance of the PL-based rate selection strategy relative to three other existing methods. In particular, the comparative study includes the following rate determination algorithms: voice activity detection, speech frame energy-thresholding, and phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against other rate selection techniques in terms of average bitrate and speech quality.

Original languageEnglish (US)
Title of host publicationProceedings of the 20th IEEE International Symposium on Intelligent Control, ISIC '05 and the 13th Mediterranean Conference on Control and Automation, MED '05
Pages861-865
Number of pages5
Volume2005
DOIs
StatePublished - 2005
Event20th IEEE International Symposium on Intelligent Control, ISIC '05 and the13th Mediterranean Conference on Control and Automation, MED '05 - Limassol, Cyprus
Duration: Jun 27 2005Jun 29 2005

Other

Other20th IEEE International Symposium on Intelligent Control, ISIC '05 and the13th Mediterranean Conference on Control and Automation, MED '05
CountryCyprus
CityLimassol
Period6/27/056/29/05

Fingerprint

Speech coding
Speech analysis

Keywords

  • Perceptual rate control algorithm
  • Signal processing
  • Speech modeling
  • Variable bit rate speech analysis

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Atti, V., & Spanias, A. (2005). Embedding perceptual metrics in rate control algorithms. In Proceedings of the 20th IEEE International Symposium on Intelligent Control, ISIC '05 and the 13th Mediterranean Conference on Control and Automation, MED '05 (Vol. 2005, pp. 861-865). [1467127] https://doi.org/10.1109/.2005.1467127

Embedding perceptual metrics in rate control algorithms. / Atti, Venkatraman; Spanias, Andreas.

Proceedings of the 20th IEEE International Symposium on Intelligent Control, ISIC '05 and the 13th Mediterranean Conference on Control and Automation, MED '05. Vol. 2005 2005. p. 861-865 1467127.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Atti, V & Spanias, A 2005, Embedding perceptual metrics in rate control algorithms. in Proceedings of the 20th IEEE International Symposium on Intelligent Control, ISIC '05 and the 13th Mediterranean Conference on Control and Automation, MED '05. vol. 2005, 1467127, pp. 861-865, 20th IEEE International Symposium on Intelligent Control, ISIC '05 and the13th Mediterranean Conference on Control and Automation, MED '05, Limassol, Cyprus, 6/27/05. https://doi.org/10.1109/.2005.1467127
Atti V, Spanias A. Embedding perceptual metrics in rate control algorithms. In Proceedings of the 20th IEEE International Symposium on Intelligent Control, ISIC '05 and the 13th Mediterranean Conference on Control and Automation, MED '05. Vol. 2005. 2005. p. 861-865. 1467127 https://doi.org/10.1109/.2005.1467127
Atti, Venkatraman ; Spanias, Andreas. / Embedding perceptual metrics in rate control algorithms. Proceedings of the 20th IEEE International Symposium on Intelligent Control, ISIC '05 and the 13th Mediterranean Conference on Control and Automation, MED '05. Vol. 2005 2005. pp. 861-865
@inproceedings{b23ab87a8d634bbda6467dcb5fd54368,
title = "Embedding perceptual metrics in rate control algorithms",
abstract = "In this paper, we describe a perceptuallymotivated rate control algorithm for variable bit rate (VBR) speech coding. Unlike the existing rate selection strategies that are based on statistical metrics, the proposed method employs a perceptual loudness (PL) measure. The VBR speech standard TIA IS-127 enhanced variable rate codec (EVRC) has been chosen as the test-bed for evaluating the performance of the PL-based rate selection strategy relative to three other existing methods. In particular, the comparative study includes the following rate determination algorithms: voice activity detection, speech frame energy-thresholding, and phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against other rate selection techniques in terms of average bitrate and speech quality.",
keywords = "Perceptual rate control algorithm, Signal processing, Speech modeling, Variable bit rate speech analysis",
author = "Venkatraman Atti and Andreas Spanias",
year = "2005",
doi = "10.1109/.2005.1467127",
language = "English (US)",
isbn = "0780389360",
volume = "2005",
pages = "861--865",
booktitle = "Proceedings of the 20th IEEE International Symposium on Intelligent Control, ISIC '05 and the 13th Mediterranean Conference on Control and Automation, MED '05",

}

TY - GEN

T1 - Embedding perceptual metrics in rate control algorithms

AU - Atti, Venkatraman

AU - Spanias, Andreas

PY - 2005

Y1 - 2005

N2 - In this paper, we describe a perceptuallymotivated rate control algorithm for variable bit rate (VBR) speech coding. Unlike the existing rate selection strategies that are based on statistical metrics, the proposed method employs a perceptual loudness (PL) measure. The VBR speech standard TIA IS-127 enhanced variable rate codec (EVRC) has been chosen as the test-bed for evaluating the performance of the PL-based rate selection strategy relative to three other existing methods. In particular, the comparative study includes the following rate determination algorithms: voice activity detection, speech frame energy-thresholding, and phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against other rate selection techniques in terms of average bitrate and speech quality.

AB - In this paper, we describe a perceptuallymotivated rate control algorithm for variable bit rate (VBR) speech coding. Unlike the existing rate selection strategies that are based on statistical metrics, the proposed method employs a perceptual loudness (PL) measure. The VBR speech standard TIA IS-127 enhanced variable rate codec (EVRC) has been chosen as the test-bed for evaluating the performance of the PL-based rate selection strategy relative to three other existing methods. In particular, the comparative study includes the following rate determination algorithms: voice activity detection, speech frame energy-thresholding, and phonetic segmentation. Experimental results demonstrate that the proposed PL-based RDA compares well against other rate selection techniques in terms of average bitrate and speech quality.

KW - Perceptual rate control algorithm

KW - Signal processing

KW - Speech modeling

KW - Variable bit rate speech analysis

UR - http://www.scopus.com/inward/record.url?scp=33745191651&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33745191651&partnerID=8YFLogxK

U2 - 10.1109/.2005.1467127

DO - 10.1109/.2005.1467127

M3 - Conference contribution

AN - SCOPUS:33745191651

SN - 0780389360

SN - 9780780389366

VL - 2005

SP - 861

EP - 865

BT - Proceedings of the 20th IEEE International Symposium on Intelligent Control, ISIC '05 and the 13th Mediterranean Conference on Control and Automation, MED '05

ER -