Abstract

Most modern bandwidth extension techniques predict the high- frequency band based on features extracted from the lower band. While this works for some frames, problems arise when the correlation between the low and the high band is insufficient. In these situations, additional high-band information must be sent to the decoder. In this paper, we propose a scalable speech coding method based on the principles of bandwidth extension. The rate selection is based on explicit psyehoacoustic criteria, while the bandwidth extension is performed using a constrained MMSE estimation technique. Objective and subjective evaluations indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms while improving speech quality.

Original languageEnglish (US)
Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume4
DOIs
StatePublished - 2007
Event2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07 - Honolulu, HI, United States
Duration: Apr 15 2007Apr 20 2007

Other

Other2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
CountryUnited States
CityHonolulu, HI
Period4/15/074/20/07

Fingerprint

bandwidth
Bandwidth
Speech coding
decoders
Frequency bands
coding
evaluation

Keywords

  • Bandwidth extension
  • Multirate coding
  • Psychoacoustics
  • Scalable speech coding
  • Speech enhancement

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Signal Processing
  • Acoustics and Ultrasonics

Cite this

Berisha, V., & Spanias, A. (2007). A scalable bandwidth extension algorithm. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 4). [4218172] https://doi.org/10.1109/ICASSP.2007.366984

A scalable bandwidth extension algorithm. / Berisha, Visar; Spanias, Andreas.

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 4 2007. 4218172.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Berisha, V & Spanias, A 2007, A scalable bandwidth extension algorithm. in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. vol. 4, 4218172, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, Honolulu, HI, United States, 4/15/07. https://doi.org/10.1109/ICASSP.2007.366984
Berisha V, Spanias A. A scalable bandwidth extension algorithm. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 4. 2007. 4218172 https://doi.org/10.1109/ICASSP.2007.366984
Berisha, Visar ; Spanias, Andreas. / A scalable bandwidth extension algorithm. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. 4 2007.
@inproceedings{6d7d46ab1c8f4767b45cf25762a78149,
title = "A scalable bandwidth extension algorithm",
abstract = "Most modern bandwidth extension techniques predict the high- frequency band based on features extracted from the lower band. While this works for some frames, problems arise when the correlation between the low and the high band is insufficient. In these situations, additional high-band information must be sent to the decoder. In this paper, we propose a scalable speech coding method based on the principles of bandwidth extension. The rate selection is based on explicit psyehoacoustic criteria, while the bandwidth extension is performed using a constrained MMSE estimation technique. Objective and subjective evaluations indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms while improving speech quality.",
keywords = "Bandwidth extension, Multirate coding, Psychoacoustics, Scalable speech coding, Speech enhancement",
author = "Visar Berisha and Andreas Spanias",
year = "2007",
doi = "10.1109/ICASSP.2007.366984",
language = "English (US)",
isbn = "1424407281",
volume = "4",
booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

}

TY - GEN

T1 - A scalable bandwidth extension algorithm

AU - Berisha, Visar

AU - Spanias, Andreas

PY - 2007

Y1 - 2007

N2 - Most modern bandwidth extension techniques predict the high- frequency band based on features extracted from the lower band. While this works for some frames, problems arise when the correlation between the low and the high band is insufficient. In these situations, additional high-band information must be sent to the decoder. In this paper, we propose a scalable speech coding method based on the principles of bandwidth extension. The rate selection is based on explicit psyehoacoustic criteria, while the bandwidth extension is performed using a constrained MMSE estimation technique. Objective and subjective evaluations indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms while improving speech quality.

AB - Most modern bandwidth extension techniques predict the high- frequency band based on features extracted from the lower band. While this works for some frames, problems arise when the correlation between the low and the high band is insufficient. In these situations, additional high-band information must be sent to the decoder. In this paper, we propose a scalable speech coding method based on the principles of bandwidth extension. The rate selection is based on explicit psyehoacoustic criteria, while the bandwidth extension is performed using a constrained MMSE estimation technique. Objective and subjective evaluations indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms while improving speech quality.

KW - Bandwidth extension

KW - Multirate coding

KW - Psychoacoustics

KW - Scalable speech coding

KW - Speech enhancement

UR - http://www.scopus.com/inward/record.url?scp=34547547928&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34547547928&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2007.366984

DO - 10.1109/ICASSP.2007.366984

M3 - Conference contribution

SN - 1424407281

SN - 9781424407286

VL - 4

BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

ER -