Abstract

Most modern speech coders operate on a limited bandwidth. This tends to decrease the naturalness of the synthesized audio and often also affects the intelligibility of certain sounds. While a few wideband speech coders have been standardized, implementing them in existing systems would require significant changes to the infrastructure. One solution is to use bandwidth extension techniques that predict the high-frequency band based on low-band features. Problems arise however when the correlation between the low and the high band is insufficient for an adequate representation of the wideband signal. In this paper, we propose a novel source-filter bandwidth extension algorithm that makes use of psychoacoustic concepts to determine the perceptual benefits that a particular audio frame gains from a more exact representation of the high band. Preliminary results indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms without compromising the audio quality.

Original languageEnglish (US)
Title of host publication2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006
Pages146-149
Number of pages4
DOIs
StatePublished - 2007
Event2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006 - Victoria, BC, Canada
Duration: Oct 3 2006Oct 6 2006

Other

Other2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006
CountryCanada
CityVictoria, BC
Period10/3/0610/6/06

Fingerprint

Bandwidth
Frequency bands
Acoustic waves

ASJC Scopus subject areas

  • Signal Processing

Cite this

Berisha, V., & Spanias, A. (2007). Bandwidth extension of audio based on partial loudness criteria. In 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006 (pp. 146-149). [4064536] https://doi.org/10.1109/MMSP.2006.285286

Bandwidth extension of audio based on partial loudness criteria. / Berisha, Visar; Spanias, Andreas.

2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006. 2007. p. 146-149 4064536.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Berisha, V & Spanias, A 2007, Bandwidth extension of audio based on partial loudness criteria. in 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006., 4064536, pp. 146-149, 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006, Victoria, BC, Canada, 10/3/06. https://doi.org/10.1109/MMSP.2006.285286
Berisha V, Spanias A. Bandwidth extension of audio based on partial loudness criteria. In 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006. 2007. p. 146-149. 4064536 https://doi.org/10.1109/MMSP.2006.285286
Berisha, Visar ; Spanias, Andreas. / Bandwidth extension of audio based on partial loudness criteria. 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006. 2007. pp. 146-149
@inproceedings{48c9c85cb27149beaab61719a329cceb,
title = "Bandwidth extension of audio based on partial loudness criteria",
abstract = "Most modern speech coders operate on a limited bandwidth. This tends to decrease the naturalness of the synthesized audio and often also affects the intelligibility of certain sounds. While a few wideband speech coders have been standardized, implementing them in existing systems would require significant changes to the infrastructure. One solution is to use bandwidth extension techniques that predict the high-frequency band based on low-band features. Problems arise however when the correlation between the low and the high band is insufficient for an adequate representation of the wideband signal. In this paper, we propose a novel source-filter bandwidth extension algorithm that makes use of psychoacoustic concepts to determine the perceptual benefits that a particular audio frame gains from a more exact representation of the high band. Preliminary results indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms without compromising the audio quality.",
author = "Visar Berisha and Andreas Spanias",
year = "2007",
doi = "10.1109/MMSP.2006.285286",
language = "English (US)",
isbn = "0780397517",
pages = "146--149",
booktitle = "2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006",

}

TY - GEN

T1 - Bandwidth extension of audio based on partial loudness criteria

AU - Berisha, Visar

AU - Spanias, Andreas

PY - 2007

Y1 - 2007

N2 - Most modern speech coders operate on a limited bandwidth. This tends to decrease the naturalness of the synthesized audio and often also affects the intelligibility of certain sounds. While a few wideband speech coders have been standardized, implementing them in existing systems would require significant changes to the infrastructure. One solution is to use bandwidth extension techniques that predict the high-frequency band based on low-band features. Problems arise however when the correlation between the low and the high band is insufficient for an adequate representation of the wideband signal. In this paper, we propose a novel source-filter bandwidth extension algorithm that makes use of psychoacoustic concepts to determine the perceptual benefits that a particular audio frame gains from a more exact representation of the high band. Preliminary results indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms without compromising the audio quality.

AB - Most modern speech coders operate on a limited bandwidth. This tends to decrease the naturalness of the synthesized audio and often also affects the intelligibility of certain sounds. While a few wideband speech coders have been standardized, implementing them in existing systems would require significant changes to the infrastructure. One solution is to use bandwidth extension techniques that predict the high-frequency band based on low-band features. Problems arise however when the correlation between the low and the high band is insufficient for an adequate representation of the wideband signal. In this paper, we propose a novel source-filter bandwidth extension algorithm that makes use of psychoacoustic concepts to determine the perceptual benefits that a particular audio frame gains from a more exact representation of the high band. Preliminary results indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms without compromising the audio quality.

UR - http://www.scopus.com/inward/record.url?scp=34250724837&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34250724837&partnerID=8YFLogxK

U2 - 10.1109/MMSP.2006.285286

DO - 10.1109/MMSP.2006.285286

M3 - Conference contribution

SN - 0780397517

SN - 9780780397514

SP - 146

EP - 149

BT - 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006

ER -