Bandwidth extension of audio based on partial loudness criteria

Visar Berisha; Andreas Spanias

doi:10.1109/MMSP.2006.285286

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Scopus citations

Abstract

Most modern speech coders operate on a limited bandwidth. This tends to decrease the naturalness of the synthesized audio and often also affects the intelligibility of certain sounds. While a few wideband speech coders have been standardized, implementing them in existing systems would require significant changes to the infrastructure. One solution is to use bandwidth extension techniques that predict the high-frequency band based on low-band features. Problems arise, however, when the correlation between the low and the high band is insufficient for an adequate representation of the wideband signal. In this paper, we propose a novel source-filter bandwidth extension algorithm that makes use of psychoacoustic concepts to determine the perceptual benefits that a particular audio frame gains from a more exact representation of the high band. Preliminary results indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms without compromising the audio quality.

Original language	English (US)
Title of host publication	2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006
Publisher	IEEE Computer Society
Pages	146-149
Number of pages	4
ISBN (Print)	0780397517, 9780780397514
DOIs	https://doi.org/10.1109/MMSP.2006.285286
State	Published - Jan 1 2006
Event	2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006 - Victoria, BC, Canada Duration: Oct 3 2006 → Oct 6 2006

Publication series

Name	2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006

Other

Other	2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006
Country/Territory	Canada
City	Victoria, BC
Period	10/3/06 → 10/6/06

ASJC Scopus subject areas

Signal Processing

Cite this

Bandwidth extension of audio based on partial loudness criteria. / Berisha, Visar ; Spanias, Andreas.
2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006. IEEE Computer Society, 2006. p. 146-149 4064536 (2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006).

Berisha, V & Spanias, A 2006, Bandwidth extension of audio based on partial loudness criteria. in 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006., 4064536, 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006, IEEE Computer Society, pp. 146-149, 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006, Victoria, BC, Canada, 10/3/06. https://doi.org/10.1109/MMSP.2006.285286

@inproceedings{48c9c85cb27149beaab61719a329cceb,

title = "Bandwidth extension of audio based on partial loudness criteria",

abstract = "Most modern speech coders operate on a limited bandwidth. This tends to decrease the naturalness of the synthesized audio and often also affects the intelligibility of certain sounds. While a few wideband speech coders have been standardized, implementing them in existing systems would require significant changes to the infrastructure. One solution is to use bandwidth extension techniques that predict the high-frequency band based on low-band features. Problems arise however when the correlation between the low and the high band is insufficient for an adequate representation of the wideband signal. In this paper, we propose a novel source-filter bandwidth extension algorithm that makes use of psychoacoustic concepts to determine the perceptual benefits that a particular audio frame gains from a more exact representation of the high band. Preliminary results indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms without compromising the audio quality.",

author = "Visar Berisha and Andreas Spanias",

year = "2006",

month = jan,

day = "1",

doi = "10.1109/MMSP.2006.285286",

language = "English (US)",

isbn = "0780397517",

series = "2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006",

publisher = "IEEE Computer Society",

pages = "146--149",

booktitle = "2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006",

note = "2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006 ; Conference date: 03-10-2006 Through 06-10-2006",

}

TY - GEN

T1 - Bandwidth extension of audio based on partial loudness criteria

AU - Berisha, Visar

AU - Spanias, Andreas

PY - 2006/1/1

Y1 - 2006/1/1

AB - Most modern speech coders operate on a limited bandwidth. This tends to decrease the naturalness of the synthesized audio and often also affects the intelligibility of certain sounds. While a few wideband speech coders have been standardized, implementing them in existing systems would require significant changes to the infrastructure. One solution is to use bandwidth extension techniques that predict the high-frequency band based on low-band features. Problems arise however when the correlation between the low and the high band is insufficient for an adequate representation of the wideband signal. In this paper, we propose a novel source-filter bandwidth extension algorithm that makes use of psychoacoustic concepts to determine the perceptual benefits that a particular audio frame gains from a more exact representation of the high band. Preliminary results indicate that the proposed system performs at a lower average bit rate when compared to other similar algorithms without compromising the audio quality.

UR - http://www.scopus.com/inward/record.url?scp=34250724837&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34250724837&partnerID=8YFLogxK

U2 - 10.1109/MMSP.2006.285286

DO - 10.1109/MMSP.2006.285286

M3 - Conference contribution

AN - SCOPUS:34250724837

SN - 0780397517

SN - 9780780397514

T3 - 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006

SP - 146

EP - 149

BT - 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006

PB - IEEE Computer Society

T2 - 2006 IEEE 8th Workshop on Multimedia Signal Processing, MMSP 2006

Y2 - 3 October 2006 through 6 October 2006

ER -
