Speech analysis by estimating perceptually relevant pole locations

Venkatraman Atti, Andreas Spanias

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

An approach for estimating the perceptually-relevant pole locations is described. This "perceptual poles" are determined by using an auditory excitation pattern-matching method. The estimated perceptual poles are then used to construct a perceptually-motivated all-pole (PMAP) filter for use in speech analysis/synthesis. The proposed PMAP approach is compared against some of the existing perceptually-based linear prediction (LP) methods, i.e., the perceptual LP and the Warped LP. The PMAP approach compares well against the perceptual LP and the warped LP in terms of speech reconstruction quality and estimation of the formant frequencies.

Original languageEnglish (US)
Title of host publicationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
VolumeI
ISBN (Print)0780388747, 9780780388741
DOIs
StatePublished - 2005
Event2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Philadelphia, PA, United States
Duration: Mar 18 2005Mar 23 2005

Other

Other2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
CountryUnited States
CityPhiladelphia, PA
Period3/18/053/23/05

Fingerprint

Speech analysis
linear prediction
Poles
estimating
poles
Pattern matching
filters
synthesis
excitation

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Signal Processing
  • Acoustics and Ultrasonics

Cite this

Atti, V., & Spanias, A. (2005). Speech analysis by estimating perceptually relevant pole locations. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. I). [1415089] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICASSP.2005.1415089

Speech analysis by estimating perceptually relevant pole locations. / Atti, Venkatraman; Spanias, Andreas.

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. I Institute of Electrical and Electronics Engineers Inc., 2005. 1415089.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Atti, V & Spanias, A 2005, Speech analysis by estimating perceptually relevant pole locations. in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. vol. I, 1415089, Institute of Electrical and Electronics Engineers Inc., 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05, Philadelphia, PA, United States, 3/18/05. https://doi.org/10.1109/ICASSP.2005.1415089
Atti V, Spanias A. Speech analysis by estimating perceptually relevant pole locations. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. I. Institute of Electrical and Electronics Engineers Inc. 2005. 1415089 https://doi.org/10.1109/ICASSP.2005.1415089
Atti, Venkatraman ; Spanias, Andreas. / Speech analysis by estimating perceptually relevant pole locations. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Vol. I Institute of Electrical and Electronics Engineers Inc., 2005.
@inproceedings{2b087783f71e4bb08fabfc8521df830a,
title = "Speech analysis by estimating perceptually relevant pole locations",
abstract = "An approach for estimating the perceptually-relevant pole locations is described. This {"}perceptual poles{"} are determined by using an auditory excitation pattern-matching method. The estimated perceptual poles are then used to construct a perceptually-motivated all-pole (PMAP) filter for use in speech analysis/synthesis. The proposed PMAP approach is compared against some of the existing perceptually-based linear prediction (LP) methods, i.e., the perceptual LP and the Warped LP. The PMAP approach compares well against the perceptual LP and the warped LP in terms of speech reconstruction quality and estimation of the formant frequencies.",
author = "Venkatraman Atti and Andreas Spanias",
year = "2005",
doi = "10.1109/ICASSP.2005.1415089",
language = "English (US)",
isbn = "0780388747",
volume = "I",
booktitle = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Speech analysis by estimating perceptually relevant pole locations

AU - Atti, Venkatraman

AU - Spanias, Andreas

PY - 2005

Y1 - 2005

N2 - An approach for estimating the perceptually-relevant pole locations is described. This "perceptual poles" are determined by using an auditory excitation pattern-matching method. The estimated perceptual poles are then used to construct a perceptually-motivated all-pole (PMAP) filter for use in speech analysis/synthesis. The proposed PMAP approach is compared against some of the existing perceptually-based linear prediction (LP) methods, i.e., the perceptual LP and the Warped LP. The PMAP approach compares well against the perceptual LP and the warped LP in terms of speech reconstruction quality and estimation of the formant frequencies.

AB - An approach for estimating the perceptually-relevant pole locations is described. This "perceptual poles" are determined by using an auditory excitation pattern-matching method. The estimated perceptual poles are then used to construct a perceptually-motivated all-pole (PMAP) filter for use in speech analysis/synthesis. The proposed PMAP approach is compared against some of the existing perceptually-based linear prediction (LP) methods, i.e., the perceptual LP and the Warped LP. The PMAP approach compares well against the perceptual LP and the warped LP in terms of speech reconstruction quality and estimation of the formant frequencies.

UR - http://www.scopus.com/inward/record.url?scp=33646779499&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33646779499&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2005.1415089

DO - 10.1109/ICASSP.2005.1415089

M3 - Conference contribution

SN - 0780388747

SN - 9780780388741

VL - I

BT - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

ER -