On the number of channels needed to understand speech

Philipos C. Loizou, Michael Dorman, Zhemin Tu

    Research output: Contribution to journalArticle

    147 Citations (Scopus)

    Abstract

    Recent studies have shown that high levels of speech understanding could be achieved when the speech spectrum was divided into four channels and then reconstructed as a sum of four noise bands or sine waves with frequencies equal to the center frequencies of the channels. In these studies speech understanding was assessed using sentences produced by a single male talker. The aim of experiment 1 was to assess the number of channels necessary for a high level of speech understanding when sentences were produced by multiple talkers. In experiment 1, sentences produced by 135 different talkers were processed through n (2≤n≤16) number of channels, synthesized as a sum of n sinc waves with frequencies equal to the center frequencies of the filters, and presented to normal-hearing listeners for identification. A minimum of five channels was needed to achieve a high level (90%) of speech understanding. Asymptotic performance was achieved with eight channels, at least for the speech material used in this study. The outcome of experiment 1 demonstrated that the number of channels needed to reach asymptotic performance varies as a function of the recognition task and/or need for listeners to attend to fine phonetic detail. In experiment 2, sentences were processed through 6 and 16 channels and quantized into a small number of steps. The purpose of this experiment was to investigate whether listeners use across-channel differences in amplitude to code frequency information, particularly when speech is processed through a small number of channels. For sentences processed through six channels there was a significant reduction in speech understanding when the spectral amplitudes were quantized into a small number (<8) of steps. High levels (92%) of speech understanding were maintained for sentences processed through 16 channels and quantized into only 2 steps. The findings of experiment 2 suggest an inverse: relationship between the importance of spectral amplitude resolution (number of steps) and spectral resolution (number of channels).

    Original languageEnglish (US)
    Pages (from-to)2097-2103
    Number of pages7
    JournalJournal of the Acoustical Society of America
    Volume106
    Issue number4 I
    DOIs
    StatePublished - 1999

    Fingerprint

    sentences
    Experiment
    phonetics
    sine waves
    hearing
    spectral resolution
    filters
    Listeners
    Talkers
    Spectrality
    Waves

    ASJC Scopus subject areas

    • Acoustics and Ultrasonics

    Cite this

    On the number of channels needed to understand speech. / Loizou, Philipos C.; Dorman, Michael; Tu, Zhemin.

    In: Journal of the Acoustical Society of America, Vol. 106, No. 4 I, 1999, p. 2097-2103.

    Research output: Contribution to journalArticle

    Loizou, Philipos C. ; Dorman, Michael ; Tu, Zhemin. / On the number of channels needed to understand speech. In: Journal of the Acoustical Society of America. 1999 ; Vol. 106, No. 4 I. pp. 2097-2103.
    @article{9d025b41e1cd41e4ba5a80d05b72e6c0,
    title = "On the number of channels needed to understand speech",
    abstract = "Recent studies have shown that high levels of speech understanding could be achieved when the speech spectrum was divided into four channels and then reconstructed as a sum of four noise bands or sine waves with frequencies equal to the center frequencies of the channels. In these studies speech understanding was assessed using sentences produced by a single male talker. The aim of experiment 1 was to assess the number of channels necessary for a high level of speech understanding when sentences were produced by multiple talkers. In experiment 1, sentences produced by 135 different talkers were processed through n (2≤n≤16) number of channels, synthesized as a sum of n sinc waves with frequencies equal to the center frequencies of the filters, and presented to normal-hearing listeners for identification. A minimum of five channels was needed to achieve a high level (90{\%}) of speech understanding. Asymptotic performance was achieved with eight channels, at least for the speech material used in this study. The outcome of experiment 1 demonstrated that the number of channels needed to reach asymptotic performance varies as a function of the recognition task and/or need for listeners to attend to fine phonetic detail. In experiment 2, sentences were processed through 6 and 16 channels and quantized into a small number of steps. The purpose of this experiment was to investigate whether listeners use across-channel differences in amplitude to code frequency information, particularly when speech is processed through a small number of channels. For sentences processed through six channels there was a significant reduction in speech understanding when the spectral amplitudes were quantized into a small number (<8) of steps. High levels (92{\%}) of speech understanding were maintained for sentences processed through 16 channels and quantized into only 2 steps. The findings of experiment 2 suggest an inverse: relationship between the importance of spectral amplitude resolution (number of steps) and spectral resolution (number of channels).",
    author = "Loizou, {Philipos C.} and Michael Dorman and Zhemin Tu",
    year = "1999",
    doi = "10.1121/1.427954",
    language = "English (US)",
    volume = "106",
    pages = "2097--2103",
    journal = "Journal of the Acoustical Society of America",
    issn = "0001-4966",
    publisher = "Acoustical Society of America",
    number = "4 I",

    }

    TY - JOUR

    T1 - On the number of channels needed to understand speech

    AU - Loizou, Philipos C.

    AU - Dorman, Michael

    AU - Tu, Zhemin

    PY - 1999

    Y1 - 1999

    N2 - Recent studies have shown that high levels of speech understanding could be achieved when the speech spectrum was divided into four channels and then reconstructed as a sum of four noise bands or sine waves with frequencies equal to the center frequencies of the channels. In these studies speech understanding was assessed using sentences produced by a single male talker. The aim of experiment 1 was to assess the number of channels necessary for a high level of speech understanding when sentences were produced by multiple talkers. In experiment 1, sentences produced by 135 different talkers were processed through n (2≤n≤16) number of channels, synthesized as a sum of n sinc waves with frequencies equal to the center frequencies of the filters, and presented to normal-hearing listeners for identification. A minimum of five channels was needed to achieve a high level (90%) of speech understanding. Asymptotic performance was achieved with eight channels, at least for the speech material used in this study. The outcome of experiment 1 demonstrated that the number of channels needed to reach asymptotic performance varies as a function of the recognition task and/or need for listeners to attend to fine phonetic detail. In experiment 2, sentences were processed through 6 and 16 channels and quantized into a small number of steps. The purpose of this experiment was to investigate whether listeners use across-channel differences in amplitude to code frequency information, particularly when speech is processed through a small number of channels. For sentences processed through six channels there was a significant reduction in speech understanding when the spectral amplitudes were quantized into a small number (<8) of steps. High levels (92%) of speech understanding were maintained for sentences processed through 16 channels and quantized into only 2 steps. The findings of experiment 2 suggest an inverse: relationship between the importance of spectral amplitude resolution (number of steps) and spectral resolution (number of channels).

    AB - Recent studies have shown that high levels of speech understanding could be achieved when the speech spectrum was divided into four channels and then reconstructed as a sum of four noise bands or sine waves with frequencies equal to the center frequencies of the channels. In these studies speech understanding was assessed using sentences produced by a single male talker. The aim of experiment 1 was to assess the number of channels necessary for a high level of speech understanding when sentences were produced by multiple talkers. In experiment 1, sentences produced by 135 different talkers were processed through n (2≤n≤16) number of channels, synthesized as a sum of n sinc waves with frequencies equal to the center frequencies of the filters, and presented to normal-hearing listeners for identification. A minimum of five channels was needed to achieve a high level (90%) of speech understanding. Asymptotic performance was achieved with eight channels, at least for the speech material used in this study. The outcome of experiment 1 demonstrated that the number of channels needed to reach asymptotic performance varies as a function of the recognition task and/or need for listeners to attend to fine phonetic detail. In experiment 2, sentences were processed through 6 and 16 channels and quantized into a small number of steps. The purpose of this experiment was to investigate whether listeners use across-channel differences in amplitude to code frequency information, particularly when speech is processed through a small number of channels. For sentences processed through six channels there was a significant reduction in speech understanding when the spectral amplitudes were quantized into a small number (<8) of steps. High levels (92%) of speech understanding were maintained for sentences processed through 16 channels and quantized into only 2 steps. The findings of experiment 2 suggest an inverse: relationship between the importance of spectral amplitude resolution (number of steps) and spectral resolution (number of channels).

    UR - http://www.scopus.com/inward/record.url?scp=0032825236&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=0032825236&partnerID=8YFLogxK

    U2 - 10.1121/1.427954

    DO - 10.1121/1.427954

    M3 - Article

    C2 - 10530032

    AN - SCOPUS:0032825236

    VL - 106

    SP - 2097

    EP - 2103

    JO - Journal of the Acoustical Society of America

    JF - Journal of the Acoustical Society of America

    SN - 0001-4966

    IS - 4 I

    ER -