Speech recognition and temporal amplitude modulation processing by Mandarin-speaking cochlear implant users

Xin Luo, Qian Jie Fu, Chao Gang Wei, Ke Li Cao

Research output: Contribution to journal › Article

62 Citations (Scopus)

Abstract

OBJECTIVES: Fundamental frequency (F0) information is important to Chinese tone and speech recognition. Cochlear implant (CI) speech processors typically provide limited F0 information via temporal envelopes delivered to stimulating electrodes. Previous studies have shown that English-speaking CI users' speech performance is correlated with amplitude modulation detection thresholds (AMDTs). The present study investigated whether Chinese-speaking CI users' speech performance (especially tone recognition) is correlated with temporal processing capabilities.

DESIGN: Chinese tone, vowel, consonant, and sentence recognition were measured in 10 native Mandarin-speaking CI users via clinically assigned speech processors. AMDTs were measured in the same subjects for 20- and 100-Hz amplitude modulated (AM) stimuli presented to a middle electrode at five stimulation levels that spanned the dynamic range. To further investigate the CI users' sensitivity to temporal envelope cues, AM frequency discrimination thresholds (AMFDTs) were measured for two standard AM frequencies (50 and 100 Hz), presented to the same middle electrode at 30% and 70% dynamic range with a fixed modulation depth (50%).

RESULTS: Results showed that AMDTs significantly improved with increasing stimulation level and that individual subjects exhibited markedly different AMDT functions. AMFDTs also improved with increasing stimulation level and were better with the 100-Hz standard AM frequency than with the 50-Hz standard AM frequency. Statistical analyses revealed that both mean AMDTs (averaged for 20- or 100-Hz AM across all stimulation levels) and mean AMFDTs (averaged for the 50-Hz standard AM frequency across both stimulation levels) were significantly correlated with tone, consonant, and sentence recognition scores, but not with vowel recognition scores. Mean AMDTs were also significantly correlated with mean AMFDTs.

CONCLUSIONS: These preliminary results, obtained from a limited number of subjects, demonstrate the importance of temporal processing to CI speech recognition. The results further suggest that CI users' Chinese tone and speech recognition may be improved by enhancing temporal envelope cues delivered by speech processing algorithms.

Original language: English (US)
Pages (from-to): 957-970
Number of pages: 14
Journal: Ear and Hearing
Volume: 29
Issue number: 6
DOI: 10.1097/AUD.0b013e3181888f61
State: Published - Dec 2008
Externally published: Yes

Fingerprint

Cochlear Implants
Electrodes
Cues
Recognition (Psychology)
Discrimination (Psychology)

ASJC Scopus subject areas

  • Otorhinolaryngology
  • Speech and Hearing

Cite this

Speech recognition and temporal amplitude modulation processing by Mandarin-speaking cochlear implant users. / Luo, Xin; Fu, Qian Jie; Wei, Chao Gang; Cao, Ke Li. In: Ear and Hearing, Vol. 29, No. 6, 12.2008, p. 957-970.

@article{7203c51cac4745a7b78a2b188a979775,
title = "Speech recognition and temporal amplitude modulation processing by Mandarin-speaking cochlear implant users",
abstract = "OBJECTIVES: Fundamental frequency (F0) information is important to Chinese tone and speech recognition. Cochlear implant (CI) speech processors typically provide limited F0 information via temporal envelopes delivered to stimulating electrodes. Previous studies have shown that English-speaking CI users' speech performance is correlated with amplitude modulation detection thresholds (AMDTs). The present study investigated whether Chinese-speaking CI users' speech performance (especially tone recognition) is correlated with temporal processing capabilities. DESIGN: Chinese tone, vowel, consonant, and sentence recognition were measured in 10 native Mandarin-speaking CI users via clinically assigned speech processors. AMDTs were measured in the same subjects for 20- and 100-Hz amplitude modulated (AM) stimuli presented to a middle electrode at five stimulation levels that spanned the dynamic range. To further investigate the CI users' sensitivity to temporal envelope cues, AM frequency discrimination thresholds (AMFDTs) were measured for two standard AM frequencies (50 and 100 Hz), presented to the same middle electrode at 30{\%} and 70{\%} dynamic range with a fixed modulation depth (50{\%}). RESULTS: Results showed that AMDTs significantly improved with increasing stimulation level and that individual subjects exhibited markedly different AMDT functions. AMFDTs also improved with increasing stimulation level and were better with the 100-Hz standard AM frequency than with the 50-Hz standard AM frequency. Statistical analyses revealed that both mean AMDTs (averaged for 20- or 100-Hz AM across all stimulation levels) and mean AMFDTs (averaged for the 50-Hz standard AM frequency across both stimulation levels) were significantly correlated with tone, consonant, and sentence recognition scores, but not with vowel recognition scores. Mean AMDTs were also significantly correlated with mean AMFDTs. 
CONCLUSIONS: These preliminary results, obtained from a limited number of subjects, demonstrate the importance of temporal processing to CI speech recognition. The results further suggest that CI users' Chinese tone and speech recognition may be improved by enhancing temporal envelope cues delivered by speech processing algorithms.",
author = "Luo, Xin and Fu, {Qian Jie} and Wei, {Chao Gang} and Cao, {Ke Li}",
year = "2008",
month = "12",
doi = "10.1097/AUD.0b013e3181888f61",
language = "English (US)",
volume = "29",
pages = "957--970",
journal = "Ear and Hearing",
issn = "0196-0202",
publisher = "Lippincott Williams and Wilkins",
number = "6",

}

TY - JOUR

T1 - Speech recognition and temporal amplitude modulation processing by Mandarin-speaking cochlear implant users

AU - Luo, Xin

AU - Fu, Qian Jie

AU - Wei, Chao Gang

AU - Cao, Ke Li

PY - 2008/12

Y1 - 2008/12

N2 - OBJECTIVES: Fundamental frequency (F0) information is important to Chinese tone and speech recognition. Cochlear implant (CI) speech processors typically provide limited F0 information via temporal envelopes delivered to stimulating electrodes. Previous studies have shown that English-speaking CI users' speech performance is correlated with amplitude modulation detection thresholds (AMDTs). The present study investigated whether Chinese-speaking CI users' speech performance (especially tone recognition) is correlated with temporal processing capabilities. DESIGN: Chinese tone, vowel, consonant, and sentence recognition were measured in 10 native Mandarin-speaking CI users via clinically assigned speech processors. AMDTs were measured in the same subjects for 20- and 100-Hz amplitude modulated (AM) stimuli presented to a middle electrode at five stimulation levels that spanned the dynamic range. To further investigate the CI users' sensitivity to temporal envelope cues, AM frequency discrimination thresholds (AMFDTs) were measured for two standard AM frequencies (50 and 100 Hz), presented to the same middle electrode at 30% and 70% dynamic range with a fixed modulation depth (50%). RESULTS: Results showed that AMDTs significantly improved with increasing stimulation level and that individual subjects exhibited markedly different AMDT functions. AMFDTs also improved with increasing stimulation level and were better with the 100-Hz standard AM frequency than with the 50-Hz standard AM frequency. Statistical analyses revealed that both mean AMDTs (averaged for 20- or 100-Hz AM across all stimulation levels) and mean AMFDTs (averaged for the 50-Hz standard AM frequency across both stimulation levels) were significantly correlated with tone, consonant, and sentence recognition scores, but not with vowel recognition scores. Mean AMDTs were also significantly correlated with mean AMFDTs. 
CONCLUSIONS: These preliminary results, obtained from a limited number of subjects, demonstrate the importance of temporal processing to CI speech recognition. The results further suggest that CI users' Chinese tone and speech recognition may be improved by enhancing temporal envelope cues delivered by speech processing algorithms.

UR - http://www.scopus.com/inward/record.url?scp=60849104831&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=60849104831&partnerID=8YFLogxK

U2 - 10.1097/AUD.0b013e3181888f61

DO - 10.1097/AUD.0b013e3181888f61

M3 - Article

C2 - 18818548

AN - SCOPUS:60849104831

VL - 29

SP - 957

EP - 970

JO - Ear and Hearing

JF - Ear and Hearing

SN - 0196-0202

IS - 6

ER -