Models for objective evaluation of dysarthric speech from data annotated by multiple listeners

Ming Tu, Yishan Jiao, Visar Berisha, Julie Liss

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


Abstract

In subjective evaluation of dysarthric speech, inter-rater agreement between clinicians can be low. Disagreement among clinicians stems from differences in their perceptual assessment abilities, familiarity with a client, clinical experience, etc. Recently, there has been interest in developing signal processing and machine learning models for objective evaluation of subjective speech quality. In this paper, we propose a new method that addresses this problem by collecting subjective ratings from multiple evaluators and modeling the reliability of each annotator within a machine learning framework. In contrast to previous work, our model explicitly captures the dependence of an evaluator's reliability on the speaker being rated. We evaluate the model in a series of experiments on a dysarthric speech database and show that it outperforms other similar approaches.
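
The abstract does not spell out the model, so the following is a minimal illustrative sketch (in Python/NumPy) of the general idea it describes: estimating a latent "true" score per utterance from multiple annotators' ratings, with each annotator's noise variance allowed to depend on the speaker. The Gaussian-noise formulation, the alternating update scheme, and all names here (estimate_scores, speaker_of, etc.) are assumptions for illustration, not the authors' actual method.

# Illustrative sketch only -- the Gaussian-noise model and the alternating
# update scheme are assumptions, not the model from the paper.
import numpy as np

def estimate_scores(ratings, speaker_of, n_iters=20, eps=1e-4):
    """ratings: (n_utterances, n_annotators) matrix of subjective scores.
    speaker_of: length-n_utterances array of speaker ids.
    Returns (latent score per utterance, per-(speaker, annotator) variance)."""
    n_utt, n_ann = ratings.shape
    speakers = np.unique(speaker_of)
    z = ratings.mean(axis=1)                # init: plain average rating
    var = np.ones((len(speakers), n_ann))   # init: all annotators equally reliable
    for _ in range(n_iters):
        # Update each annotator's speaker-dependent noise variance from
        # the residuals against the current latent scores.
        for i, s in enumerate(speakers):
            rows = speaker_of == s
            resid = ratings[rows] - z[rows, None]
            var[i] = np.maximum((resid ** 2).mean(axis=0), eps)
        # Update latent scores as a precision-weighted average, so ratings
        # from annotators who are unreliable on this speaker count less.
        prec = 1.0 / var[np.searchsorted(speakers, speaker_of)]
        z = (prec * ratings).sum(axis=1) / prec.sum(axis=1)
    return z, var

# Toy usage: 8 speakers with 5 utterances each, rated by 4 annotators.
rng = np.random.default_rng(0)
speaker_of = np.repeat(np.arange(8), 5)
ratings = rng.normal(3.0, 1.0, size=(40, 4))
z, var = estimate_scores(ratings, speaker_of)

Contrast this with a plain average across annotators, which assumes every rater is equally reliable on every speaker; the speaker-dependent weighting is what lets an annotator's ratings count more for the speakers they judge consistently.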

Original language: English (US)
Title of host publication: Conference Record of the 50th Asilomar Conference on Signals, Systems and Computers, ACSSC 2016
Editors: Michael B. Matthews
Publisher: IEEE Computer Society
Pages: 827-830
Number of pages: 4
ISBN (Electronic): 9781538639542
DOIs
State: Published - Mar 1 2017
Event: 50th Asilomar Conference on Signals, Systems and Computers, ACSSC 2016 - Pacific Grove, United States
Duration: Nov 6 2016 - Nov 9 2016

Publication series

Name: Conference Record - Asilomar Conference on Signals, Systems and Computers
ISSN (Print): 1058-6393


ASJC Scopus subject areas

  • Signal Processing
  • Computer Networks and Communications

