TY - GEN
T1 - Estimating speaking rate in spontaneous discourse
AU - Jiao, Yishan
AU - Berisha, Visar
AU - Tu, Ming
AU - Huston, Timothy
AU - Liss, Julie
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2016/2/26
Y1 - 2016/2/26
N2 - In this paper we consider the problem of estimating the speaking rate directly from the speech waveform. We propose an algorithm that poses the speaking rate estimation problem as a convex optimization problem. In contrast to existing methods, we avoid the more difficult task of detecting individual syllables within the speech signal and we avoid heuristics like thresholding a loudness function. The algorithm was evaluated on the ICSI Switchboard spontaneous speech corpus and a speech corpus obtained from publicly-available interviews on Youtube.
AB - In this paper we consider the problem of estimating the speaking rate directly from the speech waveform. We propose an algorithm that poses the speaking rate estimation problem as a convex optimization problem. In contrast to existing methods, we avoid the more difficult task of detecting individual syllables within the speech signal and we avoid heuristics like thresholding a loudness function. The algorithm was evaluated on the ICSI Switchboard spontaneous speech corpus and a speech corpus obtained from publicly-available interviews on Youtube.
UR - http://www.scopus.com/inward/record.url?scp=84969902393&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84969902393&partnerID=8YFLogxK
U2 - 10.1109/ACSSC.2015.7421328
DO - 10.1109/ACSSC.2015.7421328
M3 - Conference contribution
AN - SCOPUS:84969902393
T3 - Conference Record - Asilomar Conference on Signals, Systems and Computers
SP - 1189
EP - 1192
BT - Conference Record of the 49th Asilomar Conference on Signals, Systems and Computers, ACSSC 2015
A2 - Matthews, Michael B.
PB - IEEE Computer Society
T2 - 49th Asilomar Conference on Signals, Systems and Computers, ACSSC 2015
Y2 - 8 November 2015 through 11 November 2015
ER -