Estimating speaking rate in spontaneous discourse

Yishan Jiao, Visar Berisha, Ming Tu, Timothy Huston, Julie Liss

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

In this paper we consider the problem of estimating the speaking rate directly from the speech waveform. We propose an algorithm that poses the speaking rate estimation problem as a convex optimization problem. In contrast to existing methods, we avoid the more difficult task of detecting individual syllables within the speech signal and we avoid heuristics like thresholding a loudness function. The algorithm was evaluated on the ICSI Switchboard spontaneous speech corpus and a speech corpus obtained from publicly-available interviews on Youtube.

Original languageEnglish (US)
Title of host publicationConference Record - Asilomar Conference on Signals, Systems and Computers
PublisherIEEE Computer Society
Pages1189-1192
Number of pages4
Volume2016-February
ISBN (Print)9781467385763
DOIs
StatePublished - Feb 26 2016
Event49th Asilomar Conference on Signals, Systems and Computers, ACSSC 2015 - Pacific Grove, United States
Duration: Nov 8 2015Nov 11 2015

Other

Other49th Asilomar Conference on Signals, Systems and Computers, ACSSC 2015
CountryUnited States
CityPacific Grove
Period11/8/1511/11/15

    Fingerprint

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Signal Processing

Cite this

Jiao, Y., Berisha, V., Tu, M., Huston, T., & Liss, J. (2016). Estimating speaking rate in spontaneous discourse. In Conference Record - Asilomar Conference on Signals, Systems and Computers (Vol. 2016-February, pp. 1189-1192). [7421328] IEEE Computer Society. https://doi.org/10.1109/ACSSC.2015.7421328