Estimating speaking rate in spontaneous discourse

Yishan Jiao, Visar Berisha, Ming Tu, Timothy Huston, Julie Liss

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations


In this paper we consider the problem of estimating the speaking rate directly from the speech waveform. We propose an algorithm that poses the speaking rate estimation problem as a convex optimization problem. In contrast to existing methods, we avoid the more difficult task of detecting individual syllables within the speech signal and we avoid heuristics like thresholding a loudness function. The algorithm was evaluated on the ICSI Switchboard spontaneous speech corpus and a speech corpus obtained from publicly-available interviews on Youtube.

Original languageEnglish (US)
Title of host publicationConference Record - Asilomar Conference on Signals, Systems and Computers
PublisherIEEE Computer Society
Number of pages4
ISBN (Print)9781467385763
StatePublished - Feb 26 2016
Event49th Asilomar Conference on Signals, Systems and Computers, ACSSC 2015 - Pacific Grove, United States
Duration: Nov 8 2015Nov 11 2015


Other49th Asilomar Conference on Signals, Systems and Computers, ACSSC 2015
Country/TerritoryUnited States
CityPacific Grove

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Signal Processing


Dive into the research topics of 'Estimating speaking rate in spontaneous discourse'. Together they form a unique fingerprint.

Cite this