Abstract
In this paper we consider the problem of estimating the speaking rate directly from the speech waveform. We propose an algorithm that poses the speaking rate estimation problem as a convex optimization problem. In contrast to existing methods, we avoid the more difficult task of detecting individual syllables within the speech signal and we avoid heuristics like thresholding a loudness function. The algorithm was evaluated on the ICSI Switchboard spontaneous speech corpus and a speech corpus obtained from publicly-available interviews on Youtube.
Original language | English (US) |
---|---|
Title of host publication | Conference Record - Asilomar Conference on Signals, Systems and Computers |
Publisher | IEEE Computer Society |
Pages | 1189-1192 |
Number of pages | 4 |
Volume | 2016-February |
ISBN (Print) | 9781467385763 |
DOIs | |
State | Published - Feb 26 2016 |
Event | 49th Asilomar Conference on Signals, Systems and Computers, ACSSC 2015 - Pacific Grove, United States Duration: Nov 8 2015 → Nov 11 2015 |
Other
Other | 49th Asilomar Conference on Signals, Systems and Computers, ACSSC 2015 |
---|---|
Country | United States |
City | Pacific Grove |
Period | 11/8/15 → 11/11/15 |
ASJC Scopus subject areas
- Computer Networks and Communications
- Signal Processing