Abstract

This plenary session covers advances in speech processing research, with an emphasis on speech and audio coding methods. We will discuss the fundamental principles, techniques, and algorithms used in current coding applications, including a summary of codecs for telecommunication standards. The session will start with a discussion of basic speech representation methods, the performance measures used to evaluate coded speech, and the role of standards. Brief algorithm descriptions include ADPCM, sub-band coding, adaptive transform coding, sinusoidal transform coding (STC), linear predictive coding (LPC), and analysis-by-synthesis LPC (sparse excitation, code-excited LPC, and ACELP). The presentation will feature audio and computer demonstrations of recent speech coding standards, including voice-over-IP algorithms. The session will also cover wideband audio standards such as MPEG audio (e.g., MP3, AAC). Recent algorithms will also be described, including Variable-Rate Multimode Wideband (VMR-WB), Speex, G.722.1, OGG Vorbis 2012, iLBC, CELT, SILK, Opus 2013, and Qualcomm wideband 5G codecs. At the end of the session, we will briefly cover recent applications that use voice features to detect speech pathologies, and discuss how long-term speech parameters can be used as predictors of other conditions such as tremors and Alzheimer's disease.

Original language: English (US)
Title of host publication: IISA 2015 - 6th International Conference on Information, Intelligence, Systems and Applications
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Print): 9781467393119
DOIs: https://doi.org/10.1109/IISA.2015.7388064
State: Published - Jan 20 2016
Event: 6th International Conference on Information, Intelligence, Systems and Applications, IISA 2015 - Corfu, Greece
Duration: Jul 6 2015 → Jul 8 2015

Other

Other: 6th International Conference on Information, Intelligence, Systems and Applications, IISA 2015
Country: Greece
City: Corfu
Period: 7/6/15 → 7/8/15

Keywords

  • Speech and Audio Coding
  • Standardized Codecs

ASJC Scopus subject areas

  • Computer Science Applications
  • Social Sciences (miscellaneous)
  • Artificial Intelligence
  • Information Systems

Cite this

Spanias, A. (2016). Advances in speech and audio processing and coding. In IISA 2015 - 6th International Conference on Information, Intelligence, Systems and Applications [7388064]. Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IISA.2015.7388064

Advances in speech and audio processing and coding. / Spanias, Andreas.

IISA 2015 - 6th International Conference on Information, Intelligence, Systems and Applications. Institute of Electrical and Electronics Engineers Inc., 2016. 7388064.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Spanias, A 2016, Advances in speech and audio processing and coding. in IISA 2015 - 6th International Conference on Information, Intelligence, Systems and Applications, 7388064, Institute of Electrical and Electronics Engineers Inc., 6th International Conference on Information, Intelligence, Systems and Applications, IISA 2015, Corfu, Greece, 7/6/15. https://doi.org/10.1109/IISA.2015.7388064
Spanias A. Advances in speech and audio processing and coding. In IISA 2015 - 6th International Conference on Information, Intelligence, Systems and Applications. Institute of Electrical and Electronics Engineers Inc. 2016. 7388064 https://doi.org/10.1109/IISA.2015.7388064
Spanias, Andreas. / Advances in speech and audio processing and coding. IISA 2015 - 6th International Conference on Information, Intelligence, Systems and Applications. Institute of Electrical and Electronics Engineers Inc., 2016.
@inproceedings{55763146801a4920af737905b6b8a184,
title = "Advances in speech and audio processing and coding",
abstract = "This plenary session covers advances in speech processing research, with an emphasis on speech and audio coding methods. We will discuss the fundamental principles, techniques, and algorithms used in current coding applications, including a summary of codecs for telecommunication standards. The session will start with a discussion of basic speech representation methods, the performance measures used to evaluate coded speech, and the role of standards. Brief algorithm descriptions include ADPCM, sub-band coding, adaptive transform coding, sinusoidal transform coding (STC), linear predictive coding (LPC), and analysis-by-synthesis LPC (sparse excitation, code-excited LPC, and ACELP). The presentation will feature audio and computer demonstrations of recent speech coding standards, including voice-over-IP algorithms. The session will also cover wideband audio standards such as MPEG audio (e.g., MP3, AAC). Recent algorithms will also be described, including Variable-Rate Multimode Wideband (VMR-WB), Speex, G.722.1, OGG Vorbis 2012, iLBC, CELT, SILK, Opus 2013, and Qualcomm wideband 5G codecs. At the end of the session, we will briefly cover recent applications that use voice features to detect speech pathologies, and discuss how long-term speech parameters can be used as predictors of other conditions such as tremors and Alzheimer's disease.",
keywords = "Speech and Audio Coding, Standardized Codecs",
author = "Andreas Spanias",
year = "2016",
month = "1",
day = "20",
doi = "10.1109/IISA.2015.7388064",
language = "English (US)",
isbn = "9781467393119",
booktitle = "IISA 2015 - 6th International Conference on Information, Intelligence, Systems and Applications",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN

T1 - Advances in speech and audio processing and coding

AU - Spanias, Andreas

PY - 2016/1/20

Y1 - 2016/1/20

AB - This plenary session covers advances in speech processing research, with an emphasis on speech and audio coding methods. We will discuss the fundamental principles, techniques, and algorithms used in current coding applications, including a summary of codecs for telecommunication standards. The session will start with a discussion of basic speech representation methods, the performance measures used to evaluate coded speech, and the role of standards. Brief algorithm descriptions include ADPCM, sub-band coding, adaptive transform coding, sinusoidal transform coding (STC), linear predictive coding (LPC), and analysis-by-synthesis LPC (sparse excitation, code-excited LPC, and ACELP). The presentation will feature audio and computer demonstrations of recent speech coding standards, including voice-over-IP algorithms. The session will also cover wideband audio standards such as MPEG audio (e.g., MP3, AAC). Recent algorithms will also be described, including Variable-Rate Multimode Wideband (VMR-WB), Speex, G.722.1, OGG Vorbis 2012, iLBC, CELT, SILK, Opus 2013, and Qualcomm wideband 5G codecs. At the end of the session, we will briefly cover recent applications that use voice features to detect speech pathologies, and discuss how long-term speech parameters can be used as predictors of other conditions such as tremors and Alzheimer's disease.

KW - Speech and Audio Coding

KW - Standardized Codecs

UR - http://www.scopus.com/inward/record.url?scp=84963861063&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84963861063&partnerID=8YFLogxK

U2 - 10.1109/IISA.2015.7388064

DO - 10.1109/IISA.2015.7388064

M3 - Conference contribution

SN - 9781467393119

BT - IISA 2015 - 6th International Conference on Information, Intelligence, Systems and Applications

PB - Institute of Electrical and Electronics Engineers Inc.

ER -