Unifying semantic and content-based approaches for retrieval of environmental sounds

Gordon Wichern, Harvey Thornburg, Andreas Spanias

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Scopus citations

Abstract

Creating a database of user-contributed recordings allows sounds to be linked not only by the semantic tags and labels applied to them, but also to other sounds with similar acoustic characteristics. Of paramount importance in navigating these databases are the problems of retrieving similar sounds using text or sound-based queries, and automatically annotating unlabeled sounds. We propose an integrated system that can be used for text-based retrieval of unlabeled audio, content-based query-by-example, and automatic annotation. To this end, we introduce an ontological framework in which sounds are connected to each other based on a measure of perceptual similarity, while words and sounds are connected by optimizing link weights given user preference data. Results on a freely available database of environmental sounds contributed and labeled by non-expert users demonstrate effective average precision scores for both the text-based retrieval and annotation tasks.
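As a rough illustration of how a text query could reach unlabeled audio through such a network, the sketch below scores each sound by combining a word-to-sound link weight with a sound-to-sound similarity. It is not the authors' implementation: the function name `retrieval_scores`, the dictionary layout, the product-and-sum scoring rule, and all numeric values are assumptions made for illustration only, and the abstract's optimization of link weights from user preference data is not reproduced here.

```python
# Illustrative sketch only; names, data layout, and scoring rule are assumed,
# not taken from the paper.

def retrieval_scores(query_word, word_sound_weights, sound_similarity):
    """Rank sounds for a text query via word -> labeled sound -> any sound."""
    scores = {}
    # Each labeled sound linked to the query word passes its weight along
    # its similarity links to every sound it is connected to.
    for labeled, weight in word_sound_weights.get(query_word, {}).items():
        for target, similarity in sound_similarity[labeled].items():
            scores[target] = scores.get(target, 0.0) + weight * similarity
    # Highest score first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical similarity links between sounds (values in [0, 1]).
sound_similarity = {
    "rain_01": {"rain_01": 1.0, "rain_02": 0.8, "traffic_01": 0.1},
    "traffic_01": {"traffic_01": 1.0, "rain_01": 0.1, "rain_02": 0.2},
}

# Hypothetical word-to-sound link weights; "rain_02" carries no label.
word_sound_weights = {
    "rain": {"rain_01": 0.9},
    "car": {"traffic_01": 0.7},
}

ranked = retrieval_scores("rain", word_sound_weights, sound_similarity)
for sound, score in ranked:
    print(f"{sound}: {score:.2f}")
```

In this toy network the unlabeled recording `rain_02` is still returned for the query "rain" because it is acoustically close to a labeled sound, which is the behavior the abstract describes for text-based retrieval of unlabeled audio.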

Original language: English (US)
Title of host publication: 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
Pages: 13-16
Number of pages: 4
DOIs: https://doi.org/10.1109/ASPAA.2009.5346493
State: Published - Dec 1 2009
Event: 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009 - New Paltz, NY, United States
Duration: Oct 18 2009 → Oct 21 2009

Publication series

Name: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Other

Other: 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
Country: United States
City: New Paltz, NY
Period: 10/18/09 → 10/21/09

Keywords

  • Acoustic signal analysis
  • Clustering methods
  • Database query processing
  • Hidden Markov models

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Computer Science Applications

Cite this

Wichern, G., Thornburg, H., & Spanias, A. (2009). Unifying semantic and content-based approaches for retrieval of environmental sounds. In 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009 (pp. 13-16). [5346493] (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics). https://doi.org/10.1109/ASPAA.2009.5346493