Unifying semantic and content-based approaches for retrieval of environmental sounds

Gordon Wichern; Harvey Thornburg; Andreas Spanias

doi:10.1109/ASPAA.2009.5346493

Unifying semantic and content-based approaches for retrieval of environmental sounds

Gordon Wichern, Harvey Thornburg, Andreas Spanias

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Scopus citations

Abstract

Creating a database of user-contributed recordings allows sounds to be linked not only by the semantic tags and labels applied to them, but also to other sounds with similar acoustic characteristics. Of paramount importance in navigating these databases are the problems of retrieving similar sounds using text or sound-based queries, and automatically annotating unlabeled sounds. We propose an integrated system, which can be used for text-based retrieval of unlabeled audio, content-based query-by-example, and automatic annotation. To this end, we introduce an ontological framework where sounds are connected to each other based on a measure of perceptual similarity, while words and sounds are connected by optimizing link weights given user preference data. Results on a freely available database of environmental sounds contributed and labeled by non-expert users, demonstrate effective average precision scores for both the text-based retrieval and annotation tasks.

Original language	English (US)
Title of host publication	2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
Pages	13-16
Number of pages	4
DOIs	https://doi.org/10.1109/ASPAA.2009.5346493
State	Published - 2009
Event	2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009 - New Paltz, NY, United States Duration: Oct 18 2009 → Oct 21 2009

Publication series

Name	IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Other

Other	2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
Country/Territory	United States
City	New Paltz, NY
Period	10/18/09 → 10/21/09

Keywords

Acoustic signal analysis
Clustering methods
Database query processing
Hidden markov models

ASJC Scopus subject areas

Electrical and Electronic Engineering
Computer Science Applications

Access to Document

10.1109/ASPAA.2009.5346493

Cite this

Wichern, G., Thornburg, H., & Spanias, A. (2009). Unifying semantic and content-based approaches for retrieval of environmental sounds. In 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009 (pp. 13-16). Article 5346493 (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics). https://doi.org/10.1109/ASPAA.2009.5346493

Unifying semantic and content-based approaches for retrieval of environmental sounds. / Wichern, Gordon; Thornburg, Harvey; Spanias, Andreas.
2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009. 2009. p. 13-16 5346493 (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Wichern, G, Thornburg, H & Spanias, A 2009, Unifying semantic and content-based approaches for retrieval of environmental sounds. in 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009., 5346493, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 13-16, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009, New Paltz, NY, United States, 10/18/09. https://doi.org/10.1109/ASPAA.2009.5346493

@inproceedings{0ae3f14991a942b8b2dd424c06c042c8,

title = "Unifying semantic and content-based approaches for retrieval of environmental sounds",

abstract = "Creating a database of user-contributed recordings allows sounds to be linked not only by the semantic tags and labels applied to them, but also to other sounds with similar acoustic characteristics. Of paramount importance in navigating these databases are the problems of retrieving similar sounds using text or sound-based queries, and automatically annotating unlabeled sounds. We propose an integrated system, which can be used for text-based retrieval of unlabeled audio, content-based query-by-example, and automatic annotation. To this end, we introduce an ontological framework where sounds are connected to each other based on a measure of perceptual similarity, while words and sounds are connected by optimizing link weights given user preference data. Results on a freely available database of environmental sounds contributed and labeled by non-expert users, demonstrate effective average precision scores for both the text-based retrieval and annotation tasks.",

keywords = "Acoustic signal analysis, Clustering methods, Database query processing, Hidden markov models",

author = "Gordon Wichern and Harvey Thornburg and Andreas Spanias",

year = "2009",

doi = "10.1109/ASPAA.2009.5346493",

language = "English (US)",

isbn = "9781424436798",

series = "IEEE Workshop on Applications of Signal Processing to Audio and Acoustics",

pages = "13--16",

booktitle = "2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009",

note = "2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009 ; Conference date: 18-10-2009 Through 21-10-2009",

}

TY - GEN

T1 - Unifying semantic and content-based approaches for retrieval of environmental sounds

AU - Wichern, Gordon

AU - Thornburg, Harvey

AU - Spanias, Andreas

PY - 2009

Y1 - 2009

N2 - Creating a database of user-contributed recordings allows sounds to be linked not only by the semantic tags and labels applied to them, but also to other sounds with similar acoustic characteristics. Of paramount importance in navigating these databases are the problems of retrieving similar sounds using text or sound-based queries, and automatically annotating unlabeled sounds. We propose an integrated system, which can be used for text-based retrieval of unlabeled audio, content-based query-by-example, and automatic annotation. To this end, we introduce an ontological framework where sounds are connected to each other based on a measure of perceptual similarity, while words and sounds are connected by optimizing link weights given user preference data. Results on a freely available database of environmental sounds contributed and labeled by non-expert users, demonstrate effective average precision scores for both the text-based retrieval and annotation tasks.

AB - Creating a database of user-contributed recordings allows sounds to be linked not only by the semantic tags and labels applied to them, but also to other sounds with similar acoustic characteristics. Of paramount importance in navigating these databases are the problems of retrieving similar sounds using text or sound-based queries, and automatically annotating unlabeled sounds. We propose an integrated system, which can be used for text-based retrieval of unlabeled audio, content-based query-by-example, and automatic annotation. To this end, we introduce an ontological framework where sounds are connected to each other based on a measure of perceptual similarity, while words and sounds are connected by optimizing link weights given user preference data. Results on a freely available database of environmental sounds contributed and labeled by non-expert users, demonstrate effective average precision scores for both the text-based retrieval and annotation tasks.

KW - Acoustic signal analysis

KW - Clustering methods

KW - Database query processing

KW - Hidden markov models

UR - http://www.scopus.com/inward/record.url?scp=76949096922&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=76949096922&partnerID=8YFLogxK

U2 - 10.1109/ASPAA.2009.5346493

DO - 10.1109/ASPAA.2009.5346493

M3 - Conference contribution

AN - SCOPUS:76949096922

SN - 9781424436798

T3 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

SP - 13

EP - 16

BT - 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009

T2 - 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009

Y2 - 18 October 2009 through 21 October 2009

ER -

Unifying semantic and content-based approaches for retrieval of environmental sounds

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this