TY - GEN
T1 - Unifying semantic and content-based approaches for retrieval of environmental sounds
AU - Wichern, Gordon
AU - Thornburg, Harvey
AU - Spanias, Andreas
PY - 2009
Y1 - 2009
N2 - Creating a database of user-contributed recordings allows sounds to be linked not only by the semantic tags and labels applied to them, but also to other sounds with similar acoustic characteristics. Of paramount importance in navigating these databases are the problems of retrieving similar sounds using text or sound-based queries, and automatically annotating unlabeled sounds. We propose an integrated system, which can be used for text-based retrieval of unlabeled audio, content-based query-by-example, and automatic annotation. To this end, we introduce an ontological framework where sounds are connected to each other based on a measure of perceptual similarity, while words and sounds are connected by optimizing link weights given user preference data. Results on a freely available database of environmental sounds contributed and labeled by non-expert users, demonstrate effective average precision scores for both the text-based retrieval and annotation tasks.
AB - Creating a database of user-contributed recordings allows sounds to be linked not only by the semantic tags and labels applied to them, but also to other sounds with similar acoustic characteristics. Of paramount importance in navigating these databases are the problems of retrieving similar sounds using text or sound-based queries, and automatically annotating unlabeled sounds. We propose an integrated system, which can be used for text-based retrieval of unlabeled audio, content-based query-by-example, and automatic annotation. To this end, we introduce an ontological framework where sounds are connected to each other based on a measure of perceptual similarity, while words and sounds are connected by optimizing link weights given user preference data. Results on a freely available database of environmental sounds contributed and labeled by non-expert users, demonstrate effective average precision scores for both the text-based retrieval and annotation tasks.
KW - Acoustic signal analysis
KW - Clustering methods
KW - Database query processing
KW - Hidden markov models
UR - http://www.scopus.com/inward/record.url?scp=76949096922&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=76949096922&partnerID=8YFLogxK
U2 - 10.1109/ASPAA.2009.5346493
DO - 10.1109/ASPAA.2009.5346493
M3 - Conference contribution
AN - SCOPUS:76949096922
SN - 9781424436798
T3 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
SP - 13
EP - 16
BT - 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
T2 - 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
Y2 - 18 October 2009 through 21 October 2009
ER -