TY - GEN
T1 - Combining semantic, social, and acoustic similarity for retrieval of environmental sounds
AU - Mechtley, Brandon
AU - Wichern, Gordon
AU - Thornburg, Harvey
AU - Spanias, Andreas
PY - 2010
Y1 - 2010
N2 - Recent work in audio information retrieval has demonstrated the effectiveness of combining semantic information, such as descriptive, tags with acoustic content. However, these methods largely ignore the possibility of tag queries that do not yet exist in the database and the possibility of similar terms. In this work, we propose a network structure integrating similarity between semantic tags, content-based similarity between environmental audio recordings, and the collective sound descriptions provided by a user community. We then demonstrate the effectiveness of our approach by comparing the use of existing similarity measures for incorporating new vocabulary into an audio annotation and retrieval system.
AB - Recent work in audio information retrieval has demonstrated the effectiveness of combining semantic information, such as descriptive, tags with acoustic content. However, these methods largely ignore the possibility of tag queries that do not yet exist in the database and the possibility of similar terms. In this work, we propose a network structure integrating similarity between semantic tags, content-based similarity between environmental audio recordings, and the collective sound descriptions provided by a user community. We then demonstrate the effectiveness of our approach by comparing the use of existing similarity measures for incorporating new vocabulary into an audio annotation and retrieval system.
KW - Acoustic signal analysis
KW - Database query processing
KW - Multimedia databases
KW - Semantic networks
UR - http://www.scopus.com/inward/record.url?scp=78049395849&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78049395849&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2010.5496225
DO - 10.1109/ICASSP.2010.5496225
M3 - Conference contribution
AN - SCOPUS:78049395849
SN - 9781424442966
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 2402
EP - 2405
BT - 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings
T2 - 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010
Y2 - 14 March 2010 through 19 March 2010
ER -