TY - GEN
T1 - Distortion-aware query-by-example for environmental sounds
AU - Wiehern, Gordon
AU - Xue, Jiachen
AU - Thornburg, Harvey
AU - Spanias, Andreas
PY - 2007
Y1 - 2007
N2 - There has been much recent progress in the technical infrastructure necessary to continuously characterize and archive all sounds that occur within a given space or human life. Efficient and intuitive access, however, remains a considerable challenge. In other domains, i.e., melody retrieval, query-by-example (QBE) has found considerable success in accessing music that matches a specific query. We propose an extension of the QBE paradigm to the broad class of natural and environmental sounds. These sounds occur frequently in continuous recordings, and are often difficult for humans to imitate. We utilize a probabilistic QBE scheme that is flexible in the presence of time, level, and scale distortions along with a clustering approach to efficiently organize and retrieve the archived audio. Experiments on a test database demonstrate accurate retrieval of archived sounds, whose relevance to example queries is determined by human users.
AB - There has been much recent progress in the technical infrastructure necessary to continuously characterize and archive all sounds that occur within a given space or human life. Efficient and intuitive access, however, remains a considerable challenge. In other domains, i.e., melody retrieval, query-by-example (QBE) has found considerable success in accessing music that matches a specific query. We propose an extension of the QBE paradigm to the broad class of natural and environmental sounds. These sounds occur frequently in continuous recordings, and are often difficult for humans to imitate. We utilize a probabilistic QBE scheme that is flexible in the presence of time, level, and scale distortions along with a clustering approach to efficiently organize and retrieve the archived audio. Experiments on a test database demonstrate accurate retrieval of archived sounds, whose relevance to example queries is determined by human users.
UR - http://www.scopus.com/inward/record.url?scp=50249158039&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50249158039&partnerID=8YFLogxK
U2 - 10.1109/ASPAA.2007.4393051
DO - 10.1109/ASPAA.2007.4393051
M3 - Conference contribution
AN - SCOPUS:50249158039
SN - 9781424416196
T3 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
SP - 335
EP - 338
BT - 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA
T2 - 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA
Y2 - 21 October 2007 through 24 October 2007
ER -