TY - GEN
T1 - A scalable feature learning and tag prediction framework for natural environment sounds
AU - Sattigeri, P.
AU - Thiagarajan, J. J.
AU - Shah, M.
AU - Ramamurthy, K. N.
AU - Spanias, Andreas
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2015/4/24
Y1 - 2015/4/24
N2 - Building feature extraction approaches that can effectively characterize natural environment sounds is challenging due to the dynamic nature. In this paper, we develop a framework for feature extraction and obtaining semantic inferences from such data. In particular, we propose a new pooling strategy for deep architectures, that can preserve the temporal dynamics in the resulting representation. By constructing an ensemble of semantic embeddings, we employ an l1-reconstruction based prediction algorithm for estimating the relevant tags. We evaluate our approach on challenging environmental sound recognition datasets, and show that the proposed features outperform traditional spectral features.
AB - Building feature extraction approaches that can effectively characterize natural environment sounds is challenging due to the dynamic nature. In this paper, we develop a framework for feature extraction and obtaining semantic inferences from such data. In particular, we propose a new pooling strategy for deep architectures, that can preserve the temporal dynamics in the resulting representation. By constructing an ensemble of semantic embeddings, we employ an l1-reconstruction based prediction algorithm for estimating the relevant tags. We evaluate our approach on challenging environmental sound recognition datasets, and show that the proposed features outperform traditional spectral features.
UR - http://www.scopus.com/inward/record.url?scp=84940510433&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84940510433&partnerID=8YFLogxK
U2 - 10.1109/ACSSC.2014.7094773
DO - 10.1109/ACSSC.2014.7094773
M3 - Conference contribution
AN - SCOPUS:84940510433
T3 - Conference Record - Asilomar Conference on Signals, Systems and Computers
SP - 1779
EP - 1783
BT - Conference Record of the 48th Asilomar Conference on Signals, Systems and Computers
A2 - Matthews, Michael B.
PB - IEEE Computer Society
T2 - 48th Asilomar Conference on Signals, Systems and Computers, ACSSC 2015
Y2 - 2 November 2014 through 5 November 2014
ER -