Elastic functional coding of human actions: From vector-fields to latent variables

Rushil Anirudh; Pavan Turaga; Jingyong Su; Anuj Srivastava

doi:10.1109/CVPR.2015.7298934

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

74 Scopus citations

Abstract

Human activities observed from visual sensors often give rise to a sequence of smoothly varying features. In many cases, the space of features can be formally defined as a manifold, where the action becomes a trajectory on the manifold. Such trajectories are high dimensional in addition to being non-linear, which can severely limit computations on them. We also argue that by their nature, human actions themselves lie on a much lower dimensional manifold compared to the high dimensional feature space. Learning an accurate low dimensional embedding for actions could have a huge impact in the areas of efficient search and retrieval, visualization, learning, and recognition. Traditional manifold learning addresses this problem for static points in ℝⁿ, but its extension to trajectories on Riemannian manifolds is non-trivial and has remained unexplored. The challenge arises from the inherent non-linearity and from temporal variability, which can significantly distort the distance metric between trajectories. To address these issues we use the transport square-root velocity function (TSRVF) space, a recently proposed representation that provides a metric which has favorable theoretical properties such as invariance to group action. We propose to learn the low dimensional embedding with a manifold functional variant of principal component analysis (mfPCA). We show that mfPCA effectively models the manifold trajectories in several applications such as action recognition, clustering and diverse sequence sampling while reducing the dimensionality by a factor of ∼250×. The mfPCA features can also be reconstructed back to the original manifold to allow for easy visualization of the latent variable space.
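
As a rough illustration of the latent-variable idea described in the abstract, the sketch below applies ordinary PCA to per-action vector fields that are assumed to be already computed and expressed in one common tangent space. It is not the authors' implementation: the TSRVF computation, temporal alignment, and mapping back to the manifold are omitted, the data are synthetic, and the names `fit_pca`, `encode`, and `decode` are hypothetical.

```python
import numpy as np

def fit_pca(fields, k):
    """Fit a k-dimensional linear basis to vector fields of shape (N, T, d)."""
    n = fields.shape[0]
    flat = fields.reshape(n, -1)            # one long vector per action
    mean = flat.mean(axis=0)
    # Rows of vt are the principal directions of the centered data.
    _, _, vt = np.linalg.svd(flat - mean, full_matrices=False)
    return mean, vt[:k]

def encode(fields, mean, basis):
    """Project each flattened vector field onto the basis (latent codes)."""
    flat = fields.reshape(fields.shape[0], -1)
    return (flat - mean) @ basis.T

def decode(codes, mean, basis, shape):
    """Map latent codes back to vector fields of the given (T, d) shape."""
    return (codes @ basis + mean).reshape((codes.shape[0],) + shape)

# Synthetic stand-in for tangent-space vector fields (assumption: the real
# inputs would come from a TSRVF pipeline, which is not implemented here).
rng = np.random.default_rng(0)
n_actions, n_frames, dim, k = 40, 100, 3, 2
latent = rng.normal(size=(n_actions, k))
mixing = rng.normal(size=(k, n_frames * dim))
noise = 0.01 * rng.normal(size=(n_actions, n_frames * dim))
fields = (latent @ mixing + noise).reshape(n_actions, n_frames, dim)

mean, basis = fit_pca(fields, k)
codes = encode(fields, mean, basis)
recon = decode(codes, mean, basis, (n_frames, dim))

rel_err = np.linalg.norm(recon - fields) / np.linalg.norm(fields)
reduction = (n_frames * dim) / k            # per-action dimensionality ratio
print(f"code shape: {codes.shape}, reduction: {reduction:.0f}x, "
      f"relative reconstruction error: {rel_err:.4f}")
```

The reduction factor here is simply the flattened length of one vector field divided by the number of retained components; the figure quoted in the abstract depends on the authors' features and datasets and is not reproduced by this toy example.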

Original language	English (US)
Title of host publication	IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015
Publisher	IEEE Computer Society
Pages	3147-3155
Number of pages	9
ISBN (Electronic)	9781467369640
DOIs	https://doi.org/10.1109/CVPR.2015.7298934
State	Published - Oct 14 2015
Event	IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 - Boston, United States
Duration	Jun 7 2015 → Jun 12 2015

Publication series

Name	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume	07-12-June-2015
ISSN (Print)	1063-6919

Other

Other	IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015
Country/Territory	United States
City	Boston
Period	6/7/15 → 6/12/15

ASJC Scopus subject areas

Software
Computer Vision and Pattern Recognition

Access to Document

10.1109/CVPR.2015.7298934

Cite this

Anirudh, R., Turaga, P., Su, J., & Srivastava, A. (2015). Elastic functional coding of human actions: From vector-fields to latent variables. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 (pp. 3147-3155). Article 7298934 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; Vol. 07-12-June-2015). IEEE Computer Society. https://doi.org/10.1109/CVPR.2015.7298934

Elastic functional coding of human actions: From vector-fields to latent variables. / Anirudh, Rushil; Turaga, Pavan; Su, Jingyong et al.
IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015. IEEE Computer Society, 2015. p. 3147-3155 7298934 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; Vol. 07-12-June-2015).

Anirudh, R, Turaga, P, Su, J & Srivastava, A 2015, Elastic functional coding of human actions: From vector-fields to latent variables. in IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015., 7298934, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 07-12-June-2015, IEEE Computer Society, pp. 3147-3155, IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, United States, 6/7/15. https://doi.org/10.1109/CVPR.2015.7298934

Anirudh R, Turaga P, Su J, Srivastava A. Elastic functional coding of human actions: From vector-fields to latent variables. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015. IEEE Computer Society. 2015. p. 3147-3155. 7298934. (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition). doi: 10.1109/CVPR.2015.7298934

@inproceedings{fae48b6c5fcb48b2ac93822a889c8299,

title = "Elastic functional coding of human actions: From vector-fields to latent variables",

abstract = "Human activities observed from visual sensors often give rise to a sequence of smoothly varying features. In many cases, the space of features can be formally defined as a manifold, where the action becomes a trajectory on the manifold. Such trajectories are high dimensional in addition to being non-linear, which can severely limit computations on them. We also argue that by their nature, human actions themselves lie on a much lower dimensional manifold compared to the high dimensional feature space. Learning an accurate low dimensional embedding for actions could have a huge impact in the areas of efficient search and retrieval, visualization, learning, and recognition. Traditional manifold learning addresses this problem for static points in ℝⁿ, but its extension to trajectories on Riemannian manifolds is non-trivial and has remained unexplored. The challenge arises due to the inherent non-linearity, and temporal variability that can significantly distort the distance metric between trajectories. To address these issues we use the transport square-root velocity function (TSRVF) space, a recently proposed representation that provides a metric which has favorable theoretical properties such as invariance to group action. We propose to learn the low dimensional embedding with a manifold functional variant of principal component analysis (mfPCA). We show that mf-PCA effectively models the manifold trajectories in several applications such as action recognition, clustering and diverse sequence sampling while reducing the dimensionality by a factor of ∼ 250×. The mfPCA features can also be reconstructed back to the original manifold to allow for easy visualization of the latent variable space.",

author = "Rushil Anirudh and Pavan Turaga and Jingyong Su and Anuj Srivastava",

note = "Publisher Copyright: {\textcopyright} 2015 IEEE.; IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 ; Conference date: 07-06-2015 Through 12-06-2015",

year = "2015",

month = oct,

day = "14",

doi = "10.1109/CVPR.2015.7298934",

language = "English (US)",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "3147--3155",

booktitle = "IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015",

}

TY - GEN

T1 - Elastic functional coding of human actions

T2 - IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015

AU - Anirudh, Rushil

AU - Turaga, Pavan

AU - Su, Jingyong

AU - Srivastava, Anuj

PY - 2015/10/14

Y1 - 2015/10/14

N2 - Human activities observed from visual sensors often give rise to a sequence of smoothly varying features. In many cases, the space of features can be formally defined as a manifold, where the action becomes a trajectory on the manifold. Such trajectories are high dimensional in addition to being non-linear, which can severely limit computations on them. We also argue that by their nature, human actions themselves lie on a much lower dimensional manifold compared to the high dimensional feature space. Learning an accurate low dimensional embedding for actions could have a huge impact in the areas of efficient search and retrieval, visualization, learning, and recognition. Traditional manifold learning addresses this problem for static points in ℝⁿ, but its extension to trajectories on Riemannian manifolds is non-trivial and has remained unexplored. The challenge arises due to the inherent non-linearity, and temporal variability that can significantly distort the distance metric between trajectories. To address these issues we use the transport square-root velocity function (TSRVF) space, a recently proposed representation that provides a metric which has favorable theoretical properties such as invariance to group action. We propose to learn the low dimensional embedding with a manifold functional variant of principal component analysis (mfPCA). We show that mf-PCA effectively models the manifold trajectories in several applications such as action recognition, clustering and diverse sequence sampling while reducing the dimensionality by a factor of ∼ 250×. The mfPCA features can also be reconstructed back to the original manifold to allow for easy visualization of the latent variable space.

UR - http://www.scopus.com/inward/record.url?scp=84959193616&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84959193616&partnerID=8YFLogxK

U2 - 10.1109/CVPR.2015.7298934

DO - 10.1109/CVPR.2015.7298934

M3 - Conference contribution

AN - SCOPUS:84959193616

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 3147

EP - 3155

BT - IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015

PB - IEEE Computer Society

Y2 - 7 June 2015 through 12 June 2015

ER -
