Action attribute detection from sports videos with contextual constraints

Xiaodong Yu; Ching Lik Teo; Yezhou Yang; Cornelia Fermüller; Yiannis Aloimonos

doi:10.5244/C.27.79

Action attribute detection from sports videos with contextual constraints

Xiaodong Yu, Ching Lik Teo, Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

In this paper, we are interested in detecting action attributes from sports videos for event understanding and video analysis. Action attribute is a middle layer between low level motion features and high level action classes, which includes various motion patterns of human limbs and bodies and the interaction between human and objects. Successfully detecting action attributes provides a richer video description that facilitates many other important tasks, such action classification, video understanding, automatic video transcript, etc. A naive approach to deal with this challenging problem is to train a classifier for each attribute and then use them to detect attributes in novel videos independently. However, this independence assumption is often too strong, and as we show in our experiments, produces a large number of false positives in practice. We propose a novel approach that incorporates the contextual constraints for activity attribute detection. The temporal contexts within an attribute and the co-occurrence contexts between different attributes are modelled by a factorial conditional random field, which encourages agreement between different time points and attributes. The effectiveness of our methods are clearly illustrated by the experimental evaluations.

Original language	English (US)
Title of host publication	BMVC 2013 - Electronic Proceedings of the British Machine Vision Conference 2013
Publisher	British Machine Vision Association, BMVA
DOIs	https://doi.org/10.5244/C.27.79
State	Published - 2013
Externally published	Yes
Event	2013 24th British Machine Vision Conference, BMVC 2013 - Bristol, United Kingdom Duration: Sep 9 2013 → Sep 13 2013

Other

Other	2013 24th British Machine Vision Conference, BMVC 2013
Country/Territory	United Kingdom
City	Bristol
Period	9/9/13 → 9/13/13

ASJC Scopus subject areas

Computer Vision and Pattern Recognition

Access to Document

10.5244/C.27.79

Cite this

Yu, X, Teo, CL, Yang, Y, Fermüller, C & Aloimonos, Y 2013, Action attribute detection from sports videos with contextual constraints. in BMVC 2013 - Electronic Proceedings of the British Machine Vision Conference 2013. British Machine Vision Association, BMVA, 2013 24th British Machine Vision Conference, BMVC 2013, Bristol, United Kingdom, 9/9/13. https://doi.org/10.5244/C.27.79

@inproceedings{5cc3252a03f740e68f9e727162c9f66c,

title = "Action attribute detection from sports videos with contextual constraints",

abstract = "In this paper, we are interested in detecting action attributes from sports videos for event understanding and video analysis. Action attribute is a middle layer between low level motion features and high level action classes, which includes various motion patterns of human limbs and bodies and the interaction between human and objects. Successfully detecting action attributes provides a richer video description that facilitates many other important tasks, such action classification, video understanding, automatic video transcript, etc. A naive approach to deal with this challenging problem is to train a classifier for each attribute and then use them to detect attributes in novel videos independently. However, this independence assumption is often too strong, and as we show in our experiments, produces a large number of false positives in practice. We propose a novel approach that incorporates the contextual constraints for activity attribute detection. The temporal contexts within an attribute and the co-occurrence contexts between different attributes are modelled by a factorial conditional random field, which encourages agreement between different time points and attributes. The effectiveness of our methods are clearly illustrated by the experimental evaluations.",

author = "Xiaodong Yu and Teo, {Ching Lik} and Yezhou Yang and Cornelia Ferm{\"u}ller and Yiannis Aloimonos",

year = "2013",

doi = "10.5244/C.27.79",

language = "English (US)",

booktitle = "BMVC 2013 - Electronic Proceedings of the British Machine Vision Conference 2013",

publisher = "British Machine Vision Association, BMVA",

note = "2013 24th British Machine Vision Conference, BMVC 2013 ; Conference date: 09-09-2013 Through 13-09-2013",

}

TY - GEN

T1 - Action attribute detection from sports videos with contextual constraints

AU - Yu, Xiaodong

AU - Teo, Ching Lik

AU - Yang, Yezhou

AU - Fermüller, Cornelia

AU - Aloimonos, Yiannis

PY - 2013

Y1 - 2013

N2 - In this paper, we are interested in detecting action attributes from sports videos for event understanding and video analysis. Action attribute is a middle layer between low level motion features and high level action classes, which includes various motion patterns of human limbs and bodies and the interaction between human and objects. Successfully detecting action attributes provides a richer video description that facilitates many other important tasks, such action classification, video understanding, automatic video transcript, etc. A naive approach to deal with this challenging problem is to train a classifier for each attribute and then use them to detect attributes in novel videos independently. However, this independence assumption is often too strong, and as we show in our experiments, produces a large number of false positives in practice. We propose a novel approach that incorporates the contextual constraints for activity attribute detection. The temporal contexts within an attribute and the co-occurrence contexts between different attributes are modelled by a factorial conditional random field, which encourages agreement between different time points and attributes. The effectiveness of our methods are clearly illustrated by the experimental evaluations.

AB - In this paper, we are interested in detecting action attributes from sports videos for event understanding and video analysis. Action attribute is a middle layer between low level motion features and high level action classes, which includes various motion patterns of human limbs and bodies and the interaction between human and objects. Successfully detecting action attributes provides a richer video description that facilitates many other important tasks, such action classification, video understanding, automatic video transcript, etc. A naive approach to deal with this challenging problem is to train a classifier for each attribute and then use them to detect attributes in novel videos independently. However, this independence assumption is often too strong, and as we show in our experiments, produces a large number of false positives in practice. We propose a novel approach that incorporates the contextual constraints for activity attribute detection. The temporal contexts within an attribute and the co-occurrence contexts between different attributes are modelled by a factorial conditional random field, which encourages agreement between different time points and attributes. The effectiveness of our methods are clearly illustrated by the experimental evaluations.

UR - http://www.scopus.com/inward/record.url?scp=84898428895&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84898428895&partnerID=8YFLogxK

U2 - 10.5244/C.27.79

DO - 10.5244/C.27.79

M3 - Conference contribution

AN - SCOPUS:84898428895

BT - BMVC 2013 - Electronic Proceedings of the British Machine Vision Conference 2013

PB - British Machine Vision Association, BMVA

T2 - 2013 24th British Machine Vision Conference, BMVC 2013

Y2 - 9 September 2013 through 13 September 2013

ER -

Action attribute detection from sports videos with contextual constraints

Abstract

Other

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this