Using a minimal action grammar for activity understanding in the real world

Douglas Summers-Stay; Ching L. Teo; Yezhou Yang; Cornelia Fermuller; Yiannis Aloimonos

doi:10.1109/IROS.2012.6385483

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

30 Scopus citations

Abstract

There is good reason to believe that humans use some kind of recursive grammatical structure when we recognize and perform complex manipulation activities. We have built a system to automatically build a tree structure from observations of an actor performing such activities. The activity trees that result form a framework for search and understanding, tying action to language. We explore and evaluate the system by performing experiments over a novel complex activity dataset taken using synchronized Kinect and SR4000 Time of Flight cameras. Processing of the combined 3D and 2D image data provides the necessary terminals and events to build the tree from the bottom-up. Experimental results highlight the contribution of the action grammar in: 1) providing a robust structure for complex activity recognition over real data and 2) disambiguating interleaved activities from within the same sequence.

Original language	English (US)
Title of host publication	2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012
Pages	4104-4111
Number of pages	8
DOIs	https://doi.org/10.1109/IROS.2012.6385483
State	Published - 2012
Externally published	Yes
Event	25th IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012 - Vilamoura, Algarve, Portugal
Duration	Oct 7 2012 → Oct 12 2012

Publication series

Name	IEEE International Conference on Intelligent Robots and Systems
ISSN (Print)	2153-0858
ISSN (Electronic)	2153-0866

Other

Other	25th IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012
Country/Territory	Portugal
City	Vilamoura, Algarve
Period	Oct 7 2012 → Oct 12 2012

ASJC Scopus subject areas

Control and Systems Engineering
Software
Computer Vision and Pattern Recognition
Computer Science Applications

Access to Document

10.1109/IROS.2012.6385483

Cite this

Summers-Stay, D., Teo, C. L., Yang, Y., Fermuller, C., & Aloimonos, Y. (2012). Using a minimal action grammar for activity understanding in the real world. In 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012 (pp. 4104-4111). Article 6385483 (IEEE International Conference on Intelligent Robots and Systems). https://doi.org/10.1109/IROS.2012.6385483

Using a minimal action grammar for activity understanding in the real world. / Summers-Stay, Douglas; Teo, Ching L.; Yang, Yezhou et al.
2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012. 2012. p. 4104-4111 6385483 (IEEE International Conference on Intelligent Robots and Systems).

Summers-Stay, D, Teo, CL, Yang, Y, Fermuller, C & Aloimonos, Y 2012, Using a minimal action grammar for activity understanding in the real world. in 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012, 6385483, IEEE International Conference on Intelligent Robots and Systems, pp. 4104-4111, 25th IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012, Vilamoura, Algarve, Portugal, Oct 7 2012. https://doi.org/10.1109/IROS.2012.6385483

@inproceedings{04661771069948fca0f17e392b54019f,

title = "Using a minimal action grammar for activity understanding in the real world",

abstract = "There is good reason to believe that humans use some kind of recursive grammatical structure when we recognize and perform complex manipulation activities. We have built a system to automatically build a tree structure from observations of an actor performing such activities. The activity trees that result form a framework for search and understanding, tying action to language. We explore and evaluate the system by performing experiments over a novel complex activity dataset taken using synchronized Kinect and SR4000 Time of Flight cameras. Processing of the combined 3D and 2D image data provides the necessary terminals and events to build the tree from the bottom-up. Experimental results highlight the contribution of the action grammar in: 1) providing a robust structure for complex activity recognition over real data and 2) disambiguating interleaved activities from within the same sequence.",

author = "Douglas Summers-Stay and Ching L. Teo and Yezhou Yang and Cornelia Fermuller and Yiannis Aloimonos",

year = "2012",

doi = "10.1109/IROS.2012.6385483",

language = "English (US)",

isbn = "9781467317375",

series = "IEEE International Conference on Intelligent Robots and Systems",

pages = "4104--4111",

booktitle = "2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012",

note = "25th IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012 ; Conference date: 07-10-2012 Through 12-10-2012",

}

TY - GEN

T1 - Using a minimal action grammar for activity understanding in the real world

AU - Summers-Stay, Douglas

AU - Teo, Ching L.

AU - Yang, Yezhou

AU - Fermuller, Cornelia

AU - Aloimonos, Yiannis

PY - 2012

Y1 - 2012

AB - There is good reason to believe that humans use some kind of recursive grammatical structure when we recognize and perform complex manipulation activities. We have built a system to automatically build a tree structure from observations of an actor performing such activities. The activity trees that result form a framework for search and understanding, tying action to language. We explore and evaluate the system by performing experiments over a novel complex activity dataset taken using synchronized Kinect and SR4000 Time of Flight cameras. Processing of the combined 3D and 2D image data provides the necessary terminals and events to build the tree from the bottom-up. Experimental results highlight the contribution of the action grammar in: 1) providing a robust structure for complex activity recognition over real data and 2) disambiguating interleaved activities from within the same sequence.

UR - http://www.scopus.com/inward/record.url?scp=84872358456&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84872358456&partnerID=8YFLogxK

U2 - 10.1109/IROS.2012.6385483

DO - 10.1109/IROS.2012.6385483

M3 - Conference contribution

AN - SCOPUS:84872358456

SN - 9781467317375

T3 - IEEE International Conference on Intelligent Robots and Systems

SP - 4104

EP - 4111

BT - 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012

T2 - 25th IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2012

Y2 - 7 October 2012 through 12 October 2012

ER -
