Modeling and visualization of human activities for multicamera networks

Aswin C. Sankaranarayanan; Robert Patro; Pavan Turaga; Amitabh Varshney; Rama Chellappa

doi:10.1155/2009/259860

Research output: Contribution to journal › Article › peer-review

10 Scopus citations

Abstract

Multicamera networks are becoming more complex, covering larger sensing areas in order to capture activities and behavior that evolve over long spatial and temporal windows. This necessitates novel methods to process the information sensed by the network and visualize it for an end user. In this paper, we describe a system for modeling and on-demand visualization of the activities of groups of humans. Using prior knowledge of the 3D structure of the scene as well as camera calibration, the system localizes humans as they navigate the scene. Activities of interest are detected by matching models of these activities, learnt a priori, against the multiview observations. The trajectories and the activity index for each individual summarize the dynamic content of the scene. These are used to render the scene with virtual 3D human models that mimic the observed activities of real humans. In particular, the rendering framework is designed to handle large displays with a cluster of GPUs and to reduce cognitive dissonance by rendering realistic weather effects and illumination. We envision use of this system for immersive visualization as well as summarization of videos that capture group behavior.

Original language	English (US)
Article number	259860
Journal	Eurasip Journal on Image and Video Processing
Volume	2009
DOIs	https://doi.org/10.1155/2009/259860
State	Published - 2009
Externally published	Yes

ASJC Scopus subject areas

Signal Processing
Information Systems
Electrical and Electronic Engineering

Cite this

@article{7e553e8d2b0849e79a7a99ed515e8e09,
  title = "Modeling and visualization of human activities for multicamera networks",
  abstract = "Multicamera networks are becoming complex involving larger sensing areas in order to capture activities and behavior that evolve over long spatial and temporal windows. This necessitates novel methods to process the information sensed by the network and visualize it for an end user. In this paper, we describe a system for modeling and on-demand visualization of activities of groups of humans. Using the prior knowledge of the 3D structure of the scene as well as camera calibration, the system localizes humans as they navigate the scene. Activities of interest are detected by matching models of these activities learnt a priori against the multiview observations. The trajectories and the activity index for each individual summarize the dynamic content of the scene. These are used to render the scene with virtual 3D human models that mimic the observed activities of real humans. In particular, the rendering framework is designed to handle large displays with a cluster of GPUs as well as reduce the cognitive dissonance by rendering realistic weather effects and illumination. We envision use of this system for immersive visualization as well as summarization of videos that capture group behavior.",
  author = "Sankaranarayanan, {Aswin C.} and Robert Patro and Pavan Turaga and Amitabh Varshney and Rama Chellappa",
  note = "Funding Information: This work was supported by DARPA Flexiview Grant HR001107C0059 and NSF CNS 04-03313.",
  year = "2009",
  doi = "10.1155/2009/259860",
  language = "English (US)",
  volume = "2009",
  journal = "Eurasip Journal on Image and Video Processing",
  issn = "1687-5176",
  publisher = "Springer Publishing Company",
}

TY - JOUR
T1 - Modeling and visualization of human activities for multicamera networks
AU - Sankaranarayanan, Aswin C.
AU - Patro, Robert
AU - Turaga, Pavan
AU - Varshney, Amitabh
AU - Chellappa, Rama
N1 - Funding Information: This work was supported by DARPA Flexiview Grant HR001107C0059 and NSF CNS 04-03313.
PY - 2009
Y1 - 2009

AB - Multicamera networks are becoming complex involving larger sensing areas in order to capture activities and behavior that evolve over long spatial and temporal windows. This necessitates novel methods to process the information sensed by the network and visualize it for an end user. In this paper, we describe a system for modeling and on-demand visualization of activities of groups of humans. Using the prior knowledge of the 3D structure of the scene as well as camera calibration, the system localizes humans as they navigate the scene. Activities of interest are detected by matching models of these activities learnt a priori against the multiview observations. The trajectories and the activity index for each individual summarize the dynamic content of the scene. These are used to render the scene with virtual 3D human models that mimic the observed activities of real humans. In particular, the rendering framework is designed to handle large displays with a cluster of GPUs as well as reduce the cognitive dissonance by rendering realistic weather effects and illumination. We envision use of this system for immersive visualization as well as summarization of videos that capture group behavior.

UR - http://www.scopus.com/inward/record.url?scp=76649133794&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=76649133794&partnerID=8YFLogxK
U2 - 10.1155/2009/259860
DO - 10.1155/2009/259860
M3 - Article
AN - SCOPUS:76649133794
SN - 1687-5176
VL - 2009
JO - Eurasip Journal on Image and Video Processing
JF - Eurasip Journal on Image and Video Processing
M1 - 259860
ER -
