Content based indexing in the MPEG - [1, 2 & 4] domains

M. Zubair, J. Bhalod, Sethuraman Panchanathan

Research output: Contribution to journal, Article

Abstract

Retrieval of videos from large databases using the inherent content as a key is an important and challenging problem with many applications. The large volume of data associated with visual information presents challenges from the perspectives of storage, browsing, indexing and retrieval. The Moving Picture Experts Group (MPEG) has addressed the issue of compression by proposing a family of video compression standards, namely MPEG 1, MPEG 2 and MPEG 4. In this paper, we propose a unified scheme for indexing the visual content in the MPEG 1, 2, and 4 domains. A video is first segmented into elemental units called shots. In the case of MPEG 1 and 2 videos containing simple camera operations (without significant object motion), we propose to generate a mosaic which is representative of the visual content of the entire shot. This contrasts with existing approaches, in which videos with both little and large motion employ one of the frames (say the first frame) of the shot as a representative key frame. In the case of MPEG 4 videos, sprites (proposed by the MPEG 4 standard) are used as the mosaic reflecting the background content of the shot. We propose a scheme for indexing the visual content by extracting features from the mosaic/key frame, which is tagged along with the temporal parameters obtained from the shot. The quantification and qualification of the color and texture information in the key frames are obtained by using a supervised classifier. The shape information is extracted at the local and global levels using the concept of edge histograms. In addition, the shape information available from the binary alpha planes of the foreground video object in MPEG 4 is approximated by a B-spline representation and used as a feature vector. A representation scheme has been developed which generates an XML file that contains the extracted content descriptors in accordance with the Data Description Language (DDL) of MPEG 7.
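The abstract does not give the details of the edge-histogram features or of the XML output, so the following is only a minimal sketch of the general idea, under stated assumptions: a key frame is taken to be a 2-D list of grayscale values, edges are found with simple central differences, orientations are quantized into four bins, and the "local" level is a fixed 2 x 2 grid of blocks. The function names, the bin count, the threshold and the XML element names (`Shot`, `GlobalEdgeHistogram`, `LocalEdgeHistogram`) are all hypothetical and are not taken from the paper or from the MPEG 7 DDL.

```python
import math
import xml.etree.ElementTree as ET


def edge_histogram(gray, bins=4, threshold=1.0):
    """Normalized histogram of gradient orientations over edge pixels.

    `gray` is a 2-D list of grayscale values (an assumed input format).
    """
    hist = [0] * bins
    rows, cols = len(gray), len(gray[0])
    for y in range(1, rows - 1):
        for x in range(1, cols - 1):
            # Central-difference gradient (an assumption; any edge operator would do).
            gx = gray[y][x + 1] - gray[y][x - 1]
            gy = gray[y + 1][x] - gray[y - 1][x]
            if math.hypot(gx, gy) < threshold:
                continue  # not an edge pixel
            angle = math.atan2(gy, gx) % math.pi  # orientation folded into [0, pi)
            hist[min(int(angle / math.pi * bins), bins - 1)] += 1
    total = sum(hist)
    return [h / total for h in hist] if total else [0.0] * bins


def local_edge_histograms(gray, grid=2, bins=4, threshold=1.0):
    """One edge histogram per block of a grid x grid partition, in row-major order."""
    rows, cols = len(gray), len(gray[0])
    result = []
    for by in range(grid):
        for bx in range(grid):
            y0, y1 = by * rows // grid, (by + 1) * rows // grid
            x0, x1 = bx * cols // grid, (bx + 1) * cols // grid
            block = [row[x0:x1] for row in gray[y0:y1]]
            result.append(edge_histogram(block, bins, threshold))
    return result


def descriptors_to_xml(shot_id, global_hist, local_hists):
    """Serialize the descriptors to XML using hypothetical element names."""
    root = ET.Element("Shot", id=str(shot_id))
    ET.SubElement(root, "GlobalEdgeHistogram").text = " ".join(
        f"{v:.3f}" for v in global_hist
    )
    for index, hist in enumerate(local_hists):
        block = ET.SubElement(root, "LocalEdgeHistogram", block=str(index))
        block.text = " ".join(f"{v:.3f}" for v in hist)
    return ET.tostring(root, encoding="unicode")


# Toy 8 x 8 "key frame" with a single vertical step edge near the left side.
frame = [[0, 0, 10, 10, 10, 10, 10, 10] for _ in range(8)]
global_hist = edge_histogram(frame)
local_hists = local_edge_histograms(frame)
xml_text = descriptors_to_xml(1, global_hist, local_hists)
```

In this toy frame, only the two left-hand blocks contain edge pixels, which illustrates why a local histogram can separate frames that share the same global histogram.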

Original language: English (US)
Pages (from-to): 183-194
Number of pages: 12
Journal: Unknown Journal
Volume: 4210
State: Published - 2000

Fingerprint

Data description
Image compression
XML
Splines
shot
Classifiers
Language
Color
Textures
Cameras
Databases
retrieval
video compression
qualifications
files
histograms

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Condensed Matter Physics

Cite this

Content based indexing in the MPEG - [1, 2 & 4] domains. / Zubair, M.; Bhalod, J.; Panchanathan, Sethuraman.

In: Unknown Journal, Vol. 4210, 2000, p. 183-194.

@article{b2eed1cb71614b74a0f69adbf41a8021,
title = "Content based indexing in the MPEG - [1, 2 & 4] domains",
author = "M. Zubair and J. Bhalod and Sethuraman Panchanathan",
year = "2000",
language = "English (US)",
volume = "4210",
pages = "183--194",
journal = "Scanning Electron Microscopy",
issn = "0586-5581",
publisher = "Scanning Microscopy International",

}

TY - JOUR

T1 - Content based indexing in the MPEG - [1, 2 & 4] domains

AU - Zubair, M.

AU - Bhalod, J.

AU - Panchanathan, Sethuraman

PY - 2000

Y1 - 2000

UR - http://www.scopus.com/inward/record.url?scp=0034427492&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034427492&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:0034427492

VL - 4210

SP - 183

EP - 194

JO - Scanning Electron Microscopy

JF - Scanning Electron Microscopy

SN - 0586-5581

ER -