Bridging the semantic gap in sports video retrieval and summarization

Baoxin Li, James H. Errico, Hao Pan, Ibrahim Sezan

Research output: Contribution to journalArticle

34 Citations (Scopus)

Abstract

One of the major challenges facing current media management systems and related applications is the so-called "semantic gap" between the rich meaning that a user desires and the shallowness of the content descriptions that are automatically extracted from the media. In this paper, we address the problem of bridging this gap in the sports domain. We propose a general framework for indexing and summarizing sports broadcast programs, with a high-level model of sports broadcast video using the concept of an event, defined according to domain-specific knowledge for different types of sports. Within this general framework, we develop automatic event detection algorithms that are based on automatic analysis of the visual and aural signals in the media. We have successfully applied the event detection algorithms to different types of sports including American football, baseball, Japanese sumo wrestling, and soccer. Event modeling and detection contribute to the reduction of the semantic gap by providing rudimentary semantic information obtained through media analysis. We further propose a novel approach, which makes use of independently generated rich textual metadata, to fill the gap completely through synchronization of the information-laden textual data with the basic event segments. We implemented an MPEG-7 compliant browsing system for semantic retrieval and summarization of sports video using the proposed algorithms.

Original languageEnglish (US)
Pages (from-to)393-424
Number of pages32
JournalJournal of Visual Communication and Image Representation
Volume15
Issue number3
DOIs
StatePublished - Sep 2004
Externally publishedYes

Fingerprint

Sports
Semantics
Metadata
Synchronization

Keywords

  • Event detection
  • Semantic video analysis
  • Video summarization

ASJC Scopus subject areas

  • Computer Vision and Pattern Recognition
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

Bridging the semantic gap in sports video retrieval and summarization. / Li, Baoxin; Errico, James H.; Pan, Hao; Sezan, Ibrahim.

In: Journal of Visual Communication and Image Representation, Vol. 15, No. 3, 09.2004, p. 393-424.

Research output: Contribution to journalArticle

Li, Baoxin ; Errico, James H. ; Pan, Hao ; Sezan, Ibrahim. / Bridging the semantic gap in sports video retrieval and summarization. In: Journal of Visual Communication and Image Representation. 2004 ; Vol. 15, No. 3. pp. 393-424.
@article{b2daaa7fc48d46b98ca621bc62395094,
title = "Bridging the semantic gap in sports video retrieval and summarization",
abstract = "One of the major challenges facing current media management systems and related applications is the so-called {"}semantic gap{"} between the rich meaning that a user desires and the shallowness of the content descriptions that are automatically extracted from the media. In this paper, we address the problem of bridging this gap in the sports domain. We propose a general framework for indexing and summarizing sports broadcast programs, with a high-level model of sports broadcast video using the concept of an event, defined according to domain-specific knowledge for different types of sports. Within this general framework, we develop automatic event detection algorithms that are based on automatic analysis of the visual and aural signals in the media. We have successfully applied the event detection algorithms to different types of sports including American football, baseball, Japanese sumo wrestling, and soccer. Event modeling and detection contribute to the reduction of the semantic gap by providing rudimentary semantic information obtained through media analysis. We further propose a novel approach, which makes use of independently generated rich textual metadata, to fill the gap completely through synchronization of the information-laden textual data with the basic event segments. We implemented an MPEG-7 compliant browsing system for semantic retrieval and summarization of sports video using the proposed algorithms.",
keywords = "Event detection, Semantic video analysis, Video summarization",
author = "Baoxin Li and Errico, {James H.} and Hao Pan and Ibrahim Sezan",
year = "2004",
month = "9",
doi = "10.1016/j.jvcir.2004.04.006",
language = "English (US)",
volume = "15",
pages = "393--424",
journal = "Journal of Visual Communication and Image Representation",
issn = "1047-3203",
publisher = "Academic Press Inc.",
number = "3",

}

TY - JOUR

T1 - Bridging the semantic gap in sports video retrieval and summarization

AU - Li, Baoxin

AU - Errico, James H.

AU - Pan, Hao

AU - Sezan, Ibrahim

PY - 2004/9

Y1 - 2004/9

N2 - One of the major challenges facing current media management systems and related applications is the so-called "semantic gap" between the rich meaning that a user desires and the shallowness of the content descriptions that are automatically extracted from the media. In this paper, we address the problem of bridging this gap in the sports domain. We propose a general framework for indexing and summarizing sports broadcast programs, with a high-level model of sports broadcast video using the concept of an event, defined according to domain-specific knowledge for different types of sports. Within this general framework, we develop automatic event detection algorithms that are based on automatic analysis of the visual and aural signals in the media. We have successfully applied the event detection algorithms to different types of sports including American football, baseball, Japanese sumo wrestling, and soccer. Event modeling and detection contribute to the reduction of the semantic gap by providing rudimentary semantic information obtained through media analysis. We further propose a novel approach, which makes use of independently generated rich textual metadata, to fill the gap completely through synchronization of the information-laden textual data with the basic event segments. We implemented an MPEG-7 compliant browsing system for semantic retrieval and summarization of sports video using the proposed algorithms.

AB - One of the major challenges facing current media management systems and related applications is the so-called "semantic gap" between the rich meaning that a user desires and the shallowness of the content descriptions that are automatically extracted from the media. In this paper, we address the problem of bridging this gap in the sports domain. We propose a general framework for indexing and summarizing sports broadcast programs, with a high-level model of sports broadcast video using the concept of an event, defined according to domain-specific knowledge for different types of sports. Within this general framework, we develop automatic event detection algorithms that are based on automatic analysis of the visual and aural signals in the media. We have successfully applied the event detection algorithms to different types of sports including American football, baseball, Japanese sumo wrestling, and soccer. Event modeling and detection contribute to the reduction of the semantic gap by providing rudimentary semantic information obtained through media analysis. We further propose a novel approach, which makes use of independently generated rich textual metadata, to fill the gap completely through synchronization of the information-laden textual data with the basic event segments. We implemented an MPEG-7 compliant browsing system for semantic retrieval and summarization of sports video using the proposed algorithms.

KW - Event detection

KW - Semantic video analysis

KW - Video summarization

UR - http://www.scopus.com/inward/record.url?scp=4644273979&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=4644273979&partnerID=8YFLogxK

U2 - 10.1016/j.jvcir.2004.04.006

DO - 10.1016/j.jvcir.2004.04.006

M3 - Article

VL - 15

SP - 393

EP - 424

JO - Journal of Visual Communication and Image Representation

JF - Journal of Visual Communication and Image Representation

SN - 1047-3203

IS - 3

ER -