Intelligent polar cyberinfrastructure

enabling semantic search in geospatial metadata catalogue to support polar data discovery

WenWen Li, Vidit Bhatia, Kai Cao

Research output: Contribution to journal › Article

7 Citations (Scopus)

Abstract

Polar regions have garnered substantial research attention in recent years because they are key drivers of the Earth’s climate, a source of rich mineral resources, and the home of a variety of marine life. Nevertheless, global warming over the past century is pushing the polar systems towards a tipping point: the systems are at high risk from melting snow and sea ice cover, permafrost thawing, and acidification of the Arctic oceans. To increase understanding of the polar environment, the National Science Foundation established a Polar Cyberinfrastructure (CI) program, aimed at utilizing advanced software architecture to support polar data analysis and decision-making. At the center of this Polar CI research are data resources and data discovery components that facilitate the search and retrieval of polar data. This paper reports our development of a semantic search tool that supports the intelligent discovery of polar datasets. The tool is built on latent semantic analysis techniques, which improve search performance by identifying hidden semantic associations between the terms used in the metadata of different datasets. The tool is implemented using an object-oriented design pattern and has been successfully integrated into a popular open-source metadata catalog, where it adds semantic search support. A semantic matrix is maintained persistently within the catalog to store the semantic associations. A dynamic update mechanism was also developed so that the semantics are updated automatically whenever metadata are loaded into or removed from the catalog. We explored the effect of rank reduction on the effectiveness of this semantic search module and demonstrated that it performs better than traditional search techniques.
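The latent semantic analysis idea summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the five toy records, the function names, the choice of rank k = 2, and the use of power iteration to obtain the leading singular vectors are all assumptions made for illustration. The sketch builds a term-by-record matrix, keeps only its top k singular directions (the rank reduction step), and scores records against a query by cosine similarity in that reduced space.

```python
import math

# Hypothetical metadata snippets standing in for catalog records.
RECORDS = [
    "sea ice arctic",
    "sea ice extent",
    "arctic extent",             # never mentions "ice"
    "penguin colony antarctic",
    "penguin antarctic",
]


def term_document_matrix(records):
    """Return (sorted vocabulary, term-by-record count matrix)."""
    vocab = sorted({word for record in records for word in record.split()})
    index = {word: i for i, word in enumerate(vocab)}
    matrix = [[0.0] * len(records) for _ in vocab]
    for j, record in enumerate(records):
        for word in record.split():
            matrix[index[word]][j] += 1.0
    return vocab, matrix


def top_singular_triplets(a, k, iterations=500):
    """Top-k (sigma, u, v) of matrix a, by power iteration with deflation on a^T a."""
    n_terms, n_docs = len(a), len(a[0])
    gram = [
        [sum(a[t][i] * a[t][j] for t in range(n_terms)) for j in range(n_docs)]
        for i in range(n_docs)
    ]
    triplets = []
    for _rank in range(k):
        v = [1.0 / math.sqrt(n_docs)] * n_docs
        for _step in range(iterations):
            w = [sum(gram[i][j] * v[j] for j in range(n_docs)) for i in range(n_docs)]
            norm = math.sqrt(sum(x * x for x in w))
            v = [x / norm for x in w]
        # Rayleigh quotient gives the eigenvalue of a^T a, i.e. sigma squared.
        eigenvalue = sum(
            v[i] * gram[i][j] * v[j] for i in range(n_docs) for j in range(n_docs)
        )
        sigma = math.sqrt(eigenvalue)
        u = [sum(a[t][j] * v[j] for j in range(n_docs)) / sigma for t in range(n_terms)]
        triplets.append((sigma, u, v))
        # Deflate so the next pass converges to the next singular direction.
        gram = [
            [gram[i][j] - eigenvalue * v[i] * v[j] for j in range(n_docs)]
            for i in range(n_docs)
        ]
    return triplets


def project(vector, triplets):
    """Coordinates of a term-space vector in the k-dimensional latent space."""
    return [sum(u[t] * vector[t] for t in range(len(vector))) for _, u, _ in triplets]


def cosine(x, y):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    norm_x = math.sqrt(sum(value * value for value in x))
    norm_y = math.sqrt(sum(value * value for value in y))
    if norm_x == 0.0 or norm_y == 0.0:
        return 0.0
    return sum(p * q for p, q in zip(x, y)) / (norm_x * norm_y)


def semantic_search(query, records, k=2):
    """Score every record against the query in the rank-k latent space."""
    vocab, a = term_document_matrix(records)
    triplets = top_singular_triplets(a, k)
    query_words = query.split()
    query_latent = project([float(query_words.count(w)) for w in vocab], triplets)
    scores = []
    for j in range(len(records)):
        column = [row[j] for row in a]
        scores.append(cosine(query_latent, project(column, triplets)))
    return scores


if __name__ == "__main__":
    for record, score in zip(RECORDS, semantic_search("ice", RECORDS)):
        print(f"{score:6.3f}  {record}")
```

In this toy corpus a plain keyword match for "ice" misses the record "arctic extent", whereas the rank-2 projection scores it close to the records that do contain the word, because all three share co-occurring terms; the two Antarctic records score near zero. The outcome depends on the chosen rank, which is the parameter the abstract reports exploring.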

Original language: English (US)
Pages (from-to): 111-123
Number of pages: 13
Journal: Earth Science Informatics
Volume: 8
Issue number: 1
DOIs: 10.1007/s12145-014-0185-z
State: Published - 2015

Fingerprint

metadata
software
thawing
terminology
polar region
mineral resource
ice cover
permafrost
acidification
sea ice
global warming
snow
melting
decision making
matrix
climate
resource

Keywords

  • Antarctic
  • Arctic
  • Catalog
  • Discovery
  • Metadata
  • Polar
  • Spatial data infrastructure

ASJC Scopus subject areas

  • Earth and Planetary Sciences(all)

Cite this

Intelligent polar cyberinfrastructure : enabling semantic search in geospatial metadata catalogue to support polar data discovery. / Li, WenWen; Bhatia, Vidit; Cao, Kai.

In: Earth Science Informatics, Vol. 8, No. 1, 2015, p. 111-123.

@article{a833aeff466e4214bb3b37e692c7e588,
title = "Intelligent polar cyberinfrastructure: enabling semantic search in geospatial metadata catalogue to support polar data discovery",
abstract = "Polar regions have garnered substantial research attention in recent years because they are key drivers of the Earth’s climate, a source of rich mineral resources, and the home of a variety of marine life. Nevertheless, global warming over the past century is pushing the polar systems towards a tipping point: the systems are at high-risk from melting snow and sea ice covers, permafrost thawing, and acidification of the Arctic oceans. To increase understanding of the polar environment, the National Science Foundation established a Polar Cyberinfrastructure (CI) program, aimed at utilizing advanced software architecture to support polar data analysis and decision-making. At the center of this Polar CI research are data resources and data discovery components that facilitate the search and retrieval of polar data. This paper reports our development of a semantic search tool that supports the intelligent discovery of polar datasets. This tool is built on latent semantic analysis techniques, which improves search performance by identifying hidden semantic associations between terminologies used in the various datasets’ metadata. The software tool is implemented using an object-oriented design pattern and has been successfully integrated into a popular open source metadata catalog as a new semantic search support. A semantic matrix is maintained persistently within the catalogue to store the semantic associations. A dynamic update mechanism was also developed to allow automated update of semantics once more metadata are loaded into or removed from the catalog. We explored the effects of rank reduction to the effectiveness of this semantic search module and demonstrated its better performance than the traditional search techniques.",
keywords = "Antarctic, Arctic, Catalog, Discovery, Metadata, Polar, Spatial data infrastructure",
author = "WenWen Li and Vidit Bhatia and Kai Cao",
year = "2015",
doi = "10.1007/s12145-014-0185-z",
language = "English (US)",
volume = "8",
pages = "111--123",
journal = "Earth Science Informatics",
issn = "1865-0473",
publisher = "Springer Verlag",
number = "1",
}

TY - JOUR

T1 - Intelligent polar cyberinfrastructure

T2 - enabling semantic search in geospatial metadata catalogue to support polar data discovery

AU - Li, WenWen

AU - Bhatia, Vidit

AU - Cao, Kai

PY - 2015

Y1 - 2015

N2 - Polar regions have garnered substantial research attention in recent years because they are key drivers of the Earth’s climate, a source of rich mineral resources, and the home of a variety of marine life. Nevertheless, global warming over the past century is pushing the polar systems towards a tipping point: the systems are at high-risk from melting snow and sea ice covers, permafrost thawing, and acidification of the Arctic oceans. To increase understanding of the polar environment, the National Science Foundation established a Polar Cyberinfrastructure (CI) program, aimed at utilizing advanced software architecture to support polar data analysis and decision-making. At the center of this Polar CI research are data resources and data discovery components that facilitate the search and retrieval of polar data. This paper reports our development of a semantic search tool that supports the intelligent discovery of polar datasets. This tool is built on latent semantic analysis techniques, which improves search performance by identifying hidden semantic associations between terminologies used in the various datasets’ metadata. The software tool is implemented using an object-oriented design pattern and has been successfully integrated into a popular open source metadata catalog as a new semantic search support. A semantic matrix is maintained persistently within the catalogue to store the semantic associations. A dynamic update mechanism was also developed to allow automated update of semantics once more metadata are loaded into or removed from the catalog. We explored the effects of rank reduction to the effectiveness of this semantic search module and demonstrated its better performance than the traditional search techniques.

AB - Polar regions have garnered substantial research attention in recent years because they are key drivers of the Earth’s climate, a source of rich mineral resources, and the home of a variety of marine life. Nevertheless, global warming over the past century is pushing the polar systems towards a tipping point: the systems are at high-risk from melting snow and sea ice covers, permafrost thawing, and acidification of the Arctic oceans. To increase understanding of the polar environment, the National Science Foundation established a Polar Cyberinfrastructure (CI) program, aimed at utilizing advanced software architecture to support polar data analysis and decision-making. At the center of this Polar CI research are data resources and data discovery components that facilitate the search and retrieval of polar data. This paper reports our development of a semantic search tool that supports the intelligent discovery of polar datasets. This tool is built on latent semantic analysis techniques, which improves search performance by identifying hidden semantic associations between terminologies used in the various datasets’ metadata. The software tool is implemented using an object-oriented design pattern and has been successfully integrated into a popular open source metadata catalog as a new semantic search support. A semantic matrix is maintained persistently within the catalogue to store the semantic associations. A dynamic update mechanism was also developed to allow automated update of semantics once more metadata are loaded into or removed from the catalog. We explored the effects of rank reduction to the effectiveness of this semantic search module and demonstrated its better performance than the traditional search techniques.

KW - Antarctic

KW - Arctic

KW - Catalog

KW - Discovery

KW - Metadata

KW - Polar

KW - Spatial data infrastructure

UR - http://www.scopus.com/inward/record.url?scp=84925503468&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84925503468&partnerID=8YFLogxK

U2 - 10.1007/s12145-014-0185-z

DO - 10.1007/s12145-014-0185-z

M3 - Article

VL - 8

SP - 111

EP - 123

JO - Earth Science Informatics

JF - Earth Science Informatics

SN - 1865-0473

IS - 1

ER -