Spatiotemporal data representation and its effect on the performance of spatial analysis in a cyberinfrastructure environment - A case study with raster zonal analysis

Miaomiao Song, WenWen Li, Bin Zhou, Ting Lei

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

This paper conducts a systematic research to uncover the impact of spatiotemporal data representation on the performance of raster analysis in a cyberinfrastructure environment. Two broad categories of data organization based on file system and database system are presented and discussed. In particular, these include five specific approaches of storing time-series raster data involving tiling (partitioning the entire image file into non-overlapping pieces), stacking (compositing multiple single-band images into a large multi-band image) techniques, and a combination of tiling and stacking in files or database tables. Raster zonal statistics, which have been used to support a variety of GIS applications ranging from watershed analysis to summarizing forest products, is selected as an example raster analysis algorithm. A series of experiments were conducted to evaluate the performance of the five proposed approaches using different spatial and spatiotemporal queries. The results show that spatiotemporal data representation, though largely ignored in the design of a cyberinfrastructure system, does play an important role in system performance. Specifically, tiling techniques with the support of spatial database outperforms all other approaches, especially those adopting stacking techniques in the data organization. For illustration, the best raster analysis solution was implemented and integrated into an operational cyberinfrastructure in the context of providing spatial decision support in polar science. We expect this work to offer insights to develop efficient cyberinfrastructure modules to support spatial analysis through a thorough analysis of spatiotemporal data representation.

Original languageEnglish (US)
Pages (from-to)11-21
Number of pages11
JournalComputers and Geosciences
Volume87
DOIs
StatePublished - Feb 1 2016

Fingerprint

raster
spatial analysis
stacking
Watersheds
Geographic information systems
Time series
Statistics
forest product
effect
analysis
partitioning
Experiments
GIS
watershed
time series
experiment

Keywords

  • Performance
  • Polar cyberinfrastructure
  • Polar science
  • Spatiotemporal data representation
  • Web service

ASJC Scopus subject areas

  • Information Systems
  • Computers in Earth Sciences

Cite this

@article{cba49e4922614cdc857a5174e9de5adc,
title = "Spatiotemporal data representation and its effect on the performance of spatial analysis in a cyberinfrastructure environment - A case study with raster zonal analysis",
abstract = "This paper conducts a systematic research to uncover the impact of spatiotemporal data representation on the performance of raster analysis in a cyberinfrastructure environment. Two broad categories of data organization based on file system and database system are presented and discussed. In particular, these include five specific approaches of storing time-series raster data involving tiling (partitioning the entire image file into non-overlapping pieces), stacking (compositing multiple single-band images into a large multi-band image) techniques, and a combination of tiling and stacking in files or database tables. Raster zonal statistics, which have been used to support a variety of GIS applications ranging from watershed analysis to summarizing forest products, is selected as an example raster analysis algorithm. A series of experiments were conducted to evaluate the performance of the five proposed approaches using different spatial and spatiotemporal queries. The results show that spatiotemporal data representation, though largely ignored in the design of a cyberinfrastructure system, does play an important role in system performance. Specifically, tiling techniques with the support of spatial database outperforms all other approaches, especially those adopting stacking techniques in the data organization. For illustration, the best raster analysis solution was implemented and integrated into an operational cyberinfrastructure in the context of providing spatial decision support in polar science. We expect this work to offer insights to develop efficient cyberinfrastructure modules to support spatial analysis through a thorough analysis of spatiotemporal data representation.",
keywords = "Performance, Polar cyberinfrastructure, Polar science, Spatiotemporal data representation, Web service",
author = "Miaomiao Song and WenWen Li and Bin Zhou and Ting Lei",
year = "2016",
month = "2",
day = "1",
doi = "10.1016/j.cageo.2015.11.005",
language = "English (US)",
volume = "87",
pages = "11--21",
journal = "Computers and Geosciences",
issn = "0098-3004",
publisher = "Elsevier Limited",

}

TY - JOUR

T1 - Spatiotemporal data representation and its effect on the performance of spatial analysis in a cyberinfrastructure environment - A case study with raster zonal analysis

AU - Song, Miaomiao

AU - Li, WenWen

AU - Zhou, Bin

AU - Lei, Ting

PY - 2016/2/1

Y1 - 2016/2/1

N2 - This paper conducts a systematic research to uncover the impact of spatiotemporal data representation on the performance of raster analysis in a cyberinfrastructure environment. Two broad categories of data organization based on file system and database system are presented and discussed. In particular, these include five specific approaches of storing time-series raster data involving tiling (partitioning the entire image file into non-overlapping pieces), stacking (compositing multiple single-band images into a large multi-band image) techniques, and a combination of tiling and stacking in files or database tables. Raster zonal statistics, which have been used to support a variety of GIS applications ranging from watershed analysis to summarizing forest products, is selected as an example raster analysis algorithm. A series of experiments were conducted to evaluate the performance of the five proposed approaches using different spatial and spatiotemporal queries. The results show that spatiotemporal data representation, though largely ignored in the design of a cyberinfrastructure system, does play an important role in system performance. Specifically, tiling techniques with the support of spatial database outperforms all other approaches, especially those adopting stacking techniques in the data organization. For illustration, the best raster analysis solution was implemented and integrated into an operational cyberinfrastructure in the context of providing spatial decision support in polar science. We expect this work to offer insights to develop efficient cyberinfrastructure modules to support spatial analysis through a thorough analysis of spatiotemporal data representation.

AB - This paper conducts a systematic research to uncover the impact of spatiotemporal data representation on the performance of raster analysis in a cyberinfrastructure environment. Two broad categories of data organization based on file system and database system are presented and discussed. In particular, these include five specific approaches of storing time-series raster data involving tiling (partitioning the entire image file into non-overlapping pieces), stacking (compositing multiple single-band images into a large multi-band image) techniques, and a combination of tiling and stacking in files or database tables. Raster zonal statistics, which have been used to support a variety of GIS applications ranging from watershed analysis to summarizing forest products, is selected as an example raster analysis algorithm. A series of experiments were conducted to evaluate the performance of the five proposed approaches using different spatial and spatiotemporal queries. The results show that spatiotemporal data representation, though largely ignored in the design of a cyberinfrastructure system, does play an important role in system performance. Specifically, tiling techniques with the support of spatial database outperforms all other approaches, especially those adopting stacking techniques in the data organization. For illustration, the best raster analysis solution was implemented and integrated into an operational cyberinfrastructure in the context of providing spatial decision support in polar science. We expect this work to offer insights to develop efficient cyberinfrastructure modules to support spatial analysis through a thorough analysis of spatiotemporal data representation.

KW - Performance

KW - Polar cyberinfrastructure

KW - Polar science

KW - Spatiotemporal data representation

KW - Web service

UR - http://www.scopus.com/inward/record.url?scp=84947433086&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84947433086&partnerID=8YFLogxK

U2 - 10.1016/j.cageo.2015.11.005

DO - 10.1016/j.cageo.2015.11.005

M3 - Article

VL - 87

SP - 11

EP - 21

JO - Computers and Geosciences

JF - Computers and Geosciences

SN - 0098-3004

ER -