A MapReduce algorithm to create contiguity weights for spatial analysis of Big data

Xun Li, WenWen Li, Luc Anselin, Sergio Rey, Julia Koschinsky

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Scopus citations

Abstract

Spatial analysis of Big data is a key component of Cyber- GIS. However, how to utilize existing cyberinfrastructure (e.g. large computing clusters) to perform parallel and distributed spatial analysis on Big data remains a huge challenge. Problems such as efficient spatial weights creation, spatial statistics and spatial regression of Big data still need investigation. In this research, we propose a MapReduce algorithm for creating contiguity-based spatial weights. This algorithm provides the ability to create spatial weights from very large spatial datasets efficiently by using computing resources that are organized in the Hadoop framework. It works in the paradigm of MapReduce: mappers are distributed in computing clusters to find contiguous neighbors in parallel, then reducers collect the results and generate the weights matrix. To test the performance of this algorithm, we design experiment to create contiguity-based weights matrix from artificial spatial data with up to 190 million polygons using Amazon's Hadoop framework called Elastic MapReduce. The experiment demonstrates the scalability of this parallel algorithm which utilizes large computing clusters to solve the problem of creating contiguity weights on Big data.

Original languageEnglish (US)
Title of host publicationProceedings of the 3rd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2014
EditorsVarun Chandola, Ranga Raju Vatsavai
PublisherAssociation for Computing Machinery
Pages50-53
Number of pages4
ISBN (Electronic)9781450331326
DOIs
StatePublished - Nov 4 2014
Event3rd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2014 - Dallas, United States
Duration: Nov 4 2014 → …

Publication series

NameProceedings of the 3rd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2014

Other

Other3rd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, BigSpatial 2014
Country/TerritoryUnited States
CityDallas
Period11/4/14 → …

Keywords

  • Big data
  • Mapreduce
  • Spatial weights

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'A MapReduce algorithm to create contiguity weights for spatial analysis of Big data'. Together they form a unique fingerprint.

Cite this