The quality of big (geo)data

Research output: Contribution to journalArticle

62 Citations (Scopus)

Abstract

Big data is distinguished by volume, velocity, and variety. A large proportion of all big data is likely to be geographically referenced, and much may be real time. While examples can be found of high-quality big data, problems arise in meeting the normal scientific standards of replicability and rigorous sampling. These standards can be relaxed in certain stages of science, during hypothesis generation and exploration. Three methods of quality improvement and assurance are proposed. Only the third is sufficiently robust and rapid, especially in time-critical situations.

Original languageEnglish (US)
Pages (from-to)280-284
Number of pages5
JournalDialogues in Human Geography
Volume3
Issue number3
DOIs
StatePublished - Jan 1 2013
Externally publishedYes

Fingerprint

data quality
science
sampling
time
method

Keywords

  • crowdsourcing
  • geographic information
  • provenance
  • quality assurance
  • trust

ASJC Scopus subject areas

  • Geography, Planning and Development

Cite this

The quality of big (geo)data. / Goodchild, Michael.

In: Dialogues in Human Geography, Vol. 3, No. 3, 01.01.2013, p. 280-284.

Research output: Contribution to journalArticle

@article{4cf6a5f3876340cc816a507407019b53,
title = "The quality of big (geo)data",
abstract = "Big data is distinguished by volume, velocity, and variety. A large proportion of all big data is likely to be geographically referenced, and much may be real time. While examples can be found of high-quality big data, problems arise in meeting the normal scientific standards of replicability and rigorous sampling. These standards can be relaxed in certain stages of science, during hypothesis generation and exploration. Three methods of quality improvement and assurance are proposed. Only the third is sufficiently robust and rapid, especially in time-critical situations.",
keywords = "crowdsourcing, geographic information, provenance, quality assurance, trust",
author = "Michael Goodchild",
year = "2013",
month = "1",
day = "1",
doi = "10.1177/2043820613513392",
language = "English (US)",
volume = "3",
pages = "280--284",
journal = "Dialogues in Human Geography",
issn = "2043-8206",
publisher = "SAGE Publications Inc.",
number = "3",

}

TY - JOUR

T1 - The quality of big (geo)data

AU - Goodchild, Michael

PY - 2013/1/1

Y1 - 2013/1/1

N2 - Big data is distinguished by volume, velocity, and variety. A large proportion of all big data is likely to be geographically referenced, and much may be real time. While examples can be found of high-quality big data, problems arise in meeting the normal scientific standards of replicability and rigorous sampling. These standards can be relaxed in certain stages of science, during hypothesis generation and exploration. Three methods of quality improvement and assurance are proposed. Only the third is sufficiently robust and rapid, especially in time-critical situations.

AB - Big data is distinguished by volume, velocity, and variety. A large proportion of all big data is likely to be geographically referenced, and much may be real time. While examples can be found of high-quality big data, problems arise in meeting the normal scientific standards of replicability and rigorous sampling. These standards can be relaxed in certain stages of science, during hypothesis generation and exploration. Three methods of quality improvement and assurance are proposed. Only the third is sufficiently robust and rapid, especially in time-critical situations.

KW - crowdsourcing

KW - geographic information

KW - provenance

KW - quality assurance

KW - trust

UR - http://www.scopus.com/inward/record.url?scp=84975095132&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84975095132&partnerID=8YFLogxK

U2 - 10.1177/2043820613513392

DO - 10.1177/2043820613513392

M3 - Article

VL - 3

SP - 280

EP - 284

JO - Dialogues in Human Geography

JF - Dialogues in Human Geography

SN - 2043-8206

IS - 3

ER -