GeoNat v1.0: A dataset for natural feature mapping with artificial intelligence and supervised learning

Samantha T. Arundel; Wenwen Li; Sizhe Wang

doi:10.1111/tgis.12633

GeoNat v1.0: A dataset for natural feature mapping with artificial intelligence and supervised learning

Samantha T. Arundel, Wenwen Li, Sizhe Wang

Geographical Sciences and Urban Planning, School of (SGSUP)

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Machine learning allows “the machine” to deduce the complex and sometimes unrecognized rules governing spatial systems, particularly topographic mapping, by exposing it to the end product. Often, the obstacle to this approach is the acquisition of many good and labeled training examples of the desired result. Such is the case with most types of natural features. To address such limitations, this research introduces GeoNat v1.0, a natural feature dataset, used to support artificial intelligence-based mapping and automated detection of natural features under a supervised learning paradigm. The dataset was created by randomly selecting points from the U.S. Geological Survey’s Geographic Names Information System and includes approximately 200 examples each of 10 classes of natural features. Resulting data were tested in an object-detection problem using a region-based convolutional neural network. The object-detection tests resulted in a 62% mean average precision as baseline results. Major challenges in developing training data in the geospatial domain, such as scale and geographical representativeness, are addressed in this article. We hope that the resulting dataset will be useful for a variety of applications and shed light on training data collection and labeling in the geospatial artificial intelligence domain.

Original language	English (US)
Pages (from-to)	556-572
Number of pages	17
Journal	Transactions in GIS
Volume	24
Issue number	3
DOIs	https://doi.org/10.1111/tgis.12633
State	Published - Jun 1 2020

ASJC Scopus subject areas

General Earth and Planetary Sciences

Access to Document

10.1111/tgis.12633

Cite this

@article{29550271ca354daf8eb743a4fe7657d4,

title = "GeoNat v1.0: A dataset for natural feature mapping with artificial intelligence and supervised learning",

abstract = "Machine learning allows “the machine” to deduce the complex and sometimes unrecognized rules governing spatial systems, particularly topographic mapping, by exposing it to the end product. Often, the obstacle to this approach is the acquisition of many good and labeled training examples of the desired result. Such is the case with most types of natural features. To address such limitations, this research introduces GeoNat v1.0, a natural feature dataset, used to support artificial intelligence-based mapping and automated detection of natural features under a supervised learning paradigm. The dataset was created by randomly selecting points from the U.S. Geological Survey{\textquoteright}s Geographic Names Information System and includes approximately 200 examples each of 10 classes of natural features. Resulting data were tested in an object-detection problem using a region-based convolutional neural network. The object-detection tests resulted in a 62% mean average precision as baseline results. Major challenges in developing training data in the geospatial domain, such as scale and geographical representativeness, are addressed in this article. We hope that the resulting dataset will be useful for a variety of applications and shed light on training data collection and labeling in the geospatial artificial intelligence domain.",

author = "Arundel, {Samantha T.} and Wenwen Li and Sizhe Wang",

note = "Funding Information: This research was supported in part by the National Science Foundation, Grant Nos. 1853864, 1455349, and 1937908. Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government. Publisher Copyright: {\textcopyright} 2020 This article is a U.S. Government work and is in the public domain in the USA.",

year = "2020",

month = jun,

day = "1",

doi = "10.1111/tgis.12633",

language = "English (US)",

volume = "24",

pages = "556--572",

journal = "Transactions in GIS",

issn = "1361-1682",

publisher = "Wiley-Blackwell",

number = "3",

}

TY - JOUR

T1 - GeoNat v1.0

T2 - A dataset for natural feature mapping with artificial intelligence and supervised learning

AU - Arundel, Samantha T.

AU - Li, Wenwen

AU - Wang, Sizhe

N1 - Funding Information: This research was supported in part by the National Science Foundation, Grant Nos. 1853864, 1455349, and 1937908. Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government. Publisher Copyright: © 2020 This article is a U.S. Government work and is in the public domain in the USA.

PY - 2020/6/1

Y1 - 2020/6/1

N2 - Machine learning allows “the machine” to deduce the complex and sometimes unrecognized rules governing spatial systems, particularly topographic mapping, by exposing it to the end product. Often, the obstacle to this approach is the acquisition of many good and labeled training examples of the desired result. Such is the case with most types of natural features. To address such limitations, this research introduces GeoNat v1.0, a natural feature dataset, used to support artificial intelligence-based mapping and automated detection of natural features under a supervised learning paradigm. The dataset was created by randomly selecting points from the U.S. Geological Survey’s Geographic Names Information System and includes approximately 200 examples each of 10 classes of natural features. Resulting data were tested in an object-detection problem using a region-based convolutional neural network. The object-detection tests resulted in a 62% mean average precision as baseline results. Major challenges in developing training data in the geospatial domain, such as scale and geographical representativeness, are addressed in this article. We hope that the resulting dataset will be useful for a variety of applications and shed light on training data collection and labeling in the geospatial artificial intelligence domain.

AB - Machine learning allows “the machine” to deduce the complex and sometimes unrecognized rules governing spatial systems, particularly topographic mapping, by exposing it to the end product. Often, the obstacle to this approach is the acquisition of many good and labeled training examples of the desired result. Such is the case with most types of natural features. To address such limitations, this research introduces GeoNat v1.0, a natural feature dataset, used to support artificial intelligence-based mapping and automated detection of natural features under a supervised learning paradigm. The dataset was created by randomly selecting points from the U.S. Geological Survey’s Geographic Names Information System and includes approximately 200 examples each of 10 classes of natural features. Resulting data were tested in an object-detection problem using a region-based convolutional neural network. The object-detection tests resulted in a 62% mean average precision as baseline results. Major challenges in developing training data in the geospatial domain, such as scale and geographical representativeness, are addressed in this article. We hope that the resulting dataset will be useful for a variety of applications and shed light on training data collection and labeling in the geospatial artificial intelligence domain.

UR - http://www.scopus.com/inward/record.url?scp=85085115832&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85085115832&partnerID=8YFLogxK

U2 - 10.1111/tgis.12633

DO - 10.1111/tgis.12633

M3 - Article

AN - SCOPUS:85085115832

SN - 1361-1682

VL - 24

SP - 556

EP - 572

JO - Transactions in GIS

JF - Transactions in GIS

IS - 3

ER -

GeoNat v1.0: A dataset for natural feature mapping with artificial intelligence and supervised learning

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this