Longitudinal global positioning system travel data and breach of privacy via enhanced spatial and demographic analysis

Vetri Venthan Elango, Sara Khoeini, Yanzhi Xu, Randall Guensler

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Longitudinal Global Positioning System (GPS) travel data provide a wealth of information related to travel behavior and on-road vehicle behavior that is very valuable to researchers. Sharing the data publicly allows researchers to explore the data and create new knowledge beyond the initial research objectives. However, if any data are to be used outside a secure server, the data must be processed in such a manner that ensures that the confidentiality of the data will not be breached. High-resolution GPS data (e.g., second-by-second speed and location information), when associated with the individual households or drivers, compromise privacy and have a significant potential to harm human subjects. This paper explores how data from the Commute Atlanta study in Georgia could be processed to make it useful to researchers while participants' privacy is protected. The research developed and assessed methodologies designed to identify the individual participant's home location from processed data and then tested analytical data sets for breach of privacy. The research effort found that the home location could be identified to within reasonably small neighborhoods; when the household demographic information was included in the data sets (which was necessary for researchers), exact households could be identified. Although some new data-processing approaches might be used to eliminate privacy concerns, until such systems are developed and proved to be unbreachable through rigorous analysis, the Georgia Institute of Technology team has determined that researchers should access the high-resolution data in controlled secure labs and that the data sets should not be made public without additional efforts to ensure that home locations cannot be identified when external data sources are leveraged in the analyses.

Original languageEnglish (US)
Pages (from-to)86-98
Number of pages13
JournalTransportation Research Record
Issue number2354
DOIs
StatePublished - Oct 31 2013
Externally publishedYes

Fingerprint

Global positioning system
Servers

ASJC Scopus subject areas

  • Civil and Structural Engineering
  • Mechanical Engineering

Cite this

Longitudinal global positioning system travel data and breach of privacy via enhanced spatial and demographic analysis. / Elango, Vetri Venthan; Khoeini, Sara; Xu, Yanzhi; Guensler, Randall.

In: Transportation Research Record, No. 2354, 31.10.2013, p. 86-98.

Research output: Contribution to journalArticle

@article{39fca24b79ee4b10a24437a86bd03561,
title = "Longitudinal global positioning system travel data and breach of privacy via enhanced spatial and demographic analysis",
abstract = "Longitudinal Global Positioning System (GPS) travel data provide a wealth of information related to travel behavior and on-road vehicle behavior that is very valuable to researchers. Sharing the data publicly allows researchers to explore the data and create new knowledge beyond the initial research objectives. However, if any data are to be used outside a secure server, the data must be processed in such a manner that ensures that the confidentiality of the data will not be breached. High-resolution GPS data (e.g., second-by-second speed and location information), when associated with the individual households or drivers, compromise privacy and have a significant potential to harm human subjects. This paper explores how data from the Commute Atlanta study in Georgia could be processed to make it useful to researchers while participants' privacy is protected. The research developed and assessed methodologies designed to identify the individual participant's home location from processed data and then tested analytical data sets for breach of privacy. The research effort found that the home location could be identified to within reasonably small neighborhoods; when the household demographic information was included in the data sets (which was necessary for researchers), exact households could be identified. Although some new data-processing approaches might be used to eliminate privacy concerns, until such systems are developed and proved to be unbreachable through rigorous analysis, the Georgia Institute of Technology team has determined that researchers should access the high-resolution data in controlled secure labs and that the data sets should not be made public without additional efforts to ensure that home locations cannot be identified when external data sources are leveraged in the analyses.",
author = "Elango, {Vetri Venthan} and Sara Khoeini and Yanzhi Xu and Randall Guensler",
year = "2013",
month = "10",
day = "31",
doi = "10.3141/2354-09",
language = "English (US)",
pages = "86--98",
journal = "Transportation Research Record",
issn = "0361-1981",
publisher = "US National Research Council",
number = "2354",

}

TY - JOUR

T1 - Longitudinal global positioning system travel data and breach of privacy via enhanced spatial and demographic analysis

AU - Elango, Vetri Venthan

AU - Khoeini, Sara

AU - Xu, Yanzhi

AU - Guensler, Randall

PY - 2013/10/31

Y1 - 2013/10/31

N2 - Longitudinal Global Positioning System (GPS) travel data provide a wealth of information related to travel behavior and on-road vehicle behavior that is very valuable to researchers. Sharing the data publicly allows researchers to explore the data and create new knowledge beyond the initial research objectives. However, if any data are to be used outside a secure server, the data must be processed in such a manner that ensures that the confidentiality of the data will not be breached. High-resolution GPS data (e.g., second-by-second speed and location information), when associated with the individual households or drivers, compromise privacy and have a significant potential to harm human subjects. This paper explores how data from the Commute Atlanta study in Georgia could be processed to make it useful to researchers while participants' privacy is protected. The research developed and assessed methodologies designed to identify the individual participant's home location from processed data and then tested analytical data sets for breach of privacy. The research effort found that the home location could be identified to within reasonably small neighborhoods; when the household demographic information was included in the data sets (which was necessary for researchers), exact households could be identified. Although some new data-processing approaches might be used to eliminate privacy concerns, until such systems are developed and proved to be unbreachable through rigorous analysis, the Georgia Institute of Technology team has determined that researchers should access the high-resolution data in controlled secure labs and that the data sets should not be made public without additional efforts to ensure that home locations cannot be identified when external data sources are leveraged in the analyses.

AB - Longitudinal Global Positioning System (GPS) travel data provide a wealth of information related to travel behavior and on-road vehicle behavior that is very valuable to researchers. Sharing the data publicly allows researchers to explore the data and create new knowledge beyond the initial research objectives. However, if any data are to be used outside a secure server, the data must be processed in such a manner that ensures that the confidentiality of the data will not be breached. High-resolution GPS data (e.g., second-by-second speed and location information), when associated with the individual households or drivers, compromise privacy and have a significant potential to harm human subjects. This paper explores how data from the Commute Atlanta study in Georgia could be processed to make it useful to researchers while participants' privacy is protected. The research developed and assessed methodologies designed to identify the individual participant's home location from processed data and then tested analytical data sets for breach of privacy. The research effort found that the home location could be identified to within reasonably small neighborhoods; when the household demographic information was included in the data sets (which was necessary for researchers), exact households could be identified. Although some new data-processing approaches might be used to eliminate privacy concerns, until such systems are developed and proved to be unbreachable through rigorous analysis, the Georgia Institute of Technology team has determined that researchers should access the high-resolution data in controlled secure labs and that the data sets should not be made public without additional efforts to ensure that home locations cannot be identified when external data sources are leveraged in the analyses.

UR - http://www.scopus.com/inward/record.url?scp=84886536601&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84886536601&partnerID=8YFLogxK

U2 - 10.3141/2354-09

DO - 10.3141/2354-09

M3 - Article

AN - SCOPUS:84886536601

SP - 86

EP - 98

JO - Transportation Research Record

JF - Transportation Research Record

SN - 0361-1981

IS - 2354

ER -