Development of commute mode choice model by integrating actively and passively collected travel data

Ruone Zhang, Xin Ye, Ke Wang, Dongjin Li, Jiayu Zhu

Research output: Contribution to journalArticle

Abstract

Travel data collection, which is necessary for travel demand modeling, is always of great concern to modelers due to its huge cost and effort when a large sample is required to achieve satisfactory model precisions. In this paper, travel data collected based on a survey questionnaire and travelers' active participation are called actively collected data (ACD). It is difficult to guarantee absolute randomness and unbiasedness in a sample when the ACD are collected due to self-selection issues. The aim of this study is to improve the model precision at low cost by using passively collected data (PCD), such as in-vehicle GPS data and transit smart card data, to release sample size restriction and reduce sampling bias of ACD in a commute mode choice model. In an empirical study, a multinomial-logit-based joint model is developed for commute mode choice by integrating ACD and PCD based on the choice-based sampling theory. A comprehensive set of explanatory variables are specified through data integration. Both simulation and empirical results show great improvement in coefficient precisions in the proposed joint model, relative to those in the ACD model and PCD model. In this study, ACD and PCD samples of Shanghai are integrated in the joint model so that several significantly influential level-of-service attributes are identified for auto, rail, and bus modes, and their impacts on commute mode choice probabilities are quantified. The findings can aid in better evaluating the program to improve the existing transit system.

Original languageEnglish (US)
Article number2730
JournalSustainability (Switzerland)
Volume11
Issue number10
DOIs
StatePublished - May 1 2019
Externally publishedYes

Fingerprint

travel
Data structures
Sampling
Smart cards
Data integration
Rails
Global positioning system
Costs
sampling theory
travel demand
sampling bias
costs
questionnaire survey
cost
guarantee
GPS
simulation
participation
questionnaire
demand

Keywords

  • Actively collected data
  • Choice-based sampling
  • Commute mode choice
  • Data integration
  • Multinomial logit model
  • Passively collected data

ASJC Scopus subject areas

  • Geography, Planning and Development
  • Renewable Energy, Sustainability and the Environment
  • Management, Monitoring, Policy and Law

Cite this

Development of commute mode choice model by integrating actively and passively collected travel data. / Zhang, Ruone; Ye, Xin; Wang, Ke; Li, Dongjin; Zhu, Jiayu.

In: Sustainability (Switzerland), Vol. 11, No. 10, 2730, 01.05.2019.

Research output: Contribution to journalArticle

@article{fe1230f040df45f682ab3c6d96f7bacb,
title = "Development of commute mode choice model by integrating actively and passively collected travel data",
abstract = "Travel data collection, which is necessary for travel demand modeling, is always of great concern to modelers due to its huge cost and effort when a large sample is required to achieve satisfactory model precisions. In this paper, travel data collected based on a survey questionnaire and travelers' active participation are called actively collected data (ACD). It is difficult to guarantee absolute randomness and unbiasedness in a sample when the ACD are collected due to self-selection issues. The aim of this study is to improve the model precision at low cost by using passively collected data (PCD), such as in-vehicle GPS data and transit smart card data, to release sample size restriction and reduce sampling bias of ACD in a commute mode choice model. In an empirical study, a multinomial-logit-based joint model is developed for commute mode choice by integrating ACD and PCD based on the choice-based sampling theory. A comprehensive set of explanatory variables are specified through data integration. Both simulation and empirical results show great improvement in coefficient precisions in the proposed joint model, relative to those in the ACD model and PCD model. In this study, ACD and PCD samples of Shanghai are integrated in the joint model so that several significantly influential level-of-service attributes are identified for auto, rail, and bus modes, and their impacts on commute mode choice probabilities are quantified. The findings can aid in better evaluating the program to improve the existing transit system.",
keywords = "Actively collected data, Choice-based sampling, Commute mode choice, Data integration, Multinomial logit model, Passively collected data",
author = "Ruone Zhang and Xin Ye and Ke Wang and Dongjin Li and Jiayu Zhu",
year = "2019",
month = "5",
day = "1",
doi = "10.3390/su11102730",
language = "English (US)",
volume = "11",
journal = "Sustainability",
issn = "2071-1050",
publisher = "Mary Ann Liebert Inc.",
number = "10",

}

TY - JOUR

T1 - Development of commute mode choice model by integrating actively and passively collected travel data

AU - Zhang, Ruone

AU - Ye, Xin

AU - Wang, Ke

AU - Li, Dongjin

AU - Zhu, Jiayu

PY - 2019/5/1

Y1 - 2019/5/1

N2 - Travel data collection, which is necessary for travel demand modeling, is always of great concern to modelers due to its huge cost and effort when a large sample is required to achieve satisfactory model precisions. In this paper, travel data collected based on a survey questionnaire and travelers' active participation are called actively collected data (ACD). It is difficult to guarantee absolute randomness and unbiasedness in a sample when the ACD are collected due to self-selection issues. The aim of this study is to improve the model precision at low cost by using passively collected data (PCD), such as in-vehicle GPS data and transit smart card data, to release sample size restriction and reduce sampling bias of ACD in a commute mode choice model. In an empirical study, a multinomial-logit-based joint model is developed for commute mode choice by integrating ACD and PCD based on the choice-based sampling theory. A comprehensive set of explanatory variables are specified through data integration. Both simulation and empirical results show great improvement in coefficient precisions in the proposed joint model, relative to those in the ACD model and PCD model. In this study, ACD and PCD samples of Shanghai are integrated in the joint model so that several significantly influential level-of-service attributes are identified for auto, rail, and bus modes, and their impacts on commute mode choice probabilities are quantified. The findings can aid in better evaluating the program to improve the existing transit system.

AB - Travel data collection, which is necessary for travel demand modeling, is always of great concern to modelers due to its huge cost and effort when a large sample is required to achieve satisfactory model precisions. In this paper, travel data collected based on a survey questionnaire and travelers' active participation are called actively collected data (ACD). It is difficult to guarantee absolute randomness and unbiasedness in a sample when the ACD are collected due to self-selection issues. The aim of this study is to improve the model precision at low cost by using passively collected data (PCD), such as in-vehicle GPS data and transit smart card data, to release sample size restriction and reduce sampling bias of ACD in a commute mode choice model. In an empirical study, a multinomial-logit-based joint model is developed for commute mode choice by integrating ACD and PCD based on the choice-based sampling theory. A comprehensive set of explanatory variables are specified through data integration. Both simulation and empirical results show great improvement in coefficient precisions in the proposed joint model, relative to those in the ACD model and PCD model. In this study, ACD and PCD samples of Shanghai are integrated in the joint model so that several significantly influential level-of-service attributes are identified for auto, rail, and bus modes, and their impacts on commute mode choice probabilities are quantified. The findings can aid in better evaluating the program to improve the existing transit system.

KW - Actively collected data

KW - Choice-based sampling

KW - Commute mode choice

KW - Data integration

KW - Multinomial logit model

KW - Passively collected data

UR - http://www.scopus.com/inward/record.url?scp=85066984803&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85066984803&partnerID=8YFLogxK

U2 - 10.3390/su11102730

DO - 10.3390/su11102730

M3 - Article

AN - SCOPUS:85066984803

VL - 11

JO - Sustainability

JF - Sustainability

SN - 2071-1050

IS - 10

M1 - 2730

ER -