A semi-nonparametric Poisson regression model for analyzing motor vehicle crash data

Xin Ye, Ke Wang, Yajie Zou, Dominique Lord

Research output: Contribution to journalArticle

25 Citations (Scopus)

Abstract

This paper develops a semi-nonparametric Poisson regression model to analyze motor vehicle crash frequency data collected from rural multilane highway segments in California, US. Motor vehicle crash frequency on rural highway is a topic of interest in the area of transportation safety due to higher driving speeds and the resultant severity level. Unlike the traditional Negative Binomial (NB) model, the semi-nonparametric Poisson regression model can accommodate an unobserved heterogeneity following a highly flexible semi-nonparametric (SNP) distribution. Simulation experiments are conducted to demonstrate that the SNP distribution can well mimic a large family of distributions, including normal distributions, log-gamma distributions, bimodal and trimodal distributions. Empirical estimation results show that such flexibility offered by the SNP distribution can greatly improve model precision and the overall goodness-of-fit. The semi-nonparametric distribution can provide a better understanding of crash data structure through its ability to capture potential multimodality in the distribution of unobserved heterogeneity. When estimated coefficients in empirical models are compared, SNP and NB models are found to have a substantially different coefficient for the dummy variable indicating the lane width. The SNP model with better statistical performance suggests that the NB model overestimates the effect of lane width on crash frequency reduction by 83.1%.

Original languageEnglish (US)
Article numbere0197338
JournalPloS one
Volume13
Issue number5
DOIs
StatePublished - May 2018
Externally publishedYes

Fingerprint

Statistical Models
Motor Vehicles
Normal Distribution
Safety
Normal distribution
Data structures
Experiments

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)

Cite this

A semi-nonparametric Poisson regression model for analyzing motor vehicle crash data. / Ye, Xin; Wang, Ke; Zou, Yajie; Lord, Dominique.

In: PloS one, Vol. 13, No. 5, e0197338, 05.2018.

Research output: Contribution to journalArticle

Ye, Xin ; Wang, Ke ; Zou, Yajie ; Lord, Dominique. / A semi-nonparametric Poisson regression model for analyzing motor vehicle crash data. In: PloS one. 2018 ; Vol. 13, No. 5.
@article{a55d69661bbe43d2a3c225d8114fc236,
title = "A semi-nonparametric Poisson regression model for analyzing motor vehicle crash data",
abstract = "This paper develops a semi-nonparametric Poisson regression model to analyze motor vehicle crash frequency data collected from rural multilane highway segments in California, US. Motor vehicle crash frequency on rural highway is a topic of interest in the area of transportation safety due to higher driving speeds and the resultant severity level. Unlike the traditional Negative Binomial (NB) model, the semi-nonparametric Poisson regression model can accommodate an unobserved heterogeneity following a highly flexible semi-nonparametric (SNP) distribution. Simulation experiments are conducted to demonstrate that the SNP distribution can well mimic a large family of distributions, including normal distributions, log-gamma distributions, bimodal and trimodal distributions. Empirical estimation results show that such flexibility offered by the SNP distribution can greatly improve model precision and the overall goodness-of-fit. The semi-nonparametric distribution can provide a better understanding of crash data structure through its ability to capture potential multimodality in the distribution of unobserved heterogeneity. When estimated coefficients in empirical models are compared, SNP and NB models are found to have a substantially different coefficient for the dummy variable indicating the lane width. The SNP model with better statistical performance suggests that the NB model overestimates the effect of lane width on crash frequency reduction by 83.1{\%}.",
author = "Xin Ye and Ke Wang and Yajie Zou and Dominique Lord",
year = "2018",
month = "5",
doi = "10.1371/journal.pone.0197338",
language = "English (US)",
volume = "13",
journal = "PLoS One",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "5",

}

TY - JOUR

T1 - A semi-nonparametric Poisson regression model for analyzing motor vehicle crash data

AU - Ye, Xin

AU - Wang, Ke

AU - Zou, Yajie

AU - Lord, Dominique

PY - 2018/5

Y1 - 2018/5

N2 - This paper develops a semi-nonparametric Poisson regression model to analyze motor vehicle crash frequency data collected from rural multilane highway segments in California, US. Motor vehicle crash frequency on rural highway is a topic of interest in the area of transportation safety due to higher driving speeds and the resultant severity level. Unlike the traditional Negative Binomial (NB) model, the semi-nonparametric Poisson regression model can accommodate an unobserved heterogeneity following a highly flexible semi-nonparametric (SNP) distribution. Simulation experiments are conducted to demonstrate that the SNP distribution can well mimic a large family of distributions, including normal distributions, log-gamma distributions, bimodal and trimodal distributions. Empirical estimation results show that such flexibility offered by the SNP distribution can greatly improve model precision and the overall goodness-of-fit. The semi-nonparametric distribution can provide a better understanding of crash data structure through its ability to capture potential multimodality in the distribution of unobserved heterogeneity. When estimated coefficients in empirical models are compared, SNP and NB models are found to have a substantially different coefficient for the dummy variable indicating the lane width. The SNP model with better statistical performance suggests that the NB model overestimates the effect of lane width on crash frequency reduction by 83.1%.

AB - This paper develops a semi-nonparametric Poisson regression model to analyze motor vehicle crash frequency data collected from rural multilane highway segments in California, US. Motor vehicle crash frequency on rural highway is a topic of interest in the area of transportation safety due to higher driving speeds and the resultant severity level. Unlike the traditional Negative Binomial (NB) model, the semi-nonparametric Poisson regression model can accommodate an unobserved heterogeneity following a highly flexible semi-nonparametric (SNP) distribution. Simulation experiments are conducted to demonstrate that the SNP distribution can well mimic a large family of distributions, including normal distributions, log-gamma distributions, bimodal and trimodal distributions. Empirical estimation results show that such flexibility offered by the SNP distribution can greatly improve model precision and the overall goodness-of-fit. The semi-nonparametric distribution can provide a better understanding of crash data structure through its ability to capture potential multimodality in the distribution of unobserved heterogeneity. When estimated coefficients in empirical models are compared, SNP and NB models are found to have a substantially different coefficient for the dummy variable indicating the lane width. The SNP model with better statistical performance suggests that the NB model overestimates the effect of lane width on crash frequency reduction by 83.1%.

UR - http://www.scopus.com/inward/record.url?scp=85047557337&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85047557337&partnerID=8YFLogxK

U2 - 10.1371/journal.pone.0197338

DO - 10.1371/journal.pone.0197338

M3 - Article

C2 - 29791481

AN - SCOPUS:85047557337

VL - 13

JO - PLoS One

JF - PLoS One

SN - 1932-6203

IS - 5

M1 - e0197338

ER -