EBSCO Logo
Connecting you to content on EBSCOhost
Results
Title

Investigating the contributors to hit-and-run crashes using gradient boosting decision trees.

Authors

Han, Baorui; Huang, Haibo; Li, Gen; Jiang, Chenming; Yang, Zhen; Zhu, Zhenjun

Abstract

A classification prediction model is established based on a nonlinear method—Gradient Boosting Decision Tree (GBDT) to investigate the factors contributing to a perpetrator's escape behavior in hit-and-run crashes. Given the U.S. Crash Report Sampling System (CRSS) dataset, the model is trained and compared with the state-of-art methods (Classification and Regression Tree, Random Forest, and Logistic Regression). The results show that the GBDT outperforms other methods, achieving the lowest negative log-likelihood (0.282), misclassification rate (0.096), and the highest AUC (0.803). GBDT also demonstrates superior computational efficiency, with a LIFT value of 4.087, making it a more accurate and efficient model for predicting hit-and-run crashes compared to CART, Random Forest, and Logistic Regression. The results obtained from the GBDT show that the relative importance of crash type and relation to trafficway rank 4th and 5th, respectively. Neither is mentioned in previous studies, indicating that GBDT has the ability to mine hidden information. In addition, the interaction between influencing variables can also be obtained to investigate the joint effect of various variables. The results of this study have practical applications in hit-and-run incident prevention, accident safety analysis, and other engineering applications.

Subjects

DECISION trees; RANDOM forest algorithms; LOGISTIC regression analysis; REGRESSION trees; PREDICTION models

Publication

PLoS ONE, 2025, Vol 20, Issue 1, p1

ISSN

1932-6203

Publication type

Academic Journal

DOI

10.1371/journal.pone.0314939

EBSCO Connect | Privacy policy | Terms of use | Copyright | Manage my cookies
Journals | Subjects | Sitemap
© 2025 EBSCO Industries, Inc. All rights reserved