We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Evaluation and interpretation of driving risks: Automobile claim frequency modeling with telematics data.
- Authors
Gao, Yaqian; Huang, Yifan; Meng, Shengwang
- Abstract
With the development of vehicle telematics and data mining technology, usage‐based insurance (UBI) has aroused widespread interest from both academia and industry. The extensive driving behavior features make it possible to further understand the risks of insured vehicles, but pose challenges in the identification and interpretation of important ratemaking factors. This study, based on the telematics data of policyholders in China's mainland, analyzes insurance claim frequency of commercial trucks using both Poisson regression and several machine learning models, including regression tree, random forest, gradient boosting tree, XGBoost and neural network. After selecting the best model, we analyze feature importance, feature effects and the contribution of each feature to the prediction from an actuarial perspective. Our empirical study shows that XGBoost greatly outperforms the traditional models and detects some important risk factors, such as the average speed, the average mileage traveled per day, the fraction of night driving, the number of sudden brakes and the fraction of left/right turns at intersections. These features usually have a nonlinear effect on driving risk, and there are complex interactions between features. To further distinguish high−/low‐risk drivers, we run supervised clustering for risk segmentation according to drivers' driving habits. In summary, this study not only provide a more accurate prediction of driving risk, but also greatly satisfy the interpretability requirements of insurance regulators and risk management.
- Subjects
CHINA; REGRESSION trees; AUTOMOBILE driving; TRAFFIC safety; POISSON regression; INSURANCE claims; MACHINE learning; MOTOR vehicle driving
- Publication
Statistical Analysis & Data Mining, 2023, Vol 16, Issue 2, p97
- ISSN
1932-1864
- Publication type
Article
- DOI
10.1002/sam.11599