LGAPMay 10, 2022

Nightly Automobile Claims Prediction from Telematics-Derived Features: A Multilevel Approach

arXiv:2205.04616v13 citationsh-index: 22
Originality Synthesis-oriented
AI Analysis

This work addresses risk identification for drivers in insurance pricing, enabling potential interventions to avoid claims, but it is incremental as it builds on existing telematics and classification methods.

The paper tackled the problem of predicting automobile claims by using telematics data to classify trips that precede a claim, achieving an area under the ROC curve above 0.6 to show feasibility of advance prediction.

In recent years it has become possible to collect GPS data from drivers and to incorporate this data into automobile insurance pricing for the driver. This data is continuously collected and processed nightly into metadata consisting of mileage and time summaries of each discrete trip taken, and a set of behavioral scores describing attributes of the trip (e.g, driver fatigue or driver distraction) so we examine whether it can be used to identify periods of increased risk by successfully classifying trips that occur immediately before a trip in which there was an incident leading to a claim for that driver. Identification of periods of increased risk for a driver is valuable because it creates an opportunity for intervention and, potentially, avoidance of a claim. We examine metadata for each trip a driver takes and train a classifier to predict whether \textit{the following trip} is one in which a claim occurs for that driver. By achieving a area under the receiver-operator characteristic above 0.6, we show that it is possible to predict claims in advance. Additionally, we compare the predictive power, as measured by the area under the receiver-operator characteristic of XGBoost classifiers trained to predict whether a driver will have a claim using exposure features such as driven miles, and those trained using behavioral features such as a computed speed score.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes