LGEMOct 22, 2020

Theory-based residual neural networks: A synergy of discrete choice models and deep neural networks

arXiv:2010.11644v149 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of balancing prediction, interpretation, and robustness in travel behavior modeling for researchers, though it is incremental as it builds on existing DCM and DNN methods.

This study tackled the complementary limitations of data-driven and theory-driven models in travel behavior analysis by designing a theory-based residual neural network (TB-ResNet) framework that synergizes discrete choice models (DCMs) and deep neural networks (DNNs). The result showed that TB-ResNets provided greater prediction accuracy than pure DCMs and modestly improved prediction while significantly enhancing interpretation and robustness compared to pure DNNs, as tested on three datasets.

Researchers often treat data-driven and theory-driven models as two disparate or even conflicting methods in travel behavior analysis. However, the two methods are highly complementary because data-driven methods are more predictive but less interpretable and robust, while theory-driven methods are more interpretable and robust but less predictive. Using their complementary nature, this study designs a theory-based residual neural network (TB-ResNet) framework, which synergizes discrete choice models (DCMs) and deep neural networks (DNNs) based on their shared utility interpretation. The TB-ResNet framework is simple, as it uses a ($δ$, 1-$δ$) weighting to take advantage of DCMs' simplicity and DNNs' richness, and to prevent underfitting from the DCMs and overfitting from the DNNs. This framework is also flexible: three instances of TB-ResNets are designed based on multinomial logit model (MNL-ResNets), prospect theory (PT-ResNets), and hyperbolic discounting (HD-ResNets), which are tested on three data sets. Compared to pure DCMs, the TB-ResNets provide greater prediction accuracy and reveal a richer set of behavioral mechanisms owing to the utility function augmented by the DNN component in the TB-ResNets. Compared to pure DNNs, the TB-ResNets can modestly improve prediction and significantly improve interpretation and robustness, because the DCM component in the TB-ResNets stabilizes the utility functions and input gradients. Overall, this study demonstrates that it is both feasible and desirable to synergize DCMs and DNNs by combining their utility specifications under a TB-ResNet framework. Although some limitations remain, this TB-ResNet framework is an important first step to create mutual benefits between DCMs and DNNs for travel behavior modeling, with joint improvement in prediction, interpretation, and robustness.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes