Extended Fiducial Inference for Individual Treatment Effects via Deep Neural Networks
This work provides a rigorous statistical framework for uncertainty quantification in deep neural networks, advancing methodology for large-scale models in causal inference.
The paper tackles individual treatment effect estimation by introducing the Double Neural Network method within extended fiducial inference, demonstrating superior performance over conformal quantile regression and proving theoretical improvements in model size scaling from O(n^ζ) with 0≤ζ<1/2 to 0≤ζ<1.
Individual treatment effect estimation has gained significant attention in recent data science literature. This work introduces the Double Neural Network (Double-NN) method to address this problem within the framework of extended fiducial inference (EFI). In the proposed method, deep neural networks are used to model the treatment and control effect functions, while an additional neural network is employed to estimate their parameters. The universal approximation capability of deep neural networks ensures the broad applicability of this method. Numerical results highlight the superior performance of the proposed Double-NN method compared to the conformal quantile regression (CQR) method in individual treatment effect estimation. From the perspective of statistical inference, this work advances the theory and methodology for statistical inference of large models. Specifically, it is theoretically proven that the proposed method permits the model size to increase with the sample size $n$ at a rate of $O(n^ζ)$ for some $0 \leq ζ<1$, while still maintaining proper quantification of uncertainty in the model parameters. This result marks a significant improvement compared to the range $0\leq ζ< \frac{1}{2}$ required by the classical central limit theorem. Furthermore, this work provides a rigorous framework for quantifying the uncertainty of deep neural networks under the neural scaling law, representing a substantial contribution to the statistical understanding of large-scale neural network models.