Jonathan Ish-Horowicz

MLJan 28, 2019

Interpreting Deep Neural Networks Through Variable Importance

Jonathan Ish-Horowicz, Dana Udwin, Seth Flaxman et al.

While the success of deep neural networks (DNNs) is well-established across a variety of domains, our ability to explain and interpret these methods is limited. Unlike previously proposed local methods which try to explain particular classification decisions, we focus on global interpretability and ask a universally applicable question: given a trained model, which features are the most important? In the context of neural networks, a feature is rarely important on its own, so our strategy is specifically designed to leverage partial covariance structures and incorporate variable dependence into feature ranking. Our methodological contributions in this paper are two-fold. First, we propose an effect size analogue for DNNs that is appropriate for applications with highly collinear predictors (ubiquitous in computer vision). Second, we extend the recently proposed "RelATive cEntrality" (RATE) measure (Crawford et al., 2019) to the Bayesian deep learning setting. RATE applies an information theoretic criterion to the posterior distribution of effect sizes to assess feature significance. We apply our framework to three broad application areas: computer vision, natural language processing, and social science.

NAJul 21, 2017

Fully discrete finite element data assimilation method for the heat equation

Erik Burman, Jonathan Ish-Horowicz, Lauri Oksanen

We consider a finite element discretization for the reconstruction of the final state of the heat equation, when the initial data is unknown, but additional data is given in a sub domain in the space time. For the discretization in space we consider standard continuous affine finite element approximation, and the time derivative is discretized using a backward differentiation. We regularize the discrete system by adding a penalty of the $H^1$-semi-norm of the initial data, scaled with the mesh-parameter. The analysis of the method uses techniques developed in E. Burman and L. Oksanen, Data assimilation for the heat equation using stabilized finite element methods, arXiv, 2016, combining discrete stability of the numerical method with sharp Carleman estimates for the physical problem, to derive optimal error estimates for the approximate solution. For the natural space time energy norm, away from $t=0$, the convergence is the same as for the classical problem with known initial data, but contrary to the classical case, we do not obtain faster convergence for the $L^2$-norm at the final time.

Jonathan Ish-Horowicz

2 Papers