Targeted Deep Learning: Framework, Methods, and Applications
This addresses the need for more efficient and accurate deep learning in applications like personalized medicine where inputs are few and known, though it appears incremental as it adapts existing pipelines.
The paper tackles the problem of adapting deep learning for scenarios where test inputs are known beforehand but unlabeled, such as personalized medicine, by proposing a targeted deep learning framework and a data-preprocessing method. The result shows that this approach makes standard deep learning faster and more accurate on real-world data.
Deep learning systems are typically designed to perform for a wide range of test inputs. For example, deep learning systems in autonomous cars are supposed to deal with traffic situations for which they were not specifically trained. In general, the ability to cope with a broad spectrum of unseen test inputs is called generalization. Generalization is definitely important in applications where the possible test inputs are known but plentiful or simply unknown, but there are also cases where the possible inputs are few and unlabeled but known beforehand. For example, medicine is currently interested in targeting treatments to individual patients; the number of patients at any given time is usually small (typically one), their diagnoses/responses/... are still unknown, but their general characteristics (such as genome information, protein levels in the blood, and so forth) are known before the treatment. We propose to call deep learning in such applications targeted deep learning. In this paper, we introduce a framework for targeted deep learning, and we devise and test an approach for adapting standard pipelines to the requirements of targeted deep learning. The approach is very general yet easy to use: it can be implemented as a simple data-preprocessing step. We demonstrate on a variety of real-world data that our approach can indeed render standard deep learning faster and more accurate when the test inputs are known beforehand.