SEAICVLGDec 22, 2023

FetaFix: Automatic Fault Localization and Repair of Deep Learning Model Conversions

arXiv:2312.15101v43 citationsh-index: 9EASE
Originality Incremental advance
AI Analysis

This addresses a practical issue for developers and researchers who need to deploy models across different frameworks, though it is incremental as it builds on existing conversion processes.

The paper tackles the problem of bugs in deep learning model conversions between frameworks, which can degrade prediction correctness, and presents FetaFix, an automated approach that fixed 462 out of 755 detected faults and improved performance in 14 out of 15 erroneous cases.

Converting deep learning models between frameworks is a common step to maximize model compatibility across devices and leverage optimization features that may be exclusively provided in one deep learning framework. However, this conversion process may be riddled with bugs, making the converted models either undeployable or problematic, considerably degrading their prediction correctness. In this paper, we propose an automated approach for fault localization and repair, FetaFix, during model conversion between deep learning frameworks. FetaFix is capable of detecting and fixing faults introduced in model input, parameters, hyperparameters, and the model graph during conversion. FetaFix uses a set of fault types (mined from surveying common conversion issues reported in code repositories and forums) to localize potential conversion faults in the converted target model and then repair them appropriately, e.g., replacing the parameters of the target model with those from the source model. This is done iteratively for every image in the dataset, comparing output label differences between the source model and the converted target model until all differences are resolved. We evaluate the effectiveness of FetaFix in fixing model conversion bugs of three widely used image recognition models converted across four different deep learning frameworks. Overall, FetaFix was able to fix $462$ out of $755$ detected conversion faults, either completely repairing or significantly improving the performance of $14$ out of the $15$ erroneous conversion cases.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes