LGCVMar 15, 2022

Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective

arXiv:2203.08124v185 citationsh-index: 108Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses reproducibility issues in neural network training for researchers, providing visual insights into double descent phenomena, though it is incremental in nature.

The paper investigates neural network reproducibility and double descent by visualizing decision boundaries, finding that model width strongly affects reproducibility, with fragmented boundaries near interpolation thresholds and high reproducibility in very narrow or wide networks.

We discuss methods for visualizing neural network decision boundaries and decision regions. We use these visualizations to investigate issues related to reproducibility and generalization in neural network training. We observe that changes in model architecture (and its associate inductive bias) cause visible changes in decision boundaries, while multiple runs with the same architecture yield results with strong similarities, especially in the case of wide architectures. We also use decision boundary methods to visualize double descent phenomena. We see that decision boundary reproducibility depends strongly on model width. Near the threshold of interpolation, neural network decision boundaries become fragmented into many small decision regions, and these regions are non-reproducible. Meanwhile, very narrows and very wide networks have high levels of reproducibility in their decision boundaries with relatively few decision regions. We discuss how our observations relate to the theory of double descent phenomena in convex models. Code is available at https://github.com/somepago/dbViz

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes