LG CVMar 15, 2022

Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective

Gowthami Somepalli, Liam Fowl, Arpit Bansal, Ping Yeh-Chiang, Yehuda Dar, Richard Baraniuk, Micah Goldblum, Tom Goldstein

arXiv:2203.08124v128.885 citationsh-index: 108Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses reproducibility issues in neural network training for researchers, providing visual insights into double descent phenomena, though it is incremental in nature.

The paper investigates neural network reproducibility and double descent by visualizing decision boundaries, finding that model width strongly affects reproducibility, with fragmented boundaries near interpolation thresholds and high reproducibility in very narrow or wide networks.

We discuss methods for visualizing neural network decision boundaries and decision regions. We use these visualizations to investigate issues related to reproducibility and generalization in neural network training. We observe that changes in model architecture (and its associate inductive bias) cause visible changes in decision boundaries, while multiple runs with the same architecture yield results with strong similarities, especially in the case of wide architectures. We also use decision boundary methods to visualize double descent phenomena. We see that decision boundary reproducibility depends strongly on model width. Near the threshold of interpolation, neural network decision boundaries become fragmented into many small decision regions, and these regions are non-reproducible. Meanwhile, very narrows and very wide networks have high levels of reproducibility in their decision boundaries with relatively few decision regions. We discuss how our observations relate to the theory of double descent phenomena in convex models. Code is available at https://github.com/somepago/dbViz

View on arXiv PDF Code

Similar