IV CV LG SPSep 4, 2025

Inferring the Graph Structure of Images for Graph Neural Networks

Mayur S Gowda, John Shi, Augusto Santos, José M. F. Moura

arXiv:2509.04677v1h-index: 2

Originality Synthesis-oriented

AI Analysis

This work addresses a domain-specific problem for researchers in graph neural networks and image processing, but it is incremental as it builds on existing methods for graph representation.

The paper tackled the problem of improving graph neural network accuracy for image classification by proposing alternative graph structures beyond traditional grid graphs, resulting in improved accuracy on MNIST and Fashion-MNIST datasets.

Image datasets such as MNIST are a key benchmark for testing Graph Neural Network (GNN) architectures. The images are traditionally represented as a grid graph with each node representing a pixel and edges connecting neighboring pixels (vertically and horizontally). The graph signal is the values (intensities) of each pixel in the image. The graphs are commonly used as input to graph neural networks (e.g., Graph Convolutional Neural Networks (Graph CNNs) [1, 2], Graph Attention Networks (GAT) [3], GatedGCN [4]) to classify the images. In this work, we improve the accuracy of downstream graph neural network tasks by finding alternative graphs to the grid graph and superpixel methods to represent the dataset images, following the approach in [5, 6]. We find row correlation, column correlation, and product graphs for each image in MNIST and Fashion-MNIST using correlations between the pixel values building on the method in [5, 6]. Experiments show that using these different graph representations and features as input into downstream GNN models improves the accuracy over using the traditional grid graph and superpixel methods in the literature.

View on arXiv PDF

Similar