CLAIDSMar 11

Instruction set for the representation of graphs

arXiv:2603.11039v115.9h-index: 22
Predicted impact top 6% in CL · last 90 daysOriginality Incremental advance
AI Analysis

This provides a compact, isomorphism-invariant encoding for graph similarity search and generation, addressing a domain-specific need in graph machine learning.

The authors tackled the problem of representing graph structures as compact strings for machine learning applications, resulting in IsalGraph, which encodes any connected graph into a string with strong correlation to graph edit distance (e.g., on benchmark datasets like IAM Letter and AIDS).

We present IsalGraph, a method for representing the structure of any finite, simple graph as a compact string over a nine-character instruction alphabet. The encoding is executed by a small virtual machine comprising a sparse graph, a circular doubly-linked list (CDLL) of graph-node references, and two traversal pointers. Instructions either move a pointer through the CDLL or insert a node or edge into the graph. A key design property is that every string over the alphabet decodes to a valid graph, with no invalid states reachable. A greedy \emph{GraphToString} algorithm encodes any connected graph into a string in time polynomial in the number of nodes; an exhaustive-backtracking variant produces a canonical string by selecting the lexicographically smallest shortest string across all starting nodes and all valid traversal orders. We evaluate the representation on five real-world graph benchmark datasets (IAM Letter LOW/MED/HIGH, LINUX, and AIDS) and show that the Levenshtein distance between IsalGraph strings correlates strongly with graph edit distance (GED). Together, these properties make IsalGraph strings a compact, isomorphism-invariant, and language-model-compatible sequential encoding of graph structure, with direct applications in graph similarity search, graph generation, and graph-conditioned language modelling

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes