LGCLCYSIJun 16, 2022

All the World's a (Hyper)Graph: A Data Drama

arXiv:2206.08225v33 citationsh-index: 39
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of representation robustness in graph learning and mining for researchers, though it is incremental as it focuses on dataset creation and analysis.

The authors introduced Hyperbard, a dataset of diverse relational data representations from Shakespeare's plays, and demonstrated that solutions to graph mining problems are highly dependent on representation choice, calling current graph curation practices into question.

We introduce Hyperbard, a dataset of diverse relational data representations derived from Shakespeare's plays. Our representations range from simple graphs capturing character co-occurrence in single scenes to hypergraphs encoding complex communication settings and character contributions as hyperedges with edge-specific node weights. By making multiple intuitive representations readily available for experimentation, we facilitate rigorous representation robustness checks in graph learning, graph mining, and network analysis, highlighting the advantages and drawbacks of specific representations. Leveraging the data released in Hyperbard, we demonstrate that many solutions to popular graph mining problems are highly dependent on the representation choice, thus calling current graph curation practices into question. As an homage to our data source, and asserting that science can also be art, we present all our points in the form of a play.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes