LGSISOC-PHMLJun 28, 2019

Angular separability of data clusters or network communities in geometrical space and its relevance to hyperbolic embedding

arXiv:1907.00025v110 citations
Originality Incremental advance
AI Analysis

This work provides a quantitative tool for researchers in network science and machine learning to analyze hyperbolic embeddings, though it is incremental as it builds on prior qualitative assessments.

The authors tackled the problem of quantifying angular separation of data clusters or network communities in hyperbolic embeddings, introducing the Angular Separation Index (ASI) with a statistical test. They demonstrated ASI's utility by revealing a 'dimensionality jump' in hyperbolic network models with temperature increase and detecting intrinsic dimensionality in networks growing in hidden spaces.

Analysis of 'big data' characterized by high-dimensionality such as word vectors and complex networks requires often their representation in a geometrical space by embedding. Recent developments in machine learning and network geometry have pointed out the hyperbolic space as a useful framework for the representation of this data derived by real complex physical systems. In the hyperbolic space, the radial coordinate of the nodes characterizes their hierarchy, whereas the angular distance between them represents their similarity. Several studies have highlighted the relationship between the angular coordinates of the nodes embedded in the hyperbolic space and the community metadata available. However, such analyses have been often limited to a visual or qualitative assessment. Here, we introduce the angular separation index (ASI), to quantitatively evaluate the separation of node network communities or data clusters over the angular coordinates of a geometrical space. ASI is particularly useful in the hyperbolic space - where it is extensively tested along this study - but can be used in general for any assessment of angular separation regardless of the adopted geometry. ASI is proposed together with an exact test statistic based on a uniformly random null model to assess the statistical significance of the separation. We show that ASI allows to discover two significant phenomena in network geometry. The first is that the increase of temperature in 2D hyperbolic network generative models, not only reduces the network clustering but also induces a 'dimensionality jump' of the network to dimensions higher than two. The second is that ASI can be successfully applied to detect the intrinsic dimensionality of network structures that grow in a hidden geometrical space.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes