Giovanni Barbarino

NA
h-index6
10papers
11citations
Novelty44%
AI Score39

10 Papers

17.5OCMay 26
Computing cone-constrained singular values of matrices

Giovanni Barbarino, Nicolas Gillis, David Sossa

This paper deals with the numerical computation of the least singular value of a rectangular matrix $A$ relative to a pair of closed convex cones $(P,Q)$, which is defined as the optimal value of the non-convex optimization problem of minimizing $\langle u,Av\rangle$ such that $u$ and $v$ are unit vectors in $P$ and $Q$, respectively. When $A$ is the identity matrix, the least singular value coincides with the cosine of the largest angle between $P$ and $Q$. When $P$ and $Q$ are positive orthants, the least singular value is called the least Pareto singular value of $A$ and has applications, for instance, in graph theory. We prove the NP-hardness of all the above problems, while identifying cases when such problems can be solved in polynomial time. We then propose four algorithms. Two are exact algorithms, meaning that they are guaranteed to compute a globally optimal solution; one uses an exact non-convex quadratic programming solver, and the other a brute-force active-set method. The other two are heuristics, meaning that they rapidly compute locally optimal solutions; one uses an alternating projection algorithm with extrapolation, and the other a sequential partial linearization approach based on fractional programming. We illustrate the use of these algorithms on several examples.

NAMar 30, 2017
Equivalence between GLT sequences and measurable functions

Giovanni Barbarino

The theory of Generalized Locally Toeplitz (GLT) sequences of matrices has been developed in order to study the asymptotic behaviour of particular spectral distributions when the dimension of the matrices tends to infinity. A key concepts in this theory are the notion of Approximating Classes of Sequences (a.c.s.), and spectral symbols, that lead to define a metric structure on the space of matrix sequences, and provide a link with the measurable functions. In this document we prove additional results regarding theoretical aspects, such as the completeness of the matrix sequences space with respect to the metric a.c.s., and the identification of the space of GLT sequences with the space of measurable functions.

NAOct 2, 2017
Diagonal Matrix Sequences and their Spectral Symbols

Giovanni Barbarino

The spectral symbols are useful tools to analyse the eigenvalue distribution when dealing with high dimensional linear systems. Given a matrix sequence with an asymptotic symbol, the last one depends only on the spectra of the individual matrices, seen as a not ordered set. We can then focus only on diagonal sequences and sort the eigenvalues so that they become an approximation of the symbol sampling. We show that this is linked to the concept of diagonal Generalized Locally Toeplitz (GLT) sequences, and in particular we prove that any diagonal sequence with a real valued symbol can be permuted in order to obtain a diagonal GLT sequence with the same symbol.

FAJul 18, 2018
Notes on asymptotic eigenvalues distribution on complex circles

Giovanni Barbarino

With tools of measure theory and symbols of matrix sequences, we explore the results regarding curves on finite fields and Weil Systems. This document wants to draw a bridge between the two areas and link the concepts of distribution of Frobenius roots in the context of Zeta functions on curves, with the spectral distributions already studied in linear algebra.

MLNov 6, 2025
Robustness of Minimum-Volume Nonnegative Matrix Factorization under an Expanded Sufficiently Scattered Condition

Giovanni Barbarino, Nicolas Gillis, Subhayan Saha

Minimum-volume nonnegative matrix factorization (min-vol NMF) has been used successfully in many applications, such as hyperspectral imaging, chemical kinetics, spectroscopy, topic modeling, and audio source separation. However, its robustness to noise has been a long-standing open problem. In this paper, we prove that min-vol NMF identifies the groundtruth factors in the presence of noise under a condition referred to as the expanded sufficiently scattered condition which requires the data points to be sufficiently well scattered in the latent simplex generated by the basis vectors.

NANov 25, 2024
On the Robustness of the Successive Projection Algorithm

Giovanni Barbarino, Nicolas Gillis

The successive projection algorithm (SPA) is a workhorse algorithm to learn the $r$ vertices of the convex hull of a set of $(r-1)$-dimensional data points, a.k.a. a latent simplex, which has numerous applications in data science. In this paper, we revisit the robustness to noise of SPA and several of its variants. In particular, when $r \geq 3$, we prove the tightness of the existing error bounds for SPA and for two more robust preconditioned variants of SPA. We also provide significantly improved error bounds for SPA, by a factor proportional to the conditioning of the $r$ vertices, in two special cases: for the first extracted vertex, and when $r \leq 2$. We then provide further improvements for the error bounds of a translated version of SPA proposed by Arora et al. (''A practical algorithm for topic modeling with provable guarantees'', ICML, 2013) in two special cases: for the first two extracted vertices, and when $r \leq 3$. Finally, we propose a new more robust variant of SPA that first shifts and lifts the data points in order to minimize the conditioning of the problem. We illustrate our results on synthetic data.

NAMar 29, 2024
Dual Simplex Volume Maximization for Simplex-Structured Matrix Factorization

Maryam Abdolali, Giovanni Barbarino, Nicolas Gillis

Simplex-structured matrix factorization (SSMF) is a generalization of nonnegative matrix factorization, a fundamental interpretable data analysis model, and has applications in hyperspectral unmixing and topic modeling. To obtain identifiable solutions, a standard approach is to find minimum-volume solutions. By taking advantage of the duality/polarity concept for polytopes, we convert minimum-volume SSMF in the primal space to a maximum-volume problem in the dual space. We first prove the identifiability of this maximum-volume dual problem. Then, we use this dual formulation to provide a novel optimization approach which bridges the gap between two existing families of algorithms for SSMF, namely volume minimization and facet identification. Numerical experiments show that the proposed approach performs favorably compared to the state-of-the-art SSMF algorithms.

NAMay 19, 2025
Identifiability of Nonnegative Tucker Decompositions -- Part I: Theory

Subhayan Saha, Giovanni Barbarino, Nicolas Gillis

Tensor decompositions have become a central tool in data science, with applications in areas such as data analysis, signal processing, and machine learning. A key property of many tensor decompositions, such as the canonical polyadic decomposition, is identifiability: the factors are unique, up to trivial scaling and permutation ambiguities. This allows one to recover the groundtruth sources that generated the data. The Tucker decomposition (TD) is a central and widely used tensor decomposition model. However, it is in general not identifiable. In this paper, we study the identifiability of the nonnegative TD (nTD). By adapting and extending identifiability results of nonnegative matrix factorization (NMF), we provide uniqueness results for nTD. Our results require the nonnegative matrix factors to have some degree of sparsity (namely, satisfy the separability condition, or the sufficiently scattered condition), while the core tensor only needs to have some slices (or linear combinations of them) or unfoldings with full column rank (but does not need to be nonnegative). Under such conditions, we derive several procedures, using either unfoldings or slices of the input tensor, to obtain identifiable nTDs by minimizing the volume of unfoldings or slices of the core tensor.

CVAug 3, 2016
Permutation NMF

Giovanni Barbarino

Nonnegative Matrix Factorization(NMF) is a common used technique in machine learning to extract features out of data such as text documents and images thanks to its natural clustering properties. In particular, it is popular in image processing since it can decompose several pictures and recognize common parts if they're located in the same position over the photos. This paper's aim is to present a way to add the translation invariance to the classical NMF, that is, the algorithms presented are able to detect common features, even when they're shifted, in different original images.