OC DM LGJun 25, 2021

Binary Matrix Factorisation and Completion via Integer Programming

Reka A. Kovacs, Oktay Gunluk, Raphael A. Hauser

arXiv:2106.13434v24.01 citationsh-index: 27Has Code

Originality Incremental advance

AI Analysis

This work addresses binary matrix factorisation for identifying discrete patterns in binary data, representing an incremental improvement with a novel objective function and computational method.

The authors tackled the rank-k binary matrix factorisation problem under Boolean arithmetic by developing integer programming formulations, including a compact and exponential-size IPs with a new objective function that weights zero entries proportionally to erroneous coverage. Their approach, tested on synthetic and real datasets, achieved competitive performance and accurate low-error factorisations compared to existing methods.

Binary matrix factorisation is an essential tool for identifying discrete patterns in binary data. In this paper we consider the rank-k binary matrix factorisation problem (k-BMF) under Boolean arithmetic: we are given an n x m binary matrix X with possibly missing entries and need to find two binary matrices A and B of dimension n x k and k x m respectively, which minimise the distance between X and the Boolean product of A and B in the squared Frobenius distance. We present a compact and two exponential size integer programs (IPs) for k-BMF and show that the compact IP has a weak LP relaxation, while the exponential size IPs have a stronger equivalent LP relaxation. We introduce a new objective function, which differs from the traditional squared Frobenius objective in attributing a weight to zero entries of the input matrix that is proportional to the number of times the zero is erroneously covered in a rank-k factorisation. For one of the exponential size IPs we describe a computational approach based on column generation. Experimental results on synthetic and real word datasets suggest that our integer programming approach is competitive against available methods for k-BMF and provides accurate low-error factorisations.

View on arXiv PDF Code

Similar