LGCENAMLJun 27, 2012

Matrix Tile Analysis

arXiv:1206.6833v112 citations
Originality Incremental advance
AI Analysis

This addresses a limitation in matrix analysis for domains like biology where element operations are not defined, though it is incremental as it builds on clustering ideas.

The paper tackles the problem of finding groups of elements in matrices where traditional factorization methods are unsuitable, by introducing matrix tile analysis (MTA) to decompose matrices into non-overlapping tiles without algebraic constraints, and shows it finds biologically relevant gene interactions in yeast data.

Many tasks require finding groups of elements in a matrix of numbers, symbols or class likelihoods. One approach is to use efficient bi- or tri-linear factorization techniques including PCA, ICA, sparse matrix factorization and plaid analysis. These techniques are not appropriate when addition and multiplication of matrix elements are not sensibly defined. More directly, methods like bi-clustering can be used to classify matrix elements, but these methods make the overly-restrictive assumption that the class of each element is a function of a row class and a column class. We introduce a general computational problem, `matrix tile analysis' (MTA), which consists of decomposing a matrix into a set of non-overlapping tiles, each of which is defined by a subset of usually nonadjacent rows and columns. MTA does not require an algebra for combining tiles, but must search over discrete combinations of tile assignments. Exact MTA is a computationally intractable integer programming problem, but we describe an approximate iterative technique and a computationally efficient sum-product relaxation of the integer program. We compare the effectiveness of these methods to PCA and plaid on hundreds of randomly generated tasks. Using double-gene-knockout data, we show that MTA finds groups of interacting yeast genes that have biologically-related functions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes