LGAIMLNov 28, 2019

The Weighted Tsetlin Machine: Compressed Representations with Weighted Clauses

arXiv:1911.12607v44 citations
Originality Incremental advance
AI Analysis

This work addresses efficiency issues in interpretable pattern recognition for machine learning practitioners, though it is incremental as it builds directly on the Tsetlin Machine.

The paper tackles the computational and memory inefficiency of the Tsetlin Machine by introducing the Weighted Tsetlin Machine, which uses weighted clauses to reduce resource usage while maintaining or improving accuracy, achieving up to 1/50 of the clauses needed and a 7x speedup in sampling.

The Tsetlin Machine (TM) is an interpretable mechanism for pattern recognition that constructs conjunctive clauses from data. The clauses capture frequent patterns with high discriminating power, providing increasing expression power with each additional clause. However, the resulting accuracy gain comes at the cost of linear growth in computation time and memory usage. In this paper, we present the Weighted Tsetlin Machine (WTM), which reduces computation time and memory usage by weighting the clauses. Real-valued weighting allows one clause to replace multiple, and supports fine-tuning the impact of each clause. Our novel scheme simultaneously learns both the composition of the clauses and their weights. Furthermore, we increase training efficiency by replacing $k$ Bernoulli trials of success probability $p$ with a uniform sample of average size $p k$, the size drawn from a binomial distribution. In our empirical evaluation, the WTM achieved the same accuracy as the TM on MNIST, IMDb, and Connect-4, requiring only $1/4$, $1/3$, and $1/50$ of the clauses, respectively. With the same number of clauses, the WTM outperformed the TM, obtaining peak test accuracies of respectively $98.63\%$, $90.37\%$, and $87.91\%$. Finally, our novel sampling scheme reduced sample generation time by a factor of $7$.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes