An Optimized Toolbox for Advanced Image Processing with Tsetlin Machine Composites
This work aims to strengthen Tsetlin Machine capabilities in computer vision for researchers and practitioners, though the contribution appears incremental because it builds on the existing TM Composites architecture.
The paper tackles the problem of improving Tsetlin Machine performance on color image classification, specifically on CIFAR-10. It introduces a toolbox of TM Specialists built on various image processing techniques, combined with hyperparameter optimization, and reports a new state-of-the-art accuracy of 82.8% for TMs.
The Tsetlin Machine (TM) has achieved competitive results on several image classification benchmarks, including MNIST, K-MNIST, F-MNIST, and CIFAR-2. However, color image classification is arguably still in its infancy for TMs, with CIFAR-10 being a focal point for tracking progress. Over the past few years, TM's CIFAR-10 accuracy has increased from around 61% in 2020 to 75.1% in 2023 with the introduction of Drop Clause. In this paper, we leverage the recently proposed TM Composites architecture and introduce a range of TM Specialists that use various image processing techniques. These include Canny edge detection, Histogram of Oriented Gradients, adaptive mean thresholding, adaptive Gaussian thresholding, Otsu's thresholding, color thermometers, and adaptive color thermometers. In addition, we conduct a rigorous hyperparameter search, where we uncover optimal hyperparameters for several of the TM Specialists. The result is a toolbox that provides new state-of-the-art results on CIFAR-10 for TMs with an accuracy of 82.8%. In conclusion, our toolbox of TM Specialists forms a foundation for new TM applications and a landmark for further research on TM Composites in image analysis.
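Since TMs operate on Boolean inputs, each of the listed techniques ultimately serves to turn pixel intensities into bits. As an illustration of the color thermometer idea, the following is a minimal sketch, not the paper's implementation. It assumes evenly spaced thresholds over the 0-255 intensity range; the function names (`thermometer_thresholds`, `thermometer_encode`, `encode_pixel`) and the `resolution` parameter are hypothetical and chosen only for this example.

```python
from typing import List, Sequence


def thermometer_thresholds(resolution: int, max_value: int = 255) -> List[float]:
    """Return `resolution` evenly spaced thresholds inside [0, max_value].

    Even spacing is an assumption made for this sketch; the spacing used
    by the paper's TM Specialists is not stated in the abstract.
    """
    step = (max_value + 1) / (resolution + 1)
    return [step * (i + 1) for i in range(resolution)]


def thermometer_encode(value: int, thresholds: Sequence[float]) -> List[int]:
    """Encode one intensity as bits: bit i is 1 when value >= thresholds[i]."""
    return [1 if value >= t else 0 for t in thresholds]


def encode_pixel(rgb: Sequence[int], resolution: int = 8) -> List[int]:
    """Concatenate the thermometer codes of the R, G, and B channels."""
    thresholds = thermometer_thresholds(resolution)
    bits: List[int] = []
    for channel in rgb:
        bits.extend(thermometer_encode(channel, thresholds))
    return bits


if __name__ == "__main__":
    # With resolution 3 the thresholds are 64, 128, and 192.
    print(thermometer_thresholds(3))
    print(encode_pixel((0, 100, 255), resolution=3))
```

Because the thresholds increase, every code is a run of ones followed by zeros, so nearby intensities share most of their bits. An adaptive variant would presumably derive the thresholds from the image or a local neighborhood rather than fixing them in advance; that variant is not shown here.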