LG AR Jan 31, 2025

An All-digital 8.6-nJ/Frame 65-nm Tsetlin Machine Image Classification Accelerator

Svein Anders Tunheim, Yujin Zheng, Lei Jiao, Rishad Shafik, Alex Yakovlev, Ole-Christoffer Granmo

arXiv:2501.19347v3 | IEEE Trans Circuit Syst I-regular Pap

Originality: Synthesis-oriented

AI Analysis

This work addresses energy efficiency for embedded or edge computing applications, though it is incremental as it implements an existing algorithm in hardware.

The researchers tackled image classification by developing an energy-efficient hardware accelerator based on the Tsetlin machine. It achieves 60.3k classifications per second at 8.6 nJ per classification, with a test accuracy of 97.42% on MNIST.
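
The two headline figures above can be combined into an average power figure. The following is a minimal arithmetic sketch, not a measurement: it assumes the accelerator classifies continuously at the quoted rate and that the quoted energy per classification applies at that rate. The function and constant names are illustrative only.

```python
# Figures quoted in the summary above.
CLASSIFICATIONS_PER_SECOND = 60.3e3    # 60.3k classifications per second
ENERGY_PER_CLASSIFICATION_J = 8.6e-9   # 8.6 nJ per classification


def average_power_watts(rate_hz: float, energy_j: float) -> float:
    """Average power = classifications per second * energy per classification.

    Assumes continuous operation at the given rate (an assumption of this sketch).
    """
    return rate_hz * energy_j


power_w = average_power_watts(CLASSIFICATIONS_PER_SECOND,
                              ENERGY_PER_CLASSIFICATION_J)
print(f"Implied average power: {power_w * 1e3:.3f} mW")
```

Under these assumptions the implied average power is roughly half a milliwatt.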

We present an all-digital programmable machine learning accelerator chip for image classification, based on the principles of the Tsetlin machine (TM). The TM is an emerging machine learning algorithm founded on propositional logic, utilizing sub-pattern recognition expressions called clauses. The accelerator implements the coalesced TM version with convolution, and classifies booleanized images of 28$\times$28 pixels into 10 categories. A configuration with 128 clauses is used in a highly parallel architecture. Fast clause evaluation is achieved by keeping all clause weights and Tsetlin automata (TA) action signals in registers. The chip is implemented in a 65 nm low-leakage CMOS technology and occupies an active area of 2.7 mm$^2$. At a clock frequency of 27.8 MHz, the accelerator achieves 60.3k classifications per second and consumes 8.6 nJ per classification. This demonstrates the energy efficiency of the TM, which was the main motivation for developing this chip. The latency for classifying a single image is 25.4 $\mu$s, which includes system timing overhead. The accelerator achieves test accuracies of 97.42%, 84.54% and 82.55% on the MNIST, Fashion-MNIST and Kuzushiji-MNIST datasets, respectively, matching the TM software models.
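
To make the clause-based inference described above more concrete, here is a software sketch of how a convolutional coalesced TM might classify one booleanized image. It is an illustration under stated assumptions, not the chip's implementation: the window size is a free parameter (the abstract does not give it), the patch position encoding used in convolutional TMs is omitted, empty clauses are assumed to output 0, and the names `extract_patches` and `classify` are hypothetical.

```python
import numpy as np


def extract_patches(image: np.ndarray, window: int) -> np.ndarray:
    """Return every window x window patch of a 2-D Boolean image as a flat row."""
    h, w = image.shape
    rows = [image[y:y + window, x:x + window].ravel()
            for y in range(h - window + 1)
            for x in range(w - window + 1)]
    return np.array(rows, dtype=bool)


def classify(image: np.ndarray, include: np.ndarray,
             weights: np.ndarray, window: int):
    """Sketch of coalesced convolutional TM inference (assumptions in the lead-in).

    image:   (H, W) Boolean image
    include: (n_clauses, 2 * window * window) Boolean TA action signals,
             True meaning the literal is included in the clause
    weights: (n_classes, n_clauses) signed integer clause weights,
             one weight per clause and class (shared clause pool)
    """
    patches = extract_patches(image, window)                 # (P, F)
    # Literals are the patch bits followed by their negations.
    literals = np.concatenate([patches, ~patches], axis=1)   # (P, 2F)
    # A clause is violated on a patch if any included literal is 0.
    zeros_hit = (~literals).astype(np.int32) @ include.T.astype(np.int32)
    violated = zeros_hit > 0                                 # (P, C)
    # Clause output: OR over patches of the AND over included literals.
    # Assumed convention: a clause with no included literals outputs 0.
    nonempty = include.any(axis=1)
    clause_out = (~violated).any(axis=0) & nonempty          # (C,)
    # Each class sums its weights over the clauses that fired.
    class_sums = weights @ clause_out.astype(np.int64)
    return int(np.argmax(class_sums)), class_sums


# Toy usage with made-up sizes (not the chip's 28x28 images, 128 clauses, 10 classes).
toy_include = np.zeros((2, 8), dtype=bool)
toy_include[0, 0] = True   # clause 0: top-left patch pixel must be 1
toy_include[1, 4] = True   # clause 1: top-left patch pixel must be 0
toy_weights = np.array([[-2, 3], [4, -1]])
label, sums = classify(np.ones((3, 3), dtype=bool), toy_include, toy_weights, window=2)
print(label, sums)
```

In this sketch the registers mentioned in the abstract correspond to `include` (TA action signals) and `weights` (clause weights); keeping both immediately available is what allows all clauses to be evaluated in parallel on each patch.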
