LG · Jan 24, 2023

Efficient learning of large sets of locally optimal classification rules

Van Quoc Phuong Huynh, Johannes Fürnkranz, Florian Beck

arXiv:2301.09936v27.716 citations · h-index: 50 · Has Code

Originality: Incremental advance

AI Analysis

This work addresses the need for more accurate and scalable rule-based classification, particularly in large-scale applications. The contribution is incremental, since it builds on existing rule learning paradigms.

The paper argues that conventional rule learning algorithms do not provide the optimal explanation for each example they cover. It proposes an efficient algorithm that finds a locally optimal rule for each training example. The resulting rule set is larger, and its average classification accuracy is higher than that of state-of-the-art rule learning methods on datasets ranging from small to very large.

Conventional rule learning algorithms aim at finding a set of simple rules, where each rule covers as many examples as possible. In this paper, we argue that the rules found in this way may not be the optimal explanations for each of the examples they cover. Instead, we propose an efficient algorithm that aims at finding the best rule covering each training example in a greedy optimization consisting of one specialization and one generalization loop. These locally optimal rules are collected and then filtered for a final rule set, which is much larger than the sets learned by conventional rule learning algorithms. A new example is classified by selecting the best among the rules that cover this example. In our experiments on small to very large datasets, the approach's average classification accuracy is higher than that of state-of-the-art rule learning algorithms. Moreover, the algorithm is highly efficient and can inherently be processed in parallel without affecting the learned rule set and, consequently, the classification accuracy. We thus believe that it closes an important gap for large-scale classification rule induction.
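
The per-example search described in the abstract can be illustrated with a small Python sketch. This is not the authors' implementation: the function names, the rule representation (a frozenset of attribute/value conditions), the use of the m-estimate as the rule quality heuristic, and the reduction of the filtering step to plain de-duplication are all assumptions made for illustration.

```python
from collections import Counter

def covers(rule, x):
    """A rule body is a frozenset of (attribute_index, value) conditions."""
    return all(x[i] == v for i, v in rule)

def m_estimate(rule, label, X, y, m=1.0):
    """Assumed quality heuristic: m-estimate of the rule's precision."""
    covered = [yi for xi, yi in zip(X, y) if covers(rule, xi)]
    positives = sum(1 for yi in covered if yi == label)
    prior = sum(1 for yi in y if yi == label) / len(y)
    return (positives + m * prior) / (len(covered) + m)

def best_rule_for(x, label, X, y):
    """Greedy search for one rule covering x: specialize, then generalize."""
    rule = frozenset()
    score = m_estimate(rule, label, X, y)
    # Specialization loop: add the condition taken from x that improves the score most.
    while True:
        candidates = [rule | {(i, v)} for i, v in enumerate(x) if (i, v) not in rule]
        if not candidates:
            break
        best = max(candidates, key=lambda r: m_estimate(r, label, X, y))
        best_score = m_estimate(best, label, X, y)
        if best_score <= score:
            break
        rule, score = best, best_score
    # Generalization loop: drop a condition as long as the score does not decrease.
    while rule:
        candidates = [rule - {c} for c in sorted(rule)]
        best = max(candidates, key=lambda r: m_estimate(r, label, X, y))
        best_score = m_estimate(best, label, X, y)
        if best_score < score:
            break
        rule, score = best, best_score
    return rule, label, score

def learn_rules(X, y):
    """Collect one locally optimal rule per training example, keeping each distinct rule once."""
    rules = {}
    for x, label in zip(X, y):
        body, head, score = best_rule_for(x, label, X, y)
        rules[(body, head)] = score
    return rules

def classify(rules, x, default):
    """Predict with the best-scoring rule that covers x, else fall back to a default class."""
    matching = [(score, head) for (body, head), score in rules.items() if covers(body, x)]
    if not matching:
        return default
    return max(matching, key=lambda t: t[0])[1]

# Toy usage on a hypothetical two-attribute dataset.
X_toy = [("sunny", "hot"), ("sunny", "mild"), ("rainy", "mild"), ("rainy", "hot")]
y_toy = ["no", "no", "yes", "yes"]
toy_rules = learn_rules(X_toy, y_toy)
toy_default = Counter(y_toy).most_common(1)[0][0]
print(classify(toy_rules, ("sunny", "cold"), toy_default))
```

In this sketch each call to `best_rule_for` only reads the training data, so the calls for different examples are independent of one another, which is consistent with the abstract's remark that the learning can be processed in parallel without changing the resulting rule set.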
