LGDec 23, 2023

Data Classification With Multiprocessing

arXiv:2312.15152v14 citations
Originality Synthesis-oriented
AI Analysis

This work addresses efficiency and reliability in classification for ML practitioners, but it is incremental as it applies existing multiprocessing and ensembling techniques to hyperparameter tuning.

The paper tackled the problem of reducing execution time and improving accuracy in classification tasks by proposing a method that parallelly trains algorithms with different hyperparameters and ensembles their results, concluding that ensembling improves accuracy and multiprocessing reduces execution time for selected algorithms.

Classification is one of the most important tasks in Machine Learning (ML) and with recent advancements in artificial intelligence (AI) it is important to find efficient ways to implement it. Generally, the choice of classification algorithm depends on the data it is dealing with, and accuracy of the algorithm depends on the hyperparameters it is tuned with. One way is to check the accuracy of the algorithms by executing it with different hyperparameters serially and then selecting the parameters that give the highest accuracy to predict the final output. This paper proposes another way where the algorithm is parallelly trained with different hyperparameters to reduce the execution time. In the end, results from all the trained variations of the algorithms are ensembled to exploit the parallelism and improve the accuracy of prediction. Python multiprocessing is used to test this hypothesis with different classification algorithms such as K-Nearest Neighbors (KNN), Support Vector Machines (SVM), random forest and decision tree and reviews factors affecting parallelism. Ensembled output considers the predictions from all processes and final class is the one predicted by maximum number of processes. Doing this increases the reliability of predictions. We conclude that ensembling improves accuracy and multiprocessing reduces execution time for selected algorithms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes