LGSOC-PHSep 2, 2024

Supervised Pattern Recognition Involving Skewed Feature Densities

arXiv:2409.01213v12 citationsh-index: 5
Originality Synthesis-oriented
AI Analysis

This work addresses pattern recognition challenges for datasets with skewed distributions, but it is incremental as it focuses on comparing existing methods under specific transformations.

The study compared Euclidean distance and a coincidence similarity-based dissimilarity index for k-neighbors classification on transformed symmetric densities, finding that the dissimilarity index improved classification accuracy for right-skewed feature densities.

Pattern recognition constitutes a particularly important task underlying a great deal of scientific and technologica activities. At the same time, pattern recognition involves several challenges, including the choice of features to represent the data elements, as well as possible respective transformations. In the present work, the classification potential of the Euclidean distance and a dissimilarity index based on the coincidence similarity index are compared by using the k-neighbors supervised classification method respectively to features resulting from several types of transformations of one- and two-dimensional symmetric densities. Given two groups characterized by respective densities without or with overlap, different types of respective transformations are obtained and employed to quantitatively evaluate the performance of k-neighbors methodologies based on the Euclidean distance an coincidence similarity index. More specifically, the accuracy of classifying the intersection point between the densities of two adjacent groups is taken into account for the comparison. Several interesting results are described and discussed, including the enhanced potential of the dissimilarity index for classifying datasets with right skewed feature densities, as well as the identification that the sharpness of the comparison between data elements can be independent of the respective supervised classification performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes