CLFeb 23
Natural Language Processing Models for Robust Document CategorizationRadoslaw Roszczyk, Pawel Tecza, Maciej Stodolski et al.
This article presents an evaluation of several machine learning methods applied to automated text classification, alongside the design of a demonstrative system for unbalanced document categorization and distribution. The study focuses on balancing classification accuracy with computational efficiency, a key consideration when integrating AI into real world automation pipelines. Three models of varying complexity were examined: a Naive Bayes classifier, a bidirectional LSTM network, and a fine tuned transformer based BERT model. The experiments reveal substantial differences in performance. BERT achieved the highest accuracy, consistently exceeding 99\%, but required significantly longer training times and greater computational resources. The BiLSTM model provided a strong compromise, reaching approximately 98.56\% accuracy while maintaining moderate training costs and offering robust contextual understanding. Naive Bayes proved to be the fastest to train, on the order of milliseconds, yet delivered the lowest accuracy, averaging around 94.5\%. Class imbalance influenced all methods, particularly in the recognition of minority categories. A fully functional demonstrative system was implemented to validate practical applicability, enabling automated routing of technical requests with throughput unattainable through manual processing. The study concludes that BiLSTM offers the most balanced solution for the examined scenario, while also outlining opportunities for future improvements and further exploration of transformer architectures.
48.4CLMar 12
Trust Oriented Explainable AI for Fake News DetectionKrzysztof Siwek, Daniel Stankowski, Maciej Stodolski
This article examines the application of Explainable Artificial Intelligence (XAI) in NLP based fake news detection and compares selected interpretability methods. The work outlines key aspects of disinformation, neural network architectures, and XAI techniques, with a focus on SHAP, LIME, and Integrated Gradients. In the experimental study, classification models were implemented and interpreted using these methods. The results show that XAI enhances model transparency and interpretability while maintaining high detection accuracy. Each method provides distinct explanatory value: SHAP offers detailed local attributions, LIME provides simple and intuitive explanations, and Integrated Gradients performs efficiently with convolutional models. The study also highlights limitations such as computational cost and sensitivity to parameterization. Overall, the findings demonstrate that integrating XAI with NLP is an effective approach to improving the reliability and trustworthiness of fake news detection systems.
6.2CEApr 30
Hybrid Anomaly Detection for Bullion Coin Authentication Leveraging Acoustic Signature AnalysisKrzysztof Siwek, Tran Hoai Linh, Tomasz Gryczka et al.
The verification of bullion coin authenticity is essential for maintaining integrity within the precious metals market; however, the increasing sophistication of counterfeits has rendered traditional inspection methods insufficient. This paper proposes a non-destructive verification framework based on acoustic frequency analysis and deep neural networks. The methodology leverages the unique acoustic fingerprint of a coin, a physical signature determined by its material composition, mass, and geometry, captured through mechanical excitation. We implement a synergistic dual-model architecture consisting of an autoencoder that reconstructs the spectrum for anomaly detection and a deep learning classifier for coin type identification. To address the challenges of environmental noise and limited dataset diversity, a dynamically calculated anomaly threshold and data augmentation techniques were employed. Experimental results demonstrate that the integrated system achieves high precision in distinguishing authentic specimens from high-quality counterfeits, maintaining stability across varying recording conditions and devices. Beyond bullion authentication, the study highlights the scalability of the proposed non-destructive testing method for assessing the safety of critical components in the automotive and aerospace industries.