CV · Feb 25, 2025

A Fusion Model for Artwork Identification Based on Convolutional Neural Networks and Transformers

arXiv:2502.18083v3 · 3 citations · h-index: 1
Originality: Synthesis-oriented
AI Analysis

This paper addresses artwork identification for cultural heritage and art analysis, but its contribution is incremental because it combines existing methods.

The paper tackles artwork identification by proposing a fusion model that combines CNNs and Transformers to leverage both local and global features, improving classification accuracy by 9.7% and 7.1% on the Chinese and oil painting datasets, respectively.

The identification of artwork is crucial in areas like cultural heritage protection, art market analysis, and historical research. With the advancement of deep learning, Convolutional Neural Networks (CNNs) and Transformer models have become key tools for image classification. While CNNs excel in local feature extraction, they struggle with global context, and Transformers are strong in capturing global dependencies but weak in fine-grained local details. To address these challenges, this paper proposes a fusion model combining CNNs and Transformers for artwork identification. The model first extracts local features using CNNs, then captures global context with a Transformer, followed by a feature fusion mechanism to enhance classification accuracy. Experiments on Chinese and oil painting datasets show the fusion model outperforms individual CNN and Transformer models, improving classification accuracy by 9.7% and 7.1%, respectively, and increasing F1 scores by 0.06 and 0.05. The results demonstrate the model's effectiveness and potential for future improvements, such as multimodal integration and architecture optimization.
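The pipeline described in the abstract (local features from a CNN, global context from a Transformer, then feature fusion before classification) can be illustrated with a minimal standard-library sketch. It is not the authors' implementation: the abstract does not specify the fusion mechanism, so concatenation of mean-pooled features is an assumption here, as are the single convolution layer, the single-head attention with identity projections, and every name (`conv2d_relu`, `self_attention`, `fusion_forward`). Weights are random and untrained, so the output only shows the data flow.

```python
import math
import random

def conv2d_relu(image, kernels):
    """CNN stage (toy): valid cross-correlation with each k x k kernel, then ReLU.
    Returns one token per spatial location, with one channel per kernel."""
    k = len(kernels[0])
    h, w = len(image), len(image[0])
    tokens = []
    for i in range(h - k + 1):
        for j in range(w - k + 1):
            token = []
            for kern in kernels:
                s = sum(kern[a][b] * image[i + a][j + b]
                        for a in range(k) for b in range(k))
                token.append(max(0.0, s))
            tokens.append(token)
    return tokens

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(tokens):
    """Transformer stage (toy): single-head scaled dot-product self-attention.
    Q, K, V projections are the identity (an assumption made for brevity)."""
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(qi * ki for qi, ki in zip(q, key)) / math.sqrt(d)
                  for key in tokens]
        weights = softmax(scores)
        out.append([sum(wt * v[c] for wt, v in zip(weights, tokens))
                    for c in range(d)])
    return out

def mean_pool(tokens):
    """Average the tokens channel-wise into one feature vector."""
    n = len(tokens)
    return [sum(t[c] for t in tokens) / n for c in range(len(tokens[0]))]

def fusion_forward(image, kernels, clf_weights):
    local_tokens = conv2d_relu(image, kernels)     # local features
    global_tokens = self_attention(local_tokens)   # global context
    # Fusion by concatenation (assumed): list "+" joins the two pooled vectors.
    fused = mean_pool(local_tokens) + mean_pool(global_tokens)
    logits = [sum(w * x for w, x in zip(row, fused)) for row in clf_weights]
    return softmax(logits)

# Hypothetical toy input: an 8x8 grayscale "image", 4 kernels, 3 classes.
rng = random.Random(0)
image = [[rng.random() for _ in range(8)] for _ in range(8)]
kernels = [[[rng.uniform(-1, 1) for _ in range(3)] for _ in range(3)]
           for _ in range(4)]
num_classes = 3
clf_weights = [[rng.uniform(-1, 1) for _ in range(2 * len(kernels))]
               for _ in range(num_classes)]
probs = fusion_forward(image, kernels, clf_weights)
print(probs)  # class probabilities from untrained weights
```

In a real model, the convolution and attention stages would be deep, trained networks, and the fused vector would feed a trained classification head. The sketch only shows where the local and global features meet.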

