CLApr 25, 2024

Modeling Selective Feature Attention for Representation-based Siamese Text Matching

arXiv:2404.16776v15 citationsh-index: 6
Originality Incremental advance
AI Analysis

This work addresses lightweight text matching for deployment-efficient applications, but it is incremental as it builds on existing Siamese networks with attention mechanisms.

The paper tackled the problem of improving representation-based Siamese networks for text matching by proposing Feature Attention (FA) and Selective Feature Attention (SFA) blocks to model dependencies among embedding features, resulting in enhanced performance across diverse benchmarks.

Representation-based Siamese networks have risen to popularity in lightweight text matching due to their low deployment and inference costs. While word-level attention mechanisms have been implemented within Siamese networks to improve performance, we propose Feature Attention (FA), a novel downstream block designed to enrich the modeling of dependencies among embedding features. Employing "squeeze-and-excitation" techniques, the FA block dynamically adjusts the emphasis on individual features, enabling the network to concentrate more on features that significantly contribute to the final classification. Building upon FA, we introduce a dynamic "selection" mechanism called Selective Feature Attention (SFA), which leverages a stacked BiGRU Inception structure. The SFA block facilitates multi-scale semantic extraction by traversing different stacked BiGRU layers, encouraging the network to selectively concentrate on semantic information and embedding features across varying levels of abstraction. Both the FA and SFA blocks offer a seamless integration capability with various Siamese networks, showcasing a plug-and-play characteristic. Experimental evaluations conducted across diverse text matching baselines and benchmarks underscore the indispensability of modeling feature attention and the superiority of the "selection" mechanism.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes