LGAug 23, 2022

Multi-Modal Representation Learning with Self-Adaptive Threshold for Commodity Verification

arXiv:2208.11064v4h-index: 2Has Code
Originality Incremental advance
AI Analysis

This addresses the challenge of verifying identical products for e-commerce platforms, but it is incremental as it builds on existing multi-modal and threshold-based approaches.

The paper tackles the problem of identifying identical commodities in e-commerce by using a multi-modal representation learning method with a self-adaptive threshold, achieving an F1 score of 0.8936 and third place in a competition.

In this paper, we propose a method to identify identical commodities. In e-commerce scenarios, commodities are usually described by both images and text. By definition, identical commodities are those that have identical key attributes and are cognitively identical to consumers. There are two main challenges: 1) The extraction and fusion of multi-modal representation. 2) The ability to verify identical commodities by comparing the similarity between representations and a threshold. To address the above problems, we propose an end-to-end multi-modal representation learning method with self-adaptive threshold. We use a dual-stream network to extract multi-modal commodity embeddings and threshold embeddings separately and then concatenate them to obtain commodity representation. Our method is able to adaptively adjust the threshold according to different commodities while maintaining the indexability of the commodity representation space. We experimentally validate the advantages of self-adaptive threshold and the effectiveness of multimodal representation fusion. Besides, our method achieves third place with an F1 score of 0.8936 on the second task of the CCKS-2022 Knowledge Graph Evaluation for Digital Commerce Competition. Code and pretrained models are available at https://github.com/hanchenchen/CCKS2022-track2-solution.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes