CLIRSep 19, 2024

Exploring Large Language Models for Product Attribute Value Identification

arXiv:2409.12695v15 citationsh-index: 28
Originality Incremental advance
AI Analysis

This addresses product search and recommendation systems by improving attribute extraction with less data, though it is incremental in applying LLMs to an existing task.

The paper tackled product attribute value identification by exploring large language models as data-efficient alternatives to existing fine-tuning methods, showing that their two-step prompt approach significantly improved zero-shot performance and instruction fine-tuning further boosted results with training data.

Product attribute value identification (PAVI) involves automatically identifying attributes and their values from product information, enabling features like product search, recommendation, and comparison. Existing methods primarily rely on fine-tuning pre-trained language models, such as BART and T5, which require extensive task-specific training data and struggle to generalize to new attributes. This paper explores large language models (LLMs), such as LLaMA and Mistral, as data-efficient and robust alternatives for PAVI. We propose various strategies: comparing one-step and two-step prompt-based approaches in zero-shot settings and utilizing parametric and non-parametric knowledge through in-context learning examples. We also introduce a dense demonstration retriever based on a pre-trained T5 model and perform instruction fine-tuning to explicitly train LLMs on task-specific instructions. Extensive experiments on two product benchmarks show that our two-step approach significantly improves performance in zero-shot settings, and instruction fine-tuning further boosts performance when using training data, demonstrating the practical benefits of using LLMs for PAVI.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes