CLLGJul 13, 2023

Unsupervised Calibration through Prior Adaptation for Text Classification using Large Language Models

arXiv:2307.06713v3134 citationsh-index: 2
Originality Incremental advance
AI Analysis

This work addresses the challenge of unsupervised calibration for text classification, offering a practical solution for scenarios with limited labeled data, though it is incremental as it builds on existing calibration techniques.

The authors tackled the problem of text classification with large language models without labeled data by adapting the prior class distribution using few in-domain samples, resulting in improved performance over un-adapted models and a prior calibration method across different training shots.

A wide variety of natural language tasks are currently being addressed with large-scale language models (LLMs). These models are usually trained with a very large amount of unsupervised text data and adapted to perform a downstream natural language task using methods like fine-tuning, calibration or in-context learning. In this work, we propose an approach to adapt the prior class distribution to perform text classification tasks without the need for labelled samples and only few in-domain sample queries. The proposed approach treats the LLM as a black box, adding a stage where the model posteriors are calibrated to the task. Results show that these methods outperform the un-adapted model for different number of training shots in the prompt and a previous approach were calibration is performed without using any adaptation data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes