CLAILGSYAug 22, 2024

Fine-tuning Smaller Language Models for Question Answering over Financial Documents

arXiv:2408.12337v128 citationsh-index: 8
Originality Incremental advance
AI Analysis

This work addresses the challenge of efficient question answering in the financial domain, but it is incremental as it applies an existing fine-tuning paradigm to a specific domain.

The paper tackles the problem of enabling smaller language models to perform multi-hop numerical reasoning over financial documents by fine-tuning them with reasoning exemplars from a larger teacher model, achieving performance close to the teacher model.

Recent research has shown that smaller language models can acquire substantial reasoning abilities when fine-tuned with reasoning exemplars crafted by a significantly larger teacher model. We explore this paradigm for the financial domain, focusing on the challenge of answering questions that require multi-hop numerical reasoning over financial texts. We assess the performance of several smaller models that have been fine-tuned to generate programs that encode the required financial reasoning and calculations. Our findings demonstrate that these fine-tuned smaller models approach the performance of the teacher model. To provide a granular analysis of model performance, we propose an approach to investigate the specific student model capabilities that are enhanced by fine-tuning. Our empirical analysis indicates that fine-tuning refines the student models ability to express and apply the required financial concepts along with adapting the entity extraction for the specific data format. In addition, we hypothesize and demonstrate that comparable financial reasoning capability can be induced using relatively smaller datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes