AIDec 23, 2025

Adaptive Financial Sentiment Analysis for NIFTY 50 via Instruction-Tuned LLMs , RAG and Reinforcement Learning Approaches

Chaithra, Kamesh Kadimisetty, Biju R Mohan

arXiv:2512.20082v2h-index: 4

Originality Incremental advance

AI Analysis

This addresses the problem of market-aware sentiment analysis for investors and analysts in the Indian stock market, representing a novel integration of existing techniques rather than a fundamental breakthrough.

The paper tackles financial sentiment analysis for the Indian stock market by proposing an adaptive framework that integrates instruction-tuned LLMs with retrieval-augmented generation and reinforcement learning, using real-world stock market feedback to improve predictions. Experimental results on NIFTY 50 news headlines show significant improvements in classification accuracy, F1-score, and market alignment over baseline models.

Financial sentiment analysis plays a crucial role in informing investment decisions, assessing market risk, and predicting stock price trends. Existing works in financial sentiment analysis have not considered the impact of stock prices or market feedback on sentiment analysis. In this paper, we propose an adaptive framework that integrates large language models (LLMs) with real-world stock market feedback to improve sentiment classification in the context of the Indian stock market. The proposed methodology fine-tunes the LLaMA 3.2 3B model using instruction-based learning on the SentiFin dataset. To enhance sentiment predictions, a retrieval-augmented generation (RAG) pipeline is employed that dynamically selects multi-source contextual information based on the cosine similarity of the sentence embeddings. Furthermore, a feedback-driven module is introduced that adjusts the reliability of the source by comparing predicted sentiment with actual next-day stock returns, allowing the system to iteratively adapt to market behavior. To generalize this adaptive mechanism across temporal data, a reinforcement learning agent trained using proximal policy optimization (PPO) is incorporated. The PPO agent learns to optimize source weighting policies based on cumulative reward signals from sentiment-return alignment. Experimental results on NIFTY 50 news headlines collected from 2024 to 2025 demonstrate that the proposed system significantly improves classification accuracy, F1-score, and market alignment over baseline models and static retrieval methods. The results validate the potential of combining instruction-tuned LLMs with dynamic feedback and reinforcement learning for robust, market-aware financial sentiment modeling.

View on arXiv PDF

Similar