LG SEMay 7, 2024

Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning

Karim Galliamov, Leila Khaertdinova, Karina Denisova

arXiv:2405.04126v14.61 citationsh-index: 1Has Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of adapting large models for code-text retrieval in resource-limited settings, though it is incremental as it builds on existing PEFT techniques.

The paper tackles the high computational cost of fine-tuning large transformer models for code-text retrieval by proposing a parameter-efficient fine-tuning framework with contrastive learning, achieving improved performance while tuning only 0.4% of parameters.

The latest developments in Natural Language Processing (NLP) have demonstrated remarkable progress in a code-text retrieval problem. As the Transformer-based models used in this task continue to increase in size, the computational costs and time required for end-to-end fine-tuning become substantial. This poses a significant challenge for adapting and utilizing these models when computational resources are limited. Motivated by these concerns, we propose a fine-tuning framework that leverages Parameter-Efficient Fine-Tuning (PEFT) techniques. Moreover, we adopt contrastive learning objectives to improve the quality of bimodal representations learned by transformer models. Additionally, for PEFT methods we provide extensive benchmarking, the lack of which has been highlighted as a crucial problem in the literature. Based on the thorough experimentation with the CodeT5+ model conducted on two datasets, we demonstrate that the proposed fine-tuning framework has the potential to improve code-text retrieval performance by tuning only 0.4% parameters at most.

View on arXiv PDF Code

Similar