CLJun 2, 2024

LLMs Could Autonomously Learn Without External Supervision

Ke Ji, Junying Chen, Anningzhe Gao, Wenya Xie, Xiang Wan, Benyou Wang

arXiv:2406.00606v24.86 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the labor-intensive and limited nature of current LLM training, potentially enhancing efficiency and enabling more self-reliant AI systems, though it appears incremental as it builds on existing learning paradigms.

The paper tackles the problem of LLMs requiring human-annotated datasets by introducing Autonomous Learning, a self-sufficient paradigm where models learn directly from text without supervision, and it outperforms pre-training, supervised fine-tuning, and retrieval-augmented methods in experiments on public quizzes.

In the quest for super-human performance, Large Language Models (LLMs) have traditionally been tethered to human-annotated datasets and predefined training objectives-a process that is both labor-intensive and inherently limited. This paper presents a transformative approach: Autonomous Learning for LLMs, a self-sufficient learning paradigm that frees models from the constraints of human supervision. This method endows LLMs with the ability to self-educate through direct interaction with text, akin to a human reading and comprehending literature. Our approach eliminates the reliance on annotated data, fostering an Autonomous Learning environment where the model independently identifies and reinforces its knowledge gaps. Empirical results from our comprehensive experiments, which utilized a diverse array of learning materials and were evaluated against standard public quizzes, reveal that Autonomous Learning outstrips the performance of both Pre-training and Supervised Fine-Tuning (SFT), as well as retrieval-augmented methods. These findings underscore the potential of Autonomous Learning to not only enhance the efficiency and effectiveness of LLM training but also to pave the way for the development of more advanced, self-reliant AI systems.

View on arXiv PDF Code

Similar