CLFeb 10, 2024

History, Development, and Principles of Large Language Models-An Introductory Survey

arXiv:2402.06853v3102 citationsh-index: 17AI and Ethics
Originality Synthesis-oriented
AI Analysis

It addresses the limited understanding among general practitioners about LLMs by offering a comprehensible survey, which is incremental as it synthesizes existing knowledge rather than introducing new methods.

This survey provides an introductory overview of large language models (LLMs), tracing their historical development from statistical models to current capabilities in processing and generating human-level text, and explains their underlying principles to aid a broader audience.

Language models serve as a cornerstone in natural language processing (NLP), utilizing mathematical methods to generalize language laws and knowledge for prediction and generation. Over extensive research spanning decades, language modeling has progressed from initial statistical language models (SLMs) to the contemporary landscape of large language models (LLMs). Notably, the swift evolution of LLMs has reached the ability to process, understand, and generate human-level text. Nevertheless, despite the significant advantages that LLMs offer in improving both work and personal lives, the limited understanding among general practitioners about the background and principles of these models hampers their full potential. Notably, most LLM reviews focus on specific aspects and utilize specialized language, posing a challenge for practitioners lacking relevant background knowledge. In light of this, this survey aims to present a comprehensible overview of LLMs to assist a broader audience. It strives to facilitate a comprehensive understanding by exploring the historical background of language models and tracing their evolution over time. The survey further investigates the factors influencing the development of LLMs, emphasizing key contributions. Additionally, it concentrates on elucidating the underlying principles of LLMs, equipping audiences with essential theoretical knowledge. The survey also highlights the limitations of existing work and points out promising future directions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes