AI CY LGNov 12, 2025

From Model Training to Model Raising

Roland Aydin, Christian Cyron, Steve Bachelor, Ashton Anderson, Robert West

arXiv:2511.09287v27.82 citationsh-index: 3

Originality Highly original

AI Analysis

This addresses the critical need for better-aligned AI models as their capabilities surpass humans in many tasks, though it appears incremental as it builds on existing alignment efforts.

The paper tackles the problem of AI models lacking deep-rooted value systems by proposing a shift from 'model training' to 'model raising', where alignment is integrated from the start through redesigning the training corpus, aiming to make values intrinsically harder to separate from knowledge and skills.

Current AI training methods align models with human values only after their core capabilities have been established, resulting in models that are easily misaligned and lack deep-rooted value systems. We propose a paradigm shift from "model training" to "model raising", in which alignment is woven into a model's development from the start. We identify several key components for this paradigm, all centered around redesigning the training corpus: reframing training data from a first-person perspective, recontextualizing information as lived experience, simulating social interactions, and scaffolding the ordering of training data. We expect that this redesign of the training corpus will lead to an early commitment to values from the first training token onward, such that knowledge, skills, and values are intrinsically much harder to separate. In an ecosystem in which large language model capabilities start overtaking human capabilities in many tasks, this seems to us like a critical need.

View on arXiv PDF

Similar