AI CL LG MA NE NCNov 30, 2025

Testing the Machine Consciousness Hypothesis

arXiv:2512.01081v13.3

Originality Synthesis-oriented

AI Analysis

This work addresses the foundational problem of understanding consciousness in AI systems, but it is incremental as it builds on existing theories and computational frameworks without presenting new experimental results.

The paper tackles the problem of testing the Machine Consciousness Hypothesis by proposing a research program that uses a computational model with a cellular automaton and neural networks to study how collective self-models emerge from communication in distributed systems, aiming to develop empirically testable theories.

The Machine Consciousness Hypothesis states that consciousness is a substrate-free functional property of computational systems capable of second-order perception. I propose a research program to investigate this idea in silico by studying how collective self-models (coherent, self-referential representations) emerge from distributed learning systems embedded within universal self-organizing environments. The theory outlined here starts from the supposition that consciousness is an emergent property of collective intelligence systems undergoing synchronization of prediction through communication. It is not an epiphenomenon of individual modeling but a property of the language that a system evolves to internally describe itself. For a model of base reality, I begin with a minimal but general computational world: a cellular automaton, which exhibits both computational irreducibility and local reducibility. On top of this computational substrate, I introduce a network of local, predictive, representational (neural) models capable of communication and adaptation. I use this layered model to study how collective intelligence gives rise to self-representation as a direct consequence of inter-agent alignment. I suggest that consciousness does not emerge from modeling per se, but from communication. It arises from the noisy, lossy exchange of predictive messages between groups of local observers describing persistent patterns in the underlying computational substrate (base reality). It is through this representational dialogue that a shared model arises, aligning many partial views of the world. The broader goal is to develop empirically testable theories of machine consciousness, by studying how internal self-models may form in distributed systems without centralized control.

View on arXiv PDF

Similar