AIJan 14

AI Survival Stories: a Taxonomic Analysis of AI Existential Risk

arXiv:2601.09765v18 citations
Originality Synthesis-oriented
AI Analysis

This addresses the existential risk debate for humanity, but it is incremental as it builds on existing arguments without introducing new empirical data or methods.

The paper tackles the problem of assessing AI existential risk by developing a framework based on two premises about AI power and destruction, resulting in a taxonomy of survival stories and rough probability estimates for humanity's destruction by AI.

Since the release of ChatGPT, there has been a lot of debate about whether AI systems pose an existential risk to humanity. This paper develops a general framework for thinking about the existential risk of AI systems. We analyze a two premise argument that AI systems pose a threat to humanity. Premise one: AI systems will become extremely powerful. Premise two: if AI systems become extremely powerful, they will destroy humanity. We use these two premises to construct a taxonomy of survival stories, in which humanity survives into the far future. In each survival story, one of the two premises fails. Either scientific barriers prevent AI systems from becoming extremely powerful; or humanity bans research into AI systems, thereby preventing them from becoming extremely powerful; or extremely powerful AI systems do not destroy humanity, because their goals prevent them from doing so; or extremely powerful AI systems do not destroy humanity, because we can reliably detect and disable systems that have the goal of doing so. We argue that different survival stories face different challenges. We also argue that different survival stories motivate different responses to the threats from AI. Finally, we use our taxonomy to produce rough estimates of P(doom), the probability that humanity will be destroyed by AI.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes