Reflective Hybrid Intelligence for Meaningful Human Control in Decision-Support Systems
This work addresses the societal problem of maintaining human autonomy and ethical alignment in AI decision-support systems, though it is incremental as it builds on existing interdisciplinary concepts.
The paper tackles the challenge of ensuring meaningful human control over AI systems by introducing a framework for self-reflective AI that integrates psychology, philosophy, and machine learning to align AI with human values and social norms, aiming to empower human moral reasoning and reduce moral blind spots.
With the growing capabilities and pervasiveness of AI systems, societies must collectively choose between reduced human autonomy, endangered democracies and limited human rights, and AI that is aligned to human and social values, nurturing collaboration, resilience, knowledge and ethical behaviour. In this chapter, we introduce the notion of self-reflective AI systems for meaningful human control over AI systems. Focusing on decision support systems, we propose a framework that integrates knowledge from psychology and philosophy with formal reasoning methods and machine learning approaches to create AI systems responsive to human values and social norms. We also propose a possible research approach to design and develop self-reflective capability in AI systems. Finally, we argue that self-reflective AI systems can lead to self-reflective hybrid systems (human + AI), thus increasing meaningful human control and empowering human moral reasoning by providing comprehensible information and insights on possible human moral blind spots.