AICYJan 3, 2019

Towards a Framework Combining Machine Ethics and Machine Explainability

arXiv:1901.00590v113 citations
Originality Synthesis-oriented
AI Analysis

This addresses the need for autonomous systems to behave ethically and provide understandable explanations, though it is incremental as it builds on existing concepts.

The paper tackles the integration of Machine Ethics and Machine Explainability by proposing a framework using formal argumentation theory for ethical decision-making and explanation generation, resulting in a preliminary version of such a framework.

We find ourselves surrounded by a rapidly increasing number of autonomous and semi-autonomous systems. Two grand challenges arise from this development: Machine Ethics and Machine Explainability. Machine Ethics, on the one hand, is concerned with behavioral constraints for systems, so that morally acceptable, restricted behavior results; Machine Explainability, on the other hand, enables systems to explain their actions and argue for their decisions, so that human users can understand and justifiably trust them. In this paper, we try to motivate and work towards a framework combining Machine Ethics and Machine Explainability. Starting from a toy example, we detect various desiderata of such a framework and argue why they should and how they could be incorporated in autonomous systems. Our main idea is to apply a framework of formal argumentation theory both, for decision-making under ethical constraints and for the task of generating useful explanations given only limited knowledge of the world. The result of our deliberations can be described as a first version of an ethically motivated, principle-governed framework combining Machine Ethics and Machine Explainability

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes