Towards Friendly AI: A Comprehensive Review and New Perspectives on Human-AI Alignment
The paper tackles the need for ethical frameworks in AI development, a concern for both researchers and policymakers, but its contribution is incremental because it primarily reviews and synthesizes existing concepts.
This paper addresses the lack of comprehensive reviews on Friendly AI (FAI) by examining its theoretical perspectives, presenting a formal definition, and surveying applications in areas such as explainable AI and fairness. It emphasizes the importance of FAI for ethical AI development.
As Artificial Intelligence (AI) continues to advance rapidly, Friendly AI (FAI) has been proposed to advocate for more equitable and fair development of AI. Despite its importance, there is a lack of comprehensive reviews examining FAI from an ethical perspective, as well as limited discussion on its potential applications and future directions. This paper addresses these gaps by providing a thorough review of FAI, focusing on theoretical perspectives both for and against its development, and presenting a formal definition in a clear and accessible format. Key applications are discussed from the perspectives of eXplainable AI (XAI), privacy, fairness and affective computing (AC). Additionally, the paper identifies challenges in current technological advancements and explores future research avenues. The findings emphasise the significance of developing FAI and advocate for its continued advancement to ensure ethical and beneficial AI development.