AIMar 6, 2024

Assessing the Aesthetic Evaluation Capabilities of GPT-4 with Vision: Insights from Group and Individual Assessments

Yoshia Abe, Tatsuya Daikoku, Yasuo Kuniyoshi

arXiv:2403.03594v18.57 citationsh-index: 4

Originality Synthesis-oriented

AI Analysis

It addresses the problem of aligning AI with human sensibility in aesthetic evaluation, which is incremental as it applies an existing model to a new task.

This study assessed GPT-4 with Vision's ability to predict aesthetic evaluations of images, finding it performed well in predicting both group averages and individual values, with insights into its responses to beauty and ugliness.

Recently, it has been recognized that large language models demonstrate high performance on various intellectual tasks. However, few studies have investigated alignment with humans in behaviors that involve sensibility, such as aesthetic evaluation. This study investigates the performance of GPT-4 with Vision, a state-of-the-art language model that can handle image input, on the task of aesthetic evaluation of images. We employ two tasks, prediction of the average evaluation values of a group and an individual's evaluation values. We investigate the performance of GPT-4 with Vision by exploring prompts and analyzing prediction behaviors. Experimental results reveal GPT-4 with Vision's superior performance in predicting aesthetic evaluations and the nature of different responses to beauty and ugliness. Finally, we discuss developing an AI system for aesthetic evaluation based on scientific knowledge of the human perception of beauty, employing agent technologies that integrate traditional deep learning models with large language models.

View on arXiv PDF

Similar