CV AIDec 18, 2024

Surrealistic-like Image Generation with Vision-Language Models

arXiv:2412.14366v12.0Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of creating surrealistic art images for users of generative AI, but it is incremental as it applies existing methods to a specific style.

The paper tackled generating surrealistic-style images using vision-language models like DALL-E, Deep Dream Generator, and DreamStudio, finding that DALL-E 2 performed best when using prompts generated by ChatGPT.

Recent advances in generative AI make it convenient to create different types of content, including text, images, and code. In this paper, we explore the generation of images in the style of paintings in the surrealism movement using vision-language generative models, including DALL-E, Deep Dream Generator, and DreamStudio. Our investigation starts with the generation of images under various image generation settings and different models. The primary objective is to identify the most suitable model and settings for producing such images. Additionally, we aim to understand the impact of using edited base images on the generated resulting images. Through these experiments, we evaluate the performance of selected models and gain valuable insights into their capabilities in generating such images. Our analysis shows that Dall-E 2 performs the best when using the generated prompt by ChatGPT.

View on arXiv PDF Code

Similar