CV · AI · Dec 18, 2024

Surrealistic-like Image Generation with Vision-Language Models

arXiv:2412.14366v1
Originality: Synthesis-oriented
AI Analysis

This work addresses the problem of creating surrealistic art images for users of generative AI, but it is incremental as it applies existing methods to a specific style.

The paper tackles generating surrealistic-style images with vision-language models such as DALL-E, Deep Dream Generator, and DreamStudio, and finds that DALL-E 2 performs best when given prompts generated by ChatGPT.

Recent advances in generative AI make it convenient to create different types of content, including text, images, and code. In this paper, we explore the generation of images in the style of paintings from the surrealism movement using vision-language generative models, including DALL-E, Deep Dream Generator, and DreamStudio. Our investigation starts with the generation of images under various image generation settings and different models. The primary objective is to identify the most suitable model and settings for producing such images. Additionally, we aim to understand the impact of using edited base images on the resulting images. Through these experiments, we evaluate the performance of the selected models and gain valuable insights into their capabilities in generating such images. Our analysis shows that DALL-E 2 performs best when using prompts generated by ChatGPT.

Code Implementations: 1 repo
