CVAILGNENov 21, 2024

ComfyGI: Automatic Improvement of Image Generation Workflows

arXiv:2411.14193v15 citationsh-index: 11
Originality Incremental advance
AI Analysis

This addresses the need for practitioners to optimize image generation workflows without human involvement, though it is incremental as it builds on genetic improvement techniques.

The paper tackles the problem of automatic image generation workflows being sensitive to settings and requiring human intervention, introducing ComfyGI to automatically improve them, resulting in images that are about 50% better in median ImageReward score and preferred by humans in around 90% of cases.

Automatic image generation is no longer just of interest to researchers, but also to practitioners. However, current models are sensitive to the settings used and automatic optimization methods often require human involvement. To bridge this gap, we introduce ComfyGI, a novel approach to automatically improve workflows for image generation without the need for human intervention driven by techniques from genetic improvement. This enables image generation with significantly higher quality in terms of the alignment with the given description and the perceived aesthetics. On the performance side, we find that overall, the images generated with an optimized workflow are about 50% better compared to the initial workflow in terms of the median ImageReward score. These already good results are even surpassed in our human evaluation, as the participants preferred the images improved by ComfyGI in around 90% of the cases.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes