CVDec 4, 2023

ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation

Dar-Yen Chen, Hamish Tennent, Ching-Wen Hsu

arXiv:2312.02109v219.858 citationsh-index: 8CVPR

Originality Highly original

AI Analysis

This addresses the problem of limited style transfer capabilities in text-to-image generation for users in creative and AI art domains, representing a strong incremental improvement.

The paper tackles text-to-image style transfer by introducing ArtAdapter, a framework that uses a multi-level style encoder and explicit adaptation to achieve high fidelity in capturing artistic styles like composition, surpassing current state-of-the-art methods.

This work introduces ArtAdapter, a transformative text-to-image (T2I) style transfer framework that transcends traditional limitations of color, brushstrokes, and object shape, capturing high-level style elements such as composition and distinctive artistic expression. The integration of a multi-level style encoder with our proposed explicit adaptation mechanism enables ArtAdapter to achieve unprecedented fidelity in style transfer, ensuring close alignment with textual descriptions. Additionally, the incorporation of an Auxiliary Content Adapter (ACA) effectively separates content from style, alleviating the borrowing of content from style references. Moreover, our novel fast finetuning approach could further enhance zero-shot style representation while mitigating the risk of overfitting. Comprehensive evaluations confirm that ArtAdapter surpasses current state-of-the-art methods.

View on arXiv PDF

Similar