CVIVOct 13, 2024

Towards Defining an Efficient and Expandable File Format for AI-Generated Contents

arXiv:2410.09834v3h-index: 12ISCAS
Originality Incremental advance
AI Analysis

This addresses storage and transmission inefficiencies for AI-generated content users, though it is incremental as it builds on existing file format concepts.

The paper tackles the challenge of storing and transmitting AI-generated content (AIGC) images by defining a new file format, AIGIF, which compresses generation syntax instead of pixel data, achieving a compression ratio of up to 1/10,000 while maintaining high fidelity.

Recently, AI-generated content (AIGC) has gained significant traction due to its powerful creation capability. However, the storage and transmission of large amounts of high-quality AIGC images inevitably pose new challenges for recent file formats. To overcome this, we define a new file format for AIGC images, named AIGIF, enabling ultra-low bitrate coding of AIGC images. Unlike compressing AIGC images intuitively with pixel-wise space as existing file formats, AIGIF instead compresses the generation syntax. This raises a crucial question: Which generation syntax elements, e.g., text prompt, device configuration, etc, are necessary for compression/transmission? To answer this question, we systematically investigate the effects of three essential factors: platform, generative model, and data configuration. We experimentally find that a well-designed composable bitstream structure incorporating the above three factors can achieve an impressive compression ratio of even up to 1/10,000 while still ensuring high fidelity. We also introduce an expandable syntax in AIGIF to support the extension of the most advanced generation models to be developed in the future.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes