LGAICVApr 7, 2023

ChiroDiff: Modelling chirographic data with Diffusion Models

arXiv:2304.03785v117 citationsh-index: 77
Originality Incremental advance
AI Analysis

This work improves generative modeling for temporal geometric data, offering benefits for applications like creative design and data processing, though it is incremental as it adapts existing diffusion models to a specific domain.

The paper tackles the problem of generative modeling for chirographic data (e.g., handwriting, sketches) by introducing ChiroDiff, a diffusion model that addresses limitations of autoregressive methods, resulting in performance that is better or on par with competing approaches.

Generative modelling over continuous-time geometric constructs, a.k.a such as handwriting, sketches, drawings etc., have been accomplished through autoregressive distributions. Such strictly-ordered discrete factorization however falls short of capturing key properties of chirographic data -- it fails to build holistic understanding of the temporal concept due to one-way visibility (causality). Consequently, temporal data has been modelled as discrete token sequences of fixed sampling rate instead of capturing the true underlying concept. In this paper, we introduce a powerful model-class namely "Denoising Diffusion Probabilistic Models" or DDPMs for chirographic data that specifically addresses these flaws. Our model named "ChiroDiff", being non-autoregressive, learns to capture holistic concepts and therefore remains resilient to higher temporal sampling rate up to a good extent. Moreover, we show that many important downstream utilities (e.g. conditional sampling, creative mixing) can be flexibly implemented using ChiroDiff. We further show some unique use-cases like stochastic vectorization, de-noising/healing, abstraction are also possible with this model-class. We perform quantitative and qualitative evaluation of our framework on relevant datasets and found it to be better or on par with competing approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes