LG · CV · Nov 13, 2020

Diffusion models for Handwriting Generation

arXiv:2011.06704v1 · 32 citations · Has Code
AI Analysis

This work addresses handwriting generation for applications like document synthesis or personalization, but it is incremental as it applies an existing diffusion model framework to a specific domain.

The paper tackles realistic handwriting generation with a diffusion probabilistic model that needs no text-recognition, writer-style, or adversarial losses. The authors report high-quality, style-consistent outputs in their experiments.

In this paper, we propose a diffusion probabilistic model for handwriting generation. Diffusion models are a class of generative models where samples start from Gaussian noise and are gradually denoised to produce output. Our method of handwriting generation does not require using any text-recognition based, writer-style based, or adversarial loss functions, nor does it require training of auxiliary networks. Our model is able to incorporate writer stylistic features directly from image data, eliminating the need for user interaction during sampling. Experiments reveal that our model is able to generate realistic, high-quality images of handwritten text in a similar style to a given writer. Our implementation can be found at https://github.com/tcl9876/Diffusion-Handwriting-Generation
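
The "start from Gaussian noise and gradually denoise" process described in the abstract can be sketched as below. This is a minimal illustration in Python with NumPy, not the authors' implementation: it assumes the standard DDPM ancestral sampling update and a linear noise schedule, neither of which the abstract specifies. The names `make_beta_schedule`, `make_oracle`, and `ddpm_sample` are hypothetical, and the "oracle" noise predictor is a toy stand-in for a trained network (which in the paper would also be conditioned on text and writer style).

```python
import numpy as np


def make_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linear noise schedule (an assumption; the abstract does not specify one)."""
    return np.linspace(beta_start, beta_end, num_steps)


def make_oracle(target, betas):
    """Toy stand-in for a trained noise-prediction network.

    It assumes the data distribution is a single point `target`, so the
    noise contained in x_t can be recovered in closed form.
    """
    alpha_bars = np.cumprod(1.0 - betas)

    def predict_noise(x_t, t):
        return (x_t - np.sqrt(alpha_bars[t]) * target) / np.sqrt(1.0 - alpha_bars[t])

    return predict_noise


def ddpm_sample(predict_noise, shape, betas, rng):
    """Standard DDPM ancestral sampling: start from noise, denoise step by step."""
    alphas = 1.0 - betas
    alpha_bars = np.cumprod(alphas)

    x = rng.standard_normal(shape)  # x_T: pure Gaussian noise
    for t in range(len(betas) - 1, -1, -1):
        eps_hat = predict_noise(x, t)
        # Mean of the reverse step, computed from the predicted noise.
        mean = (x - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps_hat) / np.sqrt(alphas[t])
        # Fresh noise is added at every step except the last one.
        noise = rng.standard_normal(shape) if t > 0 else np.zeros(shape)
        x = mean + np.sqrt(betas[t]) * noise
    return x


rng = np.random.default_rng(0)
betas = make_beta_schedule(50)
target = np.array([0.5, -1.0, 2.0])  # toy "data point", not handwriting
out = ddpm_sample(make_oracle(target, betas), target.shape, betas, rng)
print(out)  # matches `target` up to floating-point error
```

With the oracle predictor, the final denoising step recovers the toy data point exactly, which makes the loop easy to check. A real model replaces the oracle with a learned network, and its samples vary from run to run.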

Code Implementations: 2 repos