LGFeb 19, 2024

Synthetic location trajectory generation using categorical diffusion models

arXiv:2402.12242v1h-index: 6
Originality Synthesis-oriented
AI Analysis

This work addresses the need for synthetic mobility data to inform political decision-making and benchmark models in mobility research, representing an incremental application of existing diffusion methods to a new domain.

The authors tackled the problem of generating synthetic individual location trajectories (ILTs) by proposing a categorical diffusion model that maps continuous diffusion processes to discrete spaces, demonstrating realistic generation compared to real-world GNSS data.

Diffusion probabilistic models (DPMs) have rapidly evolved to be one of the predominant generative models for the simulation of synthetic data, for instance, for computer vision, audio, natural language processing, or biomolecule generation. Here, we propose using DPMs for the generation of synthetic individual location trajectories (ILTs) which are sequences of variables representing physical locations visited by individuals. ILTs are of major importance in mobility research to understand the mobility behavior of populations and to ultimately inform political decision-making. We represent ILTs as multi-dimensional categorical random variables and propose to model their joint distribution using a continuous DPM by first applying the diffusion process in a continuous unconstrained space and then mapping the continuous variables into a discrete space. We demonstrate that our model can synthesize realistic ILPs by comparing conditionally and unconditionally generated sequences to real-world ILPs from a GNSS tracking data set which suggests the potential use of our model for synthetic data generation, for example, for benchmarking models used in mobility research.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes