CVApr 9, 2024

GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis

arXiv:2404.06637v138 citationsh-index: 8Has Code2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Originality Incremental advance
AI Analysis

This work addresses the need for customizable satellite image synthesis for applications like urban planning or simulation, but it appears incremental as it builds on existing generative models with added controls.

The paper tackles the problem of synthesizing satellite images with control over global style and layout, using textual prompts or geographic location, and demonstrates that the model generates diverse, high-quality images with excellent zero-shot generalization.

We present GeoSynth, a model for synthesizing satellite images with global style and image-driven layout control. The global style control is via textual prompts or geographic location. These enable the specification of scene semantics or regional appearance respectively, and can be used together. We train our model on a large dataset of paired satellite imagery, with automatically generated captions, and OpenStreetMap data. We evaluate various combinations of control inputs, including different types of layout controls. Results demonstrate that our model can generate diverse, high-quality images and exhibits excellent zero-shot generalization. The code and model checkpoints are available at https://github.com/mvrl/GeoSynth.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes