CVIVJul 11, 2023

SAR-NeRF: Neural Radiance Fields for Synthetic Aperture Radar Multi-View Representation

arXiv:2307.05087v128 citationsh-index: 33
Originality Incremental advance
AI Analysis

This work addresses the problem of poor generalization in SAR image analysis for remote sensing applications, offering a domain-specific incremental improvement by adapting NeRF to SAR data.

The paper tackles the challenge of representing Synthetic Aperture Radar (SAR) images across different viewing angles by proposing SAR-NeRF, a novel neural radiance field model that learns attenuation coefficients and scattering intensities in 3D space, resulting in improved multi-view generalization and achieving 91.6% accuracy in a 10-type classification task with only 12 images per class.

SAR images are highly sensitive to observation configurations, and they exhibit significant variations across different viewing angles, making it challenging to represent and learn their anisotropic features. As a result, deep learning methods often generalize poorly across different view angles. Inspired by the concept of neural radiance fields (NeRF), this study combines SAR imaging mechanisms with neural networks to propose a novel NeRF model for SAR image generation. Following the mapping and projection pinciples, a set of SAR images is modeled implicitly as a function of attenuation coefficients and scattering intensities in the 3D imaging space through a differentiable rendering equation. SAR-NeRF is then constructed to learn the distribution of attenuation coefficients and scattering intensities of voxels, where the vectorized form of 3D voxel SAR rendering equation and the sampling relationship between the 3D space voxels and the 2D view ray grids are analytically derived. Through quantitative experiments on various datasets, we thoroughly assess the multi-view representation and generalization capabilities of SAR-NeRF. Additionally, it is found that SAR-NeRF augumented dataset can significantly improve SAR target classification performance under few-shot learning setup, where a 10-type classification accuracy of 91.6\% can be achieved by using only 12 images per class.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes