CV · MA · Feb 28

NERFIFY: A Multi-Agent Framework for Turning NeRF Papers into Code

Seemandhar Jain, Keshav Gupta, Kunal Gupta, Manmohan Chandraker
arXiv:2603.00805v1 · 1 citation
Originality: Incremental advance
AI Analysis

The paper addresses the time-consuming reimplementation work that computer vision researchers face, particularly for NeRF methods. The advance is incremental: it builds on existing paper-to-code methods and adds domain-specific innovations.

The paper tackles the problem of reimplementing neural radiance field (NeRF) research papers by introducing NERFIFY, a multi-agent framework that converts papers into trainable Nerfstudio plugins, achieving visual quality matching expert human code (+/-0.5 dB PSNR, +/-0.2 SSIM) and reducing implementation time from weeks to minutes.
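To make the "+/-0.5 dB PSNR" criterion concrete, here is a minimal sketch of how such a comparison could be computed. It uses the standard PSNR definition, 10 * log10(MAX^2 / MSE). The function names `psnr` and `within_tolerance`, and the flat-list image representation, are illustrative assumptions and are not taken from the paper.

```python
import math


def psnr(reference, rendered, max_value=1.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences.

    Pixels are assumed to be floats in [0, max_value] (an assumption for this sketch).
    """
    if len(reference) != len(rendered):
        raise ValueError("images must have the same number of pixels")
    mse = sum((a - b) ** 2 for a, b in zip(reference, rendered)) / len(reference)
    if mse == 0:
        return math.inf  # identical images have unbounded PSNR
    return 10.0 * math.log10(max_value ** 2 / mse)


def within_tolerance(psnr_generated, psnr_expert, tol_db=0.5):
    """True when two PSNR scores differ by at most tol_db decibels."""
    return abs(psnr_generated - psnr_expert) <= tol_db
```

Under this reading, a generated implementation scoring 30.2 dB would count as matching an expert implementation scoring 30.6 dB, while 30.0 dB against 31.0 dB would not.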

The proliferation of neural radiance field (NeRF) research requires significant efforts to reimplement papers before building upon them. We introduce NERFIFY, a multi-agent framework that reliably converts NeRF research papers into trainable Nerfstudio plugins, in contrast to generic paper-to-code methods and frontier models like GPT-5 that usually fail to produce runnable code. NERFIFY achieves domain-specific executability through six key innovations: (1) Context-free grammar (CFG): LLM synthesis is constrained by Nerfstudio formalized as a CFG, ensuring generated code satisfies architectural invariants. (2) Graph-of-Thought code synthesis: Specialized multi-file-agents generate repositories in topological dependency order, validating contracts and errors at each node. (3) Compositional citation recovery: Agents automatically retrieve and integrate components (samplers, encoders, proposal networks) from citation graphs of references. (4) Visual feedback: Artifacts are diagnosed through PSNR-minima ROI analysis, cross-view geometric validation, and VLM-guided patching to iteratively improve quality. (5) Knowledge enhancement: Beyond reproduction, methods can be improved with novel optimizations. (6) Benchmarking: An evaluation framework is designed for NeRF paper-to-code synthesis across 30 diverse papers. On papers without public implementations, NERFIFY achieves visual quality matching expert human code (+/-0.5 dB PSNR, +/-0.2 SSIM) while reducing implementation time from weeks to minutes. NERFIFY demonstrates that a domain-aware design enables code translation for complex vision papers, potentiating accelerated and democratized reproducible research. Code, data and implementations will be publicly released.
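The idea in item (2), generating repository files in topological dependency order and validating each one before moving on, can be sketched as follows. This is a simplified illustration under stated assumptions, not NERFIFY's actual implementation: the file names, the `generate` and `validate` callbacks, and the dependency map are all hypothetical. Only `graphlib.TopologicalSorter` (Python 3.9+ standard library) is a real API.

```python
from graphlib import TopologicalSorter

# Hypothetical plugin files mapped to the files each one depends on.
DEPENDENCIES = {
    "field.py": set(),
    "sampler.py": set(),
    "model.py": {"field.py", "sampler.py"},
    "config.py": {"model.py"},
}


def generation_order(dependencies):
    """Return file names so that each file appears after all of its dependencies."""
    return list(TopologicalSorter(dependencies).static_order())


def generate_repository(dependencies, generate, validate):
    """Generate files in dependency order, validating each before continuing.

    `generate(name, context)` and `validate(name, source)` stand in for an
    LLM agent and a contract check; both are assumptions for this sketch.
    """
    done = {}
    for name in generation_order(dependencies):
        # Each file sees only the already-generated files it depends on.
        context = {dep: done[dep] for dep in dependencies.get(name, ())}
        source = generate(name, context)
        if not validate(name, source):
            raise RuntimeError(f"validation failed for {name}")
        done[name] = source
    return done
```

Because every file is produced only after its dependencies have passed validation, an error is caught at the node where it arises instead of surfacing later in a file that imports it.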

