CV MAFeb 28

NERFIFY: A Multi-Agent Framework for Turning NeRF Papers into Code

Seemandhar Jain, Keshav Gupta, Kunal Gupta, Manmohan Chandraker

arXiv:2603.00805v12.81 citationsh-index: 49

Originality Incremental advance

AI Analysis

This addresses the challenge of time-consuming reimplementation for researchers in computer vision, particularly in the NeRF domain, though it is incremental as it builds on existing paper-to-code methods with domain-specific innovations.

The paper tackles the problem of reimplementing neural radiance field (NeRF) research papers by introducing NERFIFY, a multi-agent framework that converts papers into trainable Nerfstudio plugins, achieving visual quality matching expert human code (+/-0.5 dB PSNR, +/-0.2 SSIM) and reducing implementation time from weeks to minutes.

The proliferation of neural radiance field (NeRF) research requires significant efforts to reimplement papers before building upon them. We introduce NERFIFY, a multi-agent framework that reliably converts NeRF research papers into trainable Nerfstudio plugins, in contrast to generic paper-to-code methods and frontier models like GPT-5 that usually fail to produce runnable code. NERFIFY achieves domain-specific executability through six key innovations: (1) Context-free grammar (CFG): LLM synthesis is constrained by Nerfstudio formalized as a CFG, ensuring generated code satisfies architectural invariants. (2) Graph-of-Thought code synthesis: Specialized multi-file-agents generate repositories in topological dependency order, validating contracts and errors at each node. (3) Compositional citation recovery: Agents automatically retrieve and integrate components (samplers, encoders, proposal networks) from citation graphs of references. (4) Visual feedback: Artifacts are diagnosed through PSNR-minima ROI analysis, cross-view geometric validation, and VLM-guided patching to iteratively improve quality. (5) Knowledge enhancement: Beyond reproduction, methods can be improved with novel optimizations. (6) Benchmarking: An evaluation framework is designed for NeRF paper-to-code synthesis across 30 diverse papers. On papers without public implementations, NERFIFY achieves visual quality matching expert human code (+/-0.5 dB PSNR, +/-0.2 SSIM) while reducing implementation time from weeks to minutes. NERFIFY demonstrates that a domain-aware design enables code translation for complex vision papers, potentiating accelerated and democratized reproducible research. Code, data and implementations will be publicly released.

View on arXiv PDF

Similar