CV CLDec 31, 2025

CPJ: Explainable Agricultural Pest Diagnosis via Caption-Prompt-Judge with LLM-Judged Refinement

Wentao Zhang, Tao Fang, Lina Lu, Lifei Wang, Weihe Zhong

arXiv:2512.24947v16.22 citationsh-index: 12Has Code

Originality Incremental advance

AI Analysis

This work addresses the need for robust and explainable agricultural pest diagnosis for farmers and decision-makers, offering a novel method that is incremental in its approach.

The paper tackles the problem of accurate and interpretable crop disease diagnosis by proposing CPJ, a training-free few-shot framework that uses structured image captions refined by an LLM-as-Judge module, resulting in improvements of +22.7 percentage points in disease classification and +19.5 points in QA score over baselines.

Accurate and interpretable crop disease diagnosis is essential for agricultural decision-making, yet existing methods often rely on costly supervised fine-tuning and perform poorly under domain shifts. We propose Caption--Prompt--Judge (CPJ), a training-free few-shot framework that enhances Agri-Pest VQA through structured, interpretable image captions. CPJ employs large vision-language models to generate multi-angle captions, refined iteratively via an LLM-as-Judge module, which then inform a dual-answer VQA process for both recognition and management responses. Evaluated on CDDMBench, CPJ significantly improves performance: using GPT-5-mini captions, GPT-5-Nano achieves \textbf{+22.7} pp in disease classification and \textbf{+19.5} points in QA score over no-caption baselines. The framework provides transparent, evidence-based reasoning, advancing robust and explainable agricultural diagnosis without fine-tuning. Our code and data are publicly available at: https://github.com/CPJ-Agricultural/CPJ-Agricultural-Diagnosis.

View on arXiv PDF Code

Similar