AICVIVAug 21, 2025

SurgWound-Bench: A Benchmark for Surgical Wound Diagnosis

arXiv:2508.15189v1h-index: 49Has Code
Originality Synthesis-oriented
AI Analysis

This addresses the problem of surgical site infection prevention for healthcare providers by providing tools for automated wound screening, though it is incremental as it builds on existing deep learning and MLLM methods.

The authors tackled the lack of public datasets and benchmarks for surgical wound diagnosis by creating SurgWound, an open-source dataset with 697 images annotated by 3 surgeons, and introduced a benchmark with visual question answering and report generation tasks, along with a three-stage learning framework called WoundQwen that analyzes wound characteristics to provide diagnostic reports and intervention guidance.

Surgical site infection (SSI) is one of the most common and costly healthcare-associated infections and and surgical wound care remains a significant clinical challenge in preventing SSIs and improving patient outcomes. While recent studies have explored the use of deep learning for preliminary surgical wound screening, progress has been hindered by concerns over data privacy and the high costs associated with expert annotation. Currently, no publicly available dataset or benchmark encompasses various types of surgical wounds, resulting in the absence of an open-source Surgical-Wound screening tool. To address this gap: (1) we present SurgWound, the first open-source dataset featuring a diverse array of surgical wound types. It contains 697 surgical wound images annotated by 3 professional surgeons with eight fine-grained clinical attributes. (2) Based on SurgWound, we introduce the first benchmark for surgical wound diagnosis, which includes visual question answering (VQA) and report generation tasks to comprehensively evaluate model performance. (3) Furthermore, we propose a three-stage learning framework, WoundQwen, for surgical wound diagnosis. In the first stage, we employ five independent MLLMs to accurately predict specific surgical wound characteristics. In the second stage, these predictions serve as additional knowledge inputs to two MLLMs responsible for diagnosing outcomes, which assess infection risk and guide subsequent interventions. In the third stage, we train a MLLM that integrates the diagnostic results from the previous two stages to produce a comprehensive report. This three-stage framework can analyze detailed surgical wound characteristics and provide subsequent instructions to patients based on surgical images, paving the way for personalized wound care, timely intervention, and improved patient outcomes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes