CVAILGIVFeb 24

Towards Controllable Video Synthesis of Routine and Rare OR Events

arXiv:2602.21365v1h-index: 10
Originality Incremental advance
AI Analysis

This work addresses a data bottleneck for developing ambient intelligence in healthcare, specifically for detecting rare events in operating rooms, though it is incremental as it builds on existing diffusion methods.

The paper tackles the challenge of curating large-scale datasets for rare and safety-critical events in operating rooms by developing a video diffusion framework that synthesizes such events, achieving lower FVD/LPIPS and higher SSIM/PSNR compared to baselines and enabling AI models to detect near-misses with 70.13% recall.

Purpose: Curating large-scale datasets of operating room (OR) workflow, encompassing rare, safety-critical, or atypical events, remains operationally and ethically challenging. This data bottleneck complicates the development of ambient intelligence for detecting, understanding, and mitigating rare or safety-critical events in the OR. Methods: This work presents an OR video diffusion framework that enables controlled synthesis of rare and safety-critical events. The framework integrates a geometric abstraction module, a conditioning module, and a fine-tuned diffusion model to first transform OR scenes into abstract geometric representations, then condition the synthesis process, and finally generate realistic OR event videos. Using this framework, we also curate a synthetic dataset to train and validate AI models for detecting near-misses of sterile-field violations. Results: In synthesizing routine OR events, our method outperforms off-the-shelf video diffusion baselines, achieving lower FVD/LPIPS and higher SSIM/PSNR in both in- and out-of-domain datasets. Through qualitative results, we illustrate its ability for controlled video synthesis of counterfactual events. An AI model trained and validated on the generated synthetic data achieved a RECALL of 70.13% in detecting near safety-critical events. Finally, we conduct an ablation study to quantify performance gains from key design choices. Conclusion: Our solution enables controlled synthesis of routine and rare OR events from abstract geometric representations. Beyond demonstrating its capability to generate rare and safety-critical scenarios, we show its potential to support the development of ambient intelligence models.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes