CVNov 7, 2022
Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: ReportAndrey Ignatov, Grigory Malivenko, Radu Timofte et al. · tencent-ai
Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking and many other mobile tasks. Thus, it is very crucial to have efficient and accurate depth estimation models that can run fast on low-power mobile chipsets. In this Mobile AI challenge, the target was to develop deep learning-based single image depth estimation solutions that can show a real-time performance on IoT platforms and smartphones. For this, the participants used a large-scale RGB-to-depth dataset that was collected with the ZED stereo camera capable to generated depth maps for objects located at up to 50 meters. The runtime of all models was evaluated on the Raspberry Pi 4 platform, where the developed solutions were able to generate VGA resolution depth maps at up to 27 FPS while achieving high fidelity results. All models developed in the challenge are also compatible with any Android or Linux-based mobile devices, their detailed description is provided in this paper.
SEMay 27
Towards Demystifying and Repairing LLM-in-the-Loop VulnerabilitiesYujie Ma, Jialin Rong, Chenxi Yang et al.
Large Language Models(LLMs) have been actively integrated into modern software systems as critical components. LLM-in-the-loop vulnerabilities, where vulnerabilities are introduced by LLMs and their dependent downstream components, such as frameworks, introduce new risks. Although some benchmark datasets have been constructed to study the impact of such vulnerabilities, most works still remain at the analysis from the conventional software level, ignoring the harm actually caused by LLMs. Understanding real-world LLM-in-the-loop vulnerabilities is still an open problem. To address this gap, we build the first LLM-in-the-loop vulnerability dataset, LLMCVE, to facilitate the risk analysis of LLM-integrated software. To do so, we first collect 2,888 multi-source vulnerabilities across 230 popular LLM components. Then, through manual analysis, we identify 205 vulnerabilities that strictly fall under the concept of LLM-in-the-loop vulnerability. Through analysis, we found that LLMs more often play as targets or propagation vectors rather than the root cause of these vulnerabilities. Furthermore, based on LLMCVE, we evaluate the repairing capabilities of existing agent-based vulnerability repair methods, such as SWE-Agent. Experimental results demonstrate that compared to conventional software vulnerabilities, LLM-in-the-Loop vulnerabilities are more challenging to precisely fix, especially for those involving prompt injections where the Pass@1 rate is only 28.57%.
IVNov 7, 2022
Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: ReportAndrey Ignatov, Radu Timofte, Maurizio Denna et al.
Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose the participants to design an efficient quantized image super-resolution solution that can demonstrate a real-time performance on mobile NPUs. The participants were provided with the DIV2K dataset and trained INT8 models to do a high-quality 3X image upscaling. The runtime of all models was evaluated on the Synaptics VS680 Smart Home board with a dedicated edge NPU capable of accelerating quantized neural networks. All proposed solutions are fully compatible with the above NPU, demonstrating an up to 60 FPS rate when reconstructing Full HD resolution images. A detailed description of all models developed in the challenge is provided in this paper.
CVFeb 24
SurgAtt-Tracker: Online Surgical Attention Tracking via Temporal Proposal Reranking and Motion-Aware RefinementRulin Zhou, Guankun Wang, An Wang et al.
Accurate and stable field-of-view (FoV) guidance is critical for safe and efficient minimally invasive surgery, yet existing approaches often conflate visual attention estimation with downstream camera control or rely on direct object-centric assumptions. In this work, we formulate surgical attention tracking as a spatio-temporal learning problem and model surgeon focus as a dense attention heatmap, enabling continuous and interpretable frame-wise FoV guidance. We propose SurgAtt-Tracker, a holistic framework that robustly tracks surgical attention by exploiting temporal coherence through proposal-level reranking and motion-aware refinement, rather than direct regression. To support systematic training and evaluation, we introduce SurgAtt-1.16M, a large-scale benchmark with a clinically grounded annotation protocol that enables comprehensive heatmap-based attention analysis across procedures and institutions. Extensive experiments on multiple surgical datasets demonstrate that SurgAtt-Tracker consistently achieves state-of-the-art performance and strong robustness under occlusion, multi-instrument interference, and cross-domain settings. Beyond attention tracking, our approach provides a frame-wise FoV guidance signal that can directly support downstream robotic FoV planning and automatic camera control.
NAJan 22, 2010
Simulation of Wave Equation on Manifold using DECZheng Xie, Yujie Ma
The classical numerical methods play important roles in solving wave equation, e.g. finite difference time domain method. However, their computational domain are limited to flat space and the time. This paper deals with the description of discrete exterior calculus method for numerical simulation of wave equation. The advantage of this method is that it can be used to compute equation on the space manifold and the time. The analysis of its stable condition and error is also accomplished.
NAJan 13, 2010
Westervelt Equation Simulation on Manifold using DECZheng Xie, Yujie Ma
The Westervelt equation is a model for the propagation of finite amplitude ultrasound. The method of discrete exterior calculus can be used to solve this equation numerically. A significant advantage of this method is that it can be used to find numerical solutions in the discrete space manifold and the time, and therefore is a generation of finite difference time domain method. This algorithm has been implemented in C++.
NAJan 12, 2010
Two unconditional stable schemes for simulation of heat equation on manifold using DECZheng Xie, Yujie Ma
To predict the heat diffusion in a given region over time, it is often necessary to find the numerical solution for heat equation. With the techniques of discrete differential calculus, we propose two unconditional stable numerical schemes for simulation heat equation on space manifold and time. The analysis of their stability and error is accomplished by the use of maximum principle.
NADec 29, 2009
Computation of Maxwell's equations on Manifold using DECZheng Xie, Yujie Ma
In this paper, the method of discrete exterior calculus for numerically solving Maxwell's equations in space manifold and the time is discussed, which is a kind of lattice gauge theory. The analysis of its stable condition and error is also accomplished. This algorithm has been implemented on C++ plateform for simulating TE/M waves in vacuum.
NADec 29, 2009
Computation of Maxwell's equations on manifold using implicit DEC schemeZheng Xie, Yujie Ma
Maxwell's equations can be solved numerically in space manifold and the time by discrete exterior calculus as a kind of lattice gauge theory.Since the stable conditions of this method is very severe restriction, we combine the implicit scheme of time variable and discrete exterior calculus to derive an unconditional stable scheme. It is an generation of implicit Yee-like scheme, since it can be implemented in space manifold directly. The analysis of its unconditional stability and error is also accomplished.