CVSep 24, 2024Code
VideoPatchCore: An Effective Method to Memorize Normality for Video Anomaly DetectionSunghyun Ahn, Youngwan Jo, Kijung Lee et al.
Video anomaly detection (VAD) is a crucial task in video analysis and surveillance within computer vision. Currently, VAD is gaining attention with memory techniques that store the features of normal frames. The stored features are utilized for frame reconstruction, identifying an abnormality when a significant difference exists between the reconstructed and input frames. However, this approach faces several challenges due to the simultaneous optimization required for both the memory and encoder-decoder model. These challenges include increased optimization difficulty, complexity of implementation, and performance variability depending on the memory size. To address these challenges,we propose an effective memory method for VAD, called VideoPatchCore. Inspired by PatchCore, our approach introduces a structure that prioritizes memory optimization and configures three types of memory tailored to the characteristics of video data. This method effectively addresses the limitations of existing memory-based methods, achieving good performance comparable to state-of-the-art methods. Furthermore, our method requires no training and is straightforward to implement, making VAD tasks more accessible. Our code is available online at github.com/SkiddieAhn/Paper-VideoPatchCore.
PRJan 7, 2013
On the Lq(Lp)-regularity and Besov smoothness of stochastic parabolic equations on bounded Lipschitz domainsPetru A. Cioica, Kyeong-Hun Kim, Kijung Lee et al.
We investigate the regularity of linear stochastic parabolic equations with zero Dirichlet boundary condition on bounded Lipschitz domains $O \subset R^d$ with both theoretical and numerical purpose. We use N.V. Krylov's framework of stochastic parabolic weighted Sobolev spaces $\mathfrak{H}^{γ,q}_{p,θ}(O;T)$. The summability parameters p and q in space and time may differ. Existence and uniqueness of solutions in these spaces is established and the Hölder regularity in time is analysed. Moreover, we prove a general embedding of weighted Lp(O)-Sobolev spaces into the scale of Besov spaces $B^α_{τ,τ}(O), 1/τ=α/d+1/p, α> 0$. This leads to a Hölder-Besov regularity result for the solution process. The regularity in this Besov scale determines the order of convergence that can be achieved by certain nonlinear approximation schemes.
CVMar 6, 2025Code
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLMSunghyun Ahn, Youngwan Jo, Kijung Lee et al.
Video anomaly detection (VAD) is crucial for video analysis and surveillance in computer vision. However, existing VAD models rely on learned normal patterns, which makes them difficult to apply to diverse environments. Consequently, users should retrain models or develop separate AI models for new environments, which requires expertise in machine learning, high-performance hardware, and extensive data collection, limiting the practical usability of VAD. To address these challenges, this study proposes customizable video anomaly detection (C-VAD) technique and the AnyAnomaly model. C-VAD considers user-defined text as an abnormal event and detects frames containing a specified event in a video. We effectively implemented AnyAnomaly using a context-aware visual question answering without fine-tuning the large vision language model. To validate the effectiveness of the proposed model, we constructed C-VAD datasets and demonstrated the superiority of AnyAnomaly. Furthermore, our approach showed competitive results on VAD benchmarks, achieving state-of-the-art performance on UBnormal and UCF-Crime and surpassing other methods in generalization across all datasets. Our code is available online at github.com/SkiddieAhn/Paper-AnyAnomaly.
CVMar 24, 2025
GranQ: Efficient Channel-wise Quantization via Vectorized Pre-Scaling for Zero-Shot QATInpyo Hong, Youngwan Jo, Hyojeong Lee et al.
Zero-shot quantization (ZSQ) enables neural network compression without original training data, making it a promising solution for restricted data access scenarios. To compensate for the lack of data, recent ZSQ methods typically rely on synthetic inputs generated from the full-precision model. However, these synthetic inputs often lead to activation distortion, especially under low-bit settings. To mitigate this, existing methods typically employ per-channel scaling, but they still struggle due to the severe computational overhead during the accumulation process. To overcome this critical bottleneck, we propose GranQ, a novel activation quantization framework that introduces an efficient pre-scaling strategy. Unlike conventional channel-wise methods that repeatedly perform scaling operations during accumulation, GranQ applies scaling factors in a pre-scaling step through fully vectorized computation, eliminating runtime scaling overhead. This design enables GranQ to maintain fine-grained quantization accuracy while significantly reducing computational burden, particularly in low-bit quantization settings. Extensive experiments under quantization-aware training (QAT) settings demonstrate that GranQ consistently outperforms state-of-the-art ZSQ methods across CIFAR and ImageNet. In particular, our method achieves up to 5.45% higher accuracy in the 3-bit setting on CIFAR-100 and even surpasses the full-precision baseline on CIFAR-10.