CVFeb 3, 2025
Template Matching in Images using Segmented Normalized Cross-CorrelationDavor Marušić, Siniša Popović, Zoran Kalafatić
In this paper, a new variant of an algorithm for normalized cross-correlation (NCC) is proposed in the context of template matching in images. The proposed algorithm is based on the precomputation of a template image approximation, enabling more efficient calculation of approximate NCC with the source image than using the original template for exact NCC calculation. The approximate template is precomputed from the template image by a split-and-merge approach, resulting in a decomposition to axis-aligned rectangular segments, whose sizes depend on per-segment pixel intensity variance. In the approximate template, each segment is assigned the mean grayscale value of the corresponding pixels from the original template. The proposed algorithm achieves superior computational performance with negligible NCC approximation errors compared to the well-known Fast Fourier Transform (FFT)-based NCC algorithm, when applied on less visually complex and/or smaller template images. In other cases, the proposed algorithm can maintain either computational performance or NCC approximation error within the range of the FFT-based algorithm, but not both.
MMMar 20, 2013
Multimedia stimuli databases usage patterns: a survey reportMarko Horvat, Siniša Popović, Krešimir Ćosić
Multimedia documents such as images, sounds or videos can be used to elicit emotional responses in exposed human subjects. These stimuli are stored in affective multimedia databases and successfully used for a wide variety of research in affective computing, human-computer interaction and cognitive sciences. Affective multimedia databases are simple repositories of multimedia documents with annotated high-level semantics and affective content. Although important all affective multimedia databases have numerous deficiencies which impair their applicability. To establish a better understanding of how experts use affective multimedia databases an online survey was conducted into the subject. The survey results are statistically significant and indicate that contemporary databases lack stimuli with rich semantic and emotional content. 73.33% of survey participants find the databases lacking at least some important semantic or emotion content. Most of the participants consider stimuli descriptions to be inadequate. Overall, 1-2h or more than 24h are generally needed to construct a single stimulation sequence. Almost 84% of the survey participants would like to use real-life videos in their research. Experts unequivocally recognize the need for an intelligent stimuli retrieval application that would assist them in experimentation. Almost all experts agree such applications could be useful in their work.