CV AI DB MMJul 13, 2020

Knowledge Graph Driven Approach to Represent Video Streams for Spatiotemporal Event Pattern Matching in Complex Event Processing

Piyush Yadav, Dhaval Salwala, Edward Curry

arXiv:2007.06292v13.312 citations

Originality Incremental advance

AI Analysis

This work addresses the limitation of CEP systems in handling video data, enabling complex event pattern matching for domains like activity recognition and traffic management, though it is incremental as it builds on existing graph and CEP methods.

The paper tackles the problem of processing unstructured video streams in Complex Event Processing (CEP) by introducing a graph-based representation called VEKG, which models video objects and their spatiotemporal relationships. The approach achieved F-Scores from 0.44 to 0.90 in experiments, with an optimized version reducing graph size by over 90% and speeding up search by 5.19X to sub-second latencies.

Complex Event Processing (CEP) is an event processing paradigm to perform real-time analytics over streaming data and match high-level event patterns. Presently, CEP is limited to process structured data stream. Video streams are complicated due to their unstructured data model and limit CEP systems to perform matching over them. This work introduces a graph-based structure for continuous evolving video streams, which enables the CEP system to query complex video event patterns. We propose the Video Event Knowledge Graph (VEKG), a graph driven representation of video data. VEKG models video objects as nodes and their relationship interaction as edges over time and space. It creates a semantic knowledge representation of video data derived from the detection of high-level semantic concepts from the video using an ensemble of deep learning models. A CEP-based state optimization - VEKG-Time Aggregated Graph (VEKG-TAG) is proposed over VEKG representation for faster event detection. VEKG-TAG is a spatiotemporal graph aggregation method that provides a summarized view of the VEKG graph over a given time length. We defined a set of nine event pattern rules for two domains (Activity Recognition and Traffic Management), which act as a query and applied over VEKG graphs to discover complex event patterns. To show the efficacy of our approach, we performed extensive experiments over 801 video clips across 10 datasets. The proposed VEKG approach was compared with other state-of-the-art methods and was able to detect complex event patterns over videos with F-Score ranging from 0.44 to 0.90. In the given experiments, the optimized VEKG-TAG was able to reduce 99% and 93% of VEKG nodes and edges, respectively, with 5.19X faster search time, achieving sub-second median latency of 4-20 milliseconds.

View on arXiv PDF

Similar