CVIVJan 27, 2020

SafeNet: An Assistive Solution to Assess Incoming Threats for Premises

arXiv:2002.04405v1
AI Analysis

This provides an assistive solution for home safety, particularly for people with or without disabilities, by detecting and identifying threats like intruders, though it appears incremental in combining existing methods.

The paper tackles the problem of assessing security threats in homes by developing SafeNet, an integrated assistive system that generates context-oriented image descriptions from real-time video streams, achieving an average F-score of 0.97 for identifying known versus unknown persons and generating image descriptions from 10 classes.

An assistive solution to assess incoming threats (e.g., robbery, burglary, gun violence) for homes will enhance the safety of the people with or without disabilities. This paper presents "SafeNet"- an integrated assistive system to generate context-oriented image descriptions to assess incoming threats. The key functionality of the system includes the detection and identification of human and generating image descriptions from the real-time video streams obtained from the cameras placed in strategic locations around the house. In this paper, we focus on developing a robust model called "SafeNet" to generate image descriptions. To interact with the system, we implemented a dialog enabled interface for creating a personalized profile from face images or videos of friends/families. To improve computational efficiency, we apply change detection to filter out frames that do not have any activity and use Faster-RCNN to detect the human presence and extract faces using Multitask Cascaded Convolutional Networks (MTCNN). Subsequently, we apply LBP/FaceNet to identify a person. SafeNet sends image descriptions to the users with an MMS containing a person's name if any match found or as "Unknown", scene image, facial description, and contextual information. SafeNet identifies friends/families/caregiver versus intruders/unknown with an average F-score 0.97 and generates image descriptions from 10 classes with an average F-measure 0.97.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes