SIMMJun 23, 2018

Temporal Activity Path Based Character Correction in Social Networks

arXiv:1806.08904v1
Originality Incremental advance
AI Analysis

This addresses a pervasive data quality issue in social network analysis that affects researchers and analysts working with multimedia-derived networks.

The paper tackles the problem of character duplication in social networks where one person is represented by multiple vertices with different names, which causes incorrect network analysis results. Their proposed method uses temporal activity paths to measure similarity between characters and merge duplicates, achieving accurate character uniqueness confirmation in large-scale networks.

Vast amount of multimedia data contains massive and multifarious social information which is used to construct large-scale social networks. In a complex social network, a character should be ideally denoted by one and only one vertex. However, it is pervasive that a character is denoted by two or more vertices with different names, thus it is usually considered as multiple, different characters. This problem causes incorrectness of results in network analysis and mining. The factual challenge is that character uniqueness is hard to correctly confirm due to lots of complicated factors, e.g. name changing and anonymization, leading to character duplication. Early, limited research has shown that previous methods depended overly upon supplementary attribute information from databases. In this paper, we propose a novel method to merge the character vertices which refer to as the same entity but are denoted with different names. With this method, we firstly build the relationship network among characters based on records of social activities participated, which are extracted from multimedia sources. Then define temporal activity paths (TAPs) for each character over time. After that, we measure similarity of the TAPs for any two characters. If the similarity is high enough, the two vertices should be considered to the same character. Based on TAPs, we can determine whether to merge the two character vertices. Our experiments shown that this solution can accurately confirm character uniqueness in large-scale social network.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes