SOC-PHMay 5
D-MODD: A Diffusion Model of Opinion Dynamics Derived from Online DataIxandra Achitouv, David Chavalarias, Raphael Fournier-S'niehotta
We present the first empirical derivation of a continuous-time stochastic model for real-world opinion dynamics. Using longitudinal social-media data to infer users opinion on a binary climate-change topic, we reconstruct the underlying drift and diffusion functions governing individual opinion updates. We show that the observed dynamics are well described by a Langevin-type stochastic differential equation, with persistent attractor basins and spatially sensitive drift and diffusion terms. The empirically inferred one-step transition probabilities closely reproduce the transition kernel generated from the D-MODD model we introduce. Our results provide the first direct evidence that online opinion dynamics on a polarized topic admit a Markovian description at the operator level, with empirically reconstructed transition kernels accurately reproduced by a data-driven Langevin model, bridging sociophysics, behavioral data, and complex-systems modeling.
CLNov 14, 2023
Natural Language Processing for Financial RegulationIxandra Achitouv, Dragos Gorduza, Antoine Jacquier
This article provides an understanding of Natural Language Processing techniques in the framework of financial regulation, more specifically in order to perform semantic matching search between rules and policy when no dataset is available for supervised learning. We outline how to outperform simple pre-trained sentences-transformer models using freely available resources and explain the mathematical concepts behind the key building blocks of Natural Language Processing.
SIJun 24, 2024
Testing network clustering algorithms with Natural Language ProcessingIxandra Achitouv, David Chavalarias, Bruno Gaume
The advent of online social networks has led to the development of an abundant literature on the study of online social groups and their relationship to individuals' personalities as revealed by their textual productions. Social structures are inferred from a wide range of social interactions. Those interactions form complex -- sometimes multi-layered -- networks, on which community detection algorithms are applied to extract higher order structures. The choice of the community detection algorithm is however hardily questioned in relation with the cultural production of the individual they classify. In this work, we assume the entangled nature of social networks and their cultural production to propose a definition of cultural based online social groups as sets of individuals whose online production can be categorized as social group-related. We take advantage of this apparently self-referential description of online social groups with a hybrid methodology that combines a community detection algorithm and a natural language processing classification algorithm. A key result of this analysis is the possibility to score community detection algorithms using their agreement with the natural language processing classification. A second result is that we can assign the opinion of a random user at >85% accuracy.