SI CY LGJun 2, 2020

Learning Opinion Dynamics From Social Traces

Corrado Monti, Gianmarco De Francisci Morales, Francesco Bonchi

arXiv:2006.01673v140 citations

Originality Incremental advance

AI Analysis

This work addresses the problem of improving data-driven analysis and hypothesis testing in opinion dynamics for social scientists, though it is incremental as it builds on existing agent-based models.

The authors tackled the limited predictive power and manual calibration of traditional agent-based opinion dynamics models by proposing a generative inference mechanism that fits to real-world social traces, enabling model selection and hypothesis testing; they applied it to Reddit data and found low prominence of the backfire effect in political conversations.

Opinion dynamics - the research field dealing with how people's opinions form and evolve in a social context - traditionally uses agent-based models to validate the implications of sociological theories. These models encode the causal mechanism that drives the opinion formation process, and have the advantage of being easy to interpret. However, as they do not exploit the availability of data, their predictive power is limited. Moreover, parameter calibration and model selection are manual and difficult tasks. In this work we propose an inference mechanism for fitting a generative, agent-like model of opinion dynamics to real-world social traces. Given a set of observables (e.g., actions and interactions between agents), our model can recover the most-likely latent opinion trajectories that are compatible with the assumptions about the process dynamics. This type of model retains the benefits of agent-based ones (i.e., causal interpretation), while adding the ability to perform model selection and hypothesis testing on real data. We showcase our proposal by translating a classical agent-based model of opinion dynamics into its generative counterpart. We then design an inference algorithm based on online expectation maximization to learn the latent parameters of the model. Such algorithm can recover the latent opinion trajectories from traces generated by the classical agent-based model. In addition, it can identify the most likely set of macro parameters used to generate a data trace, thus allowing testing of sociological hypotheses. Finally, we apply our model to real-world data from Reddit to explore the long-standing question about the impact of backfire effect. Our results suggest a low prominence of the effect in Reddit's political conversation.

View on arXiv PDF

Similar