Piotr Sapiezynski

CY
4papers
57citations
Novelty26%
AI Score34

4 Papers

8.7CYMay 11
When 'For You' Isn't For You: Measuring User Agency in TikTok's Algorithmic Feed

Levi Kaplan, Devin Patel, Nicole Gerzon et al.

The short-form video-sharing service TikTok has become an important platform in the social media landscape, with much of its popularity owed to its algorithmically-driven "For You Page" (FYP). This feature serves as the "home screen" for the platform and provides a personalized feed of content for each user. Unlike other social media services-where new users start their journey by explicitly signaling whom they choose to friend or follow-the TikTok FYP algorithm instead begins making inferences based on implicit signals, such as how long they watch particular videos. As a result, users have less explicit control over what content they see, and concerns have been raised about the impact on users (e.g., the delivery of potentially harmful content). In this work, we investigate the extent to which users have control over the content they see on the FYP on TikTok. We first develop novel techniques to study the TikTok mobile app, introducing a new avenue for conducting controlled experiments that enable us to send both explicit and implicit signals on the app. We then use these techniques to study the FYP algorithm based on accounts we control. We find that the FYP algorithm is sensitive to both types of signals, changing the amount of personalized content the account sees. However, we find that users may have difficulty convincing the FYP algorithm to stop showing content the user wishes to no longer see: the most effective explicit signal-marking a video as 'Not Interested'-is unintuitively buried in the interface. Worse, we find that once accounts cease to indicate disinterest in a topic, many find their feeds dominated by such content again.

CYMay 22, 2020
The Fallibility of Contact-Tracing Apps

Piotr Sapiezynski, Johanna Pruessing, Vedran Sekara

Since the onset of the COVID-19's global spread we have been following the debate around contact tracing apps -- the tech-enabled response to the pandemic. As corporations, academics, governments, and civil society discuss the right way to implement these apps, we noticed recurring implicit assumptions. The proposed solutions are designed for a world where Internet access and smartphone ownership are a given, people are willing and able to install these apps, and those who receive notifications about potential exposure to the virus have access to testing and can isolate safely. In this work we challenge these assumptions. We not only show that there are not enough smartphones worldwide to reach required adoption thresholds but also highlight a broad lack of internet access, which affects certain groups more: the elderly, those with lower incomes, and those with limited ability to socially distance. Unfortunately, these are also the groups that are at the highest risks from COVID-19. We also report that the contact tracing apps that are already deployed on an opt-in basis show disappointing adoption levels. We warn about the potential consequences of over-extending the existing state and corporate surveillance powers. Finally, we describe a multitude of scenarios where contact tracing apps will not help regardless of access or policy. In this work we call for a comprehensive and equitable policy response that prioritizes the needs of the most vulnerable, protects human rights, and considers long term impact instead of focusing on technology-first fixes.

CYMar 23, 2018
Detrimental Network Effects in Privacy: A Graph-theoretic Model for Node-based Intrusions

Florimond Houssiau, Piotr Sapiezynski, Laura Radaelli et al.

Despite proportionality being one of the tenets of data protection laws, we currently lack a robust analytical framework to evaluate the reach of modern data collections and the network effects at play. We here propose a graph-theoretic model and notions of node- and edge-observability to quantify the reach of networked data collections. We first prove closed-form expressions for our metrics and quantify the impact of the graph's structure on observability. Second, using our model, we quantify how (1) from 270,000 compromised accounts, Cambridge Analytica collected 68.0M Facebook profiles; (2) from surveilling 0.01\% the nodes in a mobile phone network, a law-enforcement agency could observe 18.6\% of all communications; and (3) an app installed on 1\% of smartphones could monitor the location of half of the London population through close proximity tracing. Better quantifying the reach of data collection mechanisms is essential to evaluate their proportionality.

APJun 13, 2013
Crowds, Bluetooth, and Rock-n-Roll. Understanding Music Festival Participant Behavior

Jakob Eg Larsen, Piotr Sapiezynski, Arkadiusz Stopczynski et al.

In this paper we present a study of sensing and analyzing an offline social network of participants at a large-scale music festival (8 days, 130,000+ participants). We place 33 fixed-location Bluetooth scanners in strategic spots around the festival area to discover Bluetooth-enabled mobile phones carried by the participants, and thus collect spatio-temporal traces of their mobility and interactions. We subsequently analyze the data on two levels. On the micro level, we run a community detection algorithm to reveal a variety of groups the festival participants form. On the macro level, we employ an Infinite Relational Model (IRM) in order to recover the structure of the social network related to participants' music preferences. The obtained structure in the form of clusters of concerts and participants is then interpreted using meta-information about music genres, band origins, stages, and dates of performances. We show that most of the concerts clusters can be described by one or more of the meta-features, effectively revealing preferences of participants (e.g. a cluster of US bands) and discuss the significance of the findings and the potential and limitations of the used method. Finally, we discuss the possibility of employing the described method and techniques for creating user-oriented applications and extending the sensing capabilities during large-scale events by introducing user involvement.