SI · Oct 24, 2025

Just Another Hour on TikTok: ID sampling to obtain a complete slice of TikTok

arXiv:2504.13279 · 1 citation · h-index: 4
Originality: Incremental advance
AI Analysis

Provides the first near-complete slice of TikTok data for researchers studying platform characteristics and societal impact.

The authors developed a method to sample >99% of TikTok posts from a given time range, collected all posts from one hour and one minute per hour over a day, and estimated 269 million daily posts, 18% featuring children, and 0.5% AI-generated content.

TikTok is now a massive platform, and has a deep impact on global events. Despite preliminary studies, issues remain in determining fundamental characteristics of the platform. We develop a method to extract a representative sample of >99% of posts from a given time range on TikTok, and use it to collect all posts from a full hour on the platform, alongside all posts from a single minute from each hour of a day. Through this, we obtain post metadata, video media, and comments from a close-to-complete slice of TikTok, and report the critical statistics of the platform. Notably, we estimate a total of 269 million posts produced on the day we looked at, that 18% of videos on the platform feature children, and that at least 0.5% of posts contain artificial intelligence-generated content.
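
The abstract does not spell out how the daily total is derived from the one-minute-per-hour collection. As a minimal sketch, assuming each fully collected minute is simply scaled up to stand in for its whole hour, the arithmetic could look like the following. The function names and the counts are hypothetical and purely illustrative; they are not the authors' code or data.

```python
from typing import Sequence

MINUTES_PER_HOUR = 60
HOURS_PER_DAY = 24


def estimate_daily_posts(posts_per_sampled_minute: Sequence[int]) -> int:
    """Extrapolate a daily post count from one fully collected minute per hour.

    Assumption: the sampled minute is representative of its hour, so each
    count is multiplied by 60 and the 24 hourly estimates are summed.
    """
    if len(posts_per_sampled_minute) != HOURS_PER_DAY:
        raise ValueError("expected one sampled minute for each of the 24 hours")
    return sum(count * MINUTES_PER_HOUR for count in posts_per_sampled_minute)


def share(flagged: int, total: int) -> float:
    """Fraction of collected posts carrying some label (e.g. 'features children')."""
    if total <= 0:
        raise ValueError("total must be positive")
    return flagged / total


# Made-up counts: 1,000 posts observed in the sampled minute of every hour.
sampled_minutes = [1_000] * HOURS_PER_DAY
daily_estimate = estimate_daily_posts(sampled_minutes)

# Made-up labelling result: 180 of 1,000 inspected videos carry the label.
labelled_share = share(180, 1_000)
```

Under this assumption the result is only as good as the representativeness of each sampled minute, which is presumably why the full hour is collected alongside the per-hour minutes.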

Code Implementations: 1 repo
