LGCLGNMar 17, 2022

Time Dependency, Data Flow, and Competitive Advantage

arXiv:2203.09128v11 citationsh-index: 35
Originality Synthesis-oriented
AI Analysis

This research addresses the problem of optimizing data acquisition strategies for organizations in dynamic environments, though it is incremental as it applies existing methods to new data contexts.

The study analyzed how the value of data for machine learning predictions changes over time, using Reddit text data across different topics, and found that data relevance declines at varying rates depending on the subreddit, with implications for competitive strategies in business areas where data decays rapidly.

Data is fundamental to machine learning-based products and services and is considered strategic due to its externalities for businesses, governments, non-profits, and more generally for society. It is renowned that the value of organizations (businesses, government agencies and programs, and even industries) scales with the volume of available data. What is often less appreciated is that the data value in making useful organizational predictions will range widely and is prominently a function of data characteristics and underlying algorithms. In this research, our goal is to study how the value of data changes over time and how this change varies across contexts and business areas (e.g. next word prediction in the context of history, sports, politics). We focus on data from Reddit.com and compare the value's time-dependency across various Reddit topics (Subreddits). We make this comparison by measuring the rate at which user-generated text data loses its relevance to the algorithmic prediction of conversations. We show that different subreddits have different rates of relevance decline over time. Relating the text topics to various business areas of interest, we argue that competing in a business area in which data value decays rapidly alters strategies to acquire competitive advantage. When data value decays rapidly, access to a continuous flow of data will be more valuable than access to a fixed stock of data. In this kind of setting, improving user engagement and increasing user-base help creating and maintaining a competitive advantage.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes