LGGTOct 27, 2023

Optimal Pricing for Data-Augmented AutoML Marketplaces

arXiv:2310.17843v21 citationsh-index: 19
AI Analysis

This addresses the challenge of data scarcity and monetization for organizations using ML, though it is incremental as it builds on existing AutoML platforms.

The paper tackles the problem of underutilized data in ML by proposing a data-augmented AutoML marketplace that prices models based on performance improvements from external data, rather than computational costs, to enhance ML outcomes and monetize data sustainably.

Organizations often lack sufficient data to effectively train machine learning (ML) models, while others possess valuable data that remains underutilized. Data markets promise to unlock substantial value by matching data suppliers with demand from ML consumers. However, market design involves addressing intricate challenges, including data pricing, fairness, robustness, and strategic behavior. In this paper, we propose a pragmatic data-augmented AutoML market that seamlessly integrates with existing cloud-based AutoML platforms such as Google's Vertex AI and Amazon's SageMaker. Unlike standard AutoML solutions, our design automatically augments buyer-submitted training data with valuable external datasets, pricing the resulting models based on their measurable performance improvements rather than computational costs as the status quo. Our key innovation is a pricing mechanism grounded in the instrumental value - the marginal model quality improvement - of externally sourced data. This approach bypasses direct dataset pricing complexities, mitigates strategic buyer behavior, and accommodates diverse buyer valuations through menu-based options. By integrating automated data and model discovery, our solution not only enhances ML outcomes but also establishes an economically sustainable framework for monetizing external data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes