LGAIMay 24, 2025

PromptWise: Online Learning for Cost-Aware Prompt Assignment in Generative Models

arXiv:2505.18901v25 citationsh-index: 15Has Code
Originality Incremental advance
AI Analysis

This addresses the cost-efficiency challenge for users of generative AI models, though it is incremental by building on existing bandit methods.

The paper tackles the problem of selecting generative models for prompts by considering both performance and cost, introducing PromptWise, an online learning framework that reduces costs while maintaining comparable performance, as shown in experiments on tasks like code generation and translation.

The rapid advancement of generative AI has provided users with a wide range of well-trained models to address diverse prompts. When selecting a model for a given prompt, users should weigh not only its performance but also its service cost. However, existing model-selection methods typically emphasize performance while overlooking cost differences. In this paper, we introduce PromptWise, an online learning framework that assigns prompts to generative models in a cost-aware manner. PromptWise estimates prompt-model compatibility to select the least expensive model expected to deliver satisfactory outputs. Unlike standard contextual bandits that make a one-shot decision per prompt, PromptWise employs a cost-aware bandit structure that allows sequential model assignments per prompt to reduce total service cost. Through numerical experiments on tasks such as code generation and translation, we demonstrate that PromptWise can achieve performance comparable to baseline selection methods while incurring substantially lower costs. The code is available at: github.com/yannxiaoyanhu/PromptWise.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes