GenAIOps for GenAI Model-Agility
This work addresses the need for model-agility in GenAI applications for organizations, but it appears incremental as it builds on existing prompt tuning technologies.
The paper tackles the problem of application quality degradation caused by changes in underlying foundation models for generative AI, proposing GenAIOps as a methodology to address this through prompt tuning, with effectiveness and limitations discussed via case studies.
AI-agility, with which an organization can be quickly adapted to its business priorities, is desired even for the development and operations of generative AI (GenAI) applications. Especially in this paper, we discuss so-called GenAI Model-agility, which we define as the readiness to be flexibly adapted to base foundation models as diverse as the model providers and versions. First, for handling issues specific to generative AI, we first define a methodology of GenAI application development and operations, as GenAIOps, to identify the problem of application quality degradation caused by changes to the underlying foundation models. We study prompt tuning technologies, which look promising to address this problem, and discuss their effectiveness and limitations through case studies using existing tools.