SEAIJan 29, 2024

Green Runner: A tool for efficient deep learning component selection

arXiv:2401.15810v12 citationsh-index: 52024 IEEE/ACM 3rd International Conference on AI Engineering – Software Engineering for AI (CAIN)
Originality Incremental advance
AI Analysis

This work addresses the challenge of reducing wasted compute and environmental impact for developers selecting models, though it appears incremental as it builds on existing LLM capabilities for a specific tool.

The paper tackles the problem of inefficient model selection in machine learning by introducing Green Runner, a tool that uses large language models to automatically select and evaluate models based on natural language descriptions, demonstrating efficiency and accuracy compared to ad-hoc or brute-force methods.

For software that relies on machine-learned functionality, model selection is key to finding the right model for the task with desired performance characteristics. Evaluating a model requires developers to i) select from many models (e.g. the Hugging face model repository), ii) select evaluation metrics and training strategy, and iii) tailor trade-offs based on the problem domain. However, current evaluation approaches are either ad-hoc resulting in sub-optimal model selection or brute force leading to wasted compute. In this work, we present \toolname, a novel tool to automatically select and evaluate models based on the application scenario provided in natural language. We leverage the reasoning capabilities of large language models to propose a training strategy and extract desired trade-offs from a problem description. \toolname~features a resource-efficient experimentation engine that integrates constraints and trade-offs based on the problem into the model selection process. Our preliminary evaluation demonstrates that \toolname{} is both efficient and accurate compared to ad-hoc evaluations and brute force. This work presents an important step toward energy-efficient tools to help reduce the environmental impact caused by the growing demand for software with machine-learned functionality.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes