CL AIOct 17, 2023

H2O Open Ecosystem for State-of-the-art Large Language Models

Arno Candel, Jon McKinney, Philipp Singer, Pascal Pfeiffer, Maximilian Jeblick, Chun Ming Lee, Marcos V. Conde

arXiv:2310.13012v221.2132 citationsh-index: 21Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the need for open, accessible, and trustworthy AI development, particularly for researchers and developers working with LLMs, though it is incremental as it builds on existing fine-tuning and deployment techniques.

The authors tackled the risks of biased, private, copyrighted, or harmful text in large language models by introducing an open-source ecosystem, including h2oGPT models and H2O LLM Studio, to provide transparent and safe alternatives to closed-source approaches.

Large Language Models (LLMs) represent a revolution in AI. However, they also pose many significant risks, such as the presence of biased, private, copyrighted or harmful text. For this reason we need open, transparent and safe solutions. We introduce a complete open-source ecosystem for developing and testing LLMs. The goal of this project is to boost open alternatives to closed-source approaches. We release h2oGPT, a family of fine-tuned LLMs of diverse sizes. We also introduce H2O LLM Studio, a framework and no-code GUI designed for efficient fine-tuning, evaluation, and deployment of LLMs using the most recent state-of-the-art techniques. Our code and models are fully open-source. We believe this work helps to boost AI development and make it more accessible, efficient and trustworthy. The demo is available at: https://gpt.h2o.ai/

View on arXiv PDF Code

Similar