OpenJAI-v1.0: An Open Thai Large Language Model
This provides an alternative NLP resource for the Thai AI community, but it is incremental as it builds on an existing model with curated data.
The researchers tackled the problem of developing an open-source large language model for Thai and English by introducing OpenJAI-v1.0, which improves on its base model and outperforms other leading open-source Thai models on benchmarks while avoiding catastrophic forgetting.
We introduce OpenJAI-v1.0, an open-source large language model for Thai and English, developed from the Qwen3-14B model. Our work focuses on boosting performance on practical tasks through carefully curated data across three key use cases: instruction following, long-context understanding, and tool use. Evaluation results show that OpenJAI-v1.0 improves on the capabilities of its base model and outperforms other leading open-source Thai models on a diverse suite of benchmarks, while avoiding catastrophic forgetting. OpenJAI-v1.0 is publicly released as another alternative NLP resource for the Thai AI community.