CLApr 1, 2025

RECKON: Large-scale Reference-based Efficient Knowledge Evaluation for Large Language Model

Lin Zhang, Zhouhong Gu, Xiaoran Shi, Hongwei Feng, Yanghua Xiao

arXiv:2504.00756v12.7h-index: 10Has Code

Originality Incremental advance

AI Analysis

This addresses the high resource costs and information loss in traditional benchmark-based evaluation for LLMs, offering a more efficient solution for researchers and practitioners.

The paper tackles the problem of efficiently evaluating knowledge in large language models by proposing RECKON, a method that uses reference data to reduce resource consumption by 56.5% while achieving over 97% accuracy across multiple domains.

As large language models (LLMs) advance, efficient knowledge evaluation becomes crucial to verifying their capabilities. Traditional methods, relying on benchmarks, face limitations such as high resource costs and information loss. We propose the Large-scale Reference-based Efficient Knowledge Evaluation for Large Language Model (RECKON), which directly uses reference data to evaluate models. RECKON organizes unstructured data into manageable units and generates targeted questions for each cluster, improving evaluation accuracy and efficiency. Experimental results show that RECKON reduces resource consumption by 56.5% compared to traditional methods while achieving over 97% accuracy across various domains, including world knowledge, code, legal, and biomedical datasets. Code is available at https://github.com/MikeGu721/reckon

View on arXiv PDF Code

Similar