SEApr 6

OpenCoderRank: Personalized Technical Assessments with Generative AI

arXiv:2509.067748.5h-index: 7
Predicted impact top 69% in SE · last 90 daysOriginality Synthesis-oriented
AI Analysis

This addresses the problem for organizations and educational institutions needing secure, personalized technical assessments, but it is incremental as it builds on existing assessment tools with model-agnostic features.

The paper tackles the challenge of maintaining assessment integrity in coding evaluations when LLMs can provide easy solutions, by introducing OpenCoderRank, a lightweight, self-hosted platform for creating and grading time-constrained technical assessments, with evaluation using BERTScore and LLM methods showing it supports customizable assessments in resource-constrained settings.

Organizations and educational institutions use time-bound assessment tasks to evaluate coding and problem-solving skills. These assessments measure not only the correctness of the solutions, but also their efficiency. Problem setters (educator/interviewer) are responsible for crafting these challenges, carefully balancing difficulty and relevance to create meaningful evaluation experiences. Conversely, problem solvers (student/interviewee) apply critical and logical thinking to arrive at correct solutions. In the era of Large Language Models (LLMs), LLMs assist problem setters in generating diverse and challenging questions, but they can undermine assessment integrity for problem solvers by providing easy access to solutions. We introduce OpenCoderRank, a lightweight, self-hosted platform that emulates real-world timed technical assessments in resource-constrained environments. OpenCoderRank is intentionally model-agnostic: it facilitates the creation, deployment and automatic grading of problems while offering fine-grained control over time limits, input-output pairs and execution constraints. OpenCoderRank is evaluated using two methods: 1. BERTScore, 2. LLM evaluation. Our findings indicate that OpenCoderRank connects problem setters and solvers by supporting time-constrained preparation and self-hosted, customizable assessments in resource-constrained settings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes