CL AI CYJul 18, 2025

Using LLMs to identify features of personal and professional skills in an open-response situational judgment test

Cole Walsh, Rodica Ivan, Muhammad Zafar Iqbal, Colleen Robb

arXiv:2507.13881v18.32 citationsh-index: 14

Originality Incremental advance

AI Analysis

This addresses the need for scalable assessment systems in academic programs, though it appears incremental as it builds on past NLP attempts with a new method.

The researchers tackled the challenge of scaling situational judgment tests (SJTs) for personal and professional skills by using large language models (LLMs) to extract construct-relevant features from open-response SJTs, demonstrating efficacy on the Casper SJT.

Academic programs are increasingly recognizing the importance of personal and professional skills and their critical role alongside technical expertise in preparing students for future success in diverse career paths. With this growing demand comes the need for scalable systems to measure, evaluate, and develop these skills. Situational Judgment Tests (SJTs) offer one potential avenue for measuring these skills in a standardized and reliable way, but open-response SJTs have traditionally relied on trained human raters for evaluation, presenting operational challenges to delivering SJTs at scale. Past attempts at developing NLP-based scoring systems for SJTs have fallen short due to issues with construct validity of these systems. In this article, we explore a novel approach to extracting construct-relevant features from SJT responses using large language models (LLMs). We use the Casper SJT to demonstrate the efficacy of this approach. This study sets the foundation for future developments in automated scoring for personal and professional skills.

View on arXiv PDF

Similar