CYAICLApr 28, 2024

Leveraging Prompts in LLMs to Overcome Imbalances in Complex Educational Text Data

arXiv:2407.01551v14 citationsh-index: 4
Originality Incremental advance
AI Analysis

It addresses imbalances in educational datasets for improved cognitive engagement classification, but it is incremental as it builds on existing prompt engineering methods.

The paper tackled the problem of data imbalances in educational text by using LLMs with assertions, resulting in up to a 32% F1-score increase over traditional models and an 11.94% improvement in a sensitivity analysis.

In this paper, we explore the potential of Large Language Models (LLMs) with assertions to mitigate imbalances in educational datasets. Traditional models often fall short in such contexts, particularly due to the complexity and nuanced nature of the data. This issue is especially prominent in the education sector, where cognitive engagement levels among students show significant variation in their open responses. To test our hypothesis, we utilized an existing technology for assertion-based prompt engineering through an 'Iterative - ICL PE Design Process' comparing traditional Machine Learning (ML) models against LLMs augmented with assertions (N=135). Further, we conduct a sensitivity analysis on a subset (n=27), examining the variance in model performance concerning classification metrics and cognitive engagement levels in each iteration. Our findings reveal that LLMs with assertions significantly outperform traditional ML models, particularly in cognitive engagement levels with minority representation, registering up to a 32% increase in F1-score. Additionally, our sensitivity study indicates that incorporating targeted assertions into the LLM tested on the subset enhances its performance by 11.94%. This improvement primarily addresses errors stemming from the model's limitations in understanding context and resolving lexical ambiguities in student responses.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes