CLAIMar 28

Listen, Correct, and Feed Back: Spoken Pedagogical Feedback Generation

arXiv:2604.1417792.6h-index: 2Has Code
Predicted impact top 22% in CL · last 90 daysOriginality Incremental advance
AI Analysis

For researchers in educational NLP, this work provides a new dataset and baseline for generating learner-friendly spoken feedback, but the gains from preference learning are incremental.

The paper introduces SPFG, a dataset for spoken pedagogical feedback generation, and evaluates LLMs with SFT and preference-based alignment, finding SFT yields consistent improvements while DPO/KTO show mixed results.

Grammatical error correction (GEC) and explanation (GEE) have made rapid progress, but real teaching scenarios also require \emph{learner-friendly pedagogical feedback} that is actionable, level-appropriate, and encouraging. We introduce \textbf{SPFG} (\textbf{S}poken \textbf{P}edagogical \textbf{F}eedback \textbf{G}eneration), a dataset built based on the Speak \& Improve Challenge 2025 corpus, pairing fluency-oriented transcriptions with GEC targets and \emph{human-verified} teacher-style feedback, including preferred/rejected feedback pairs for preference learning. We study a transcript-based Spoken Grammatical Error Correction (SGEC) setting and evaluate three instruction-tuned LLMs (Qwen2.5, Llama-3.1, and GLM-4), comparing supervised fine-tuning (SFT) with preference-based alignment (using DPO and KTO) for jointly generating corrections and feedback. Results show that SFT provides the most consistent improvements, while DPO/KTO yield smaller or mixed gains, and that correction quality and feedback quality are weakly coupled. Our implementation is available at https://github.com/Skywalker-Harrison/spfg.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes