CVAIAug 25, 2025

Seeing Like a Designer Without One: A Study on Unsupervised Slide Quality Assessment via Designer Cue Augmentation

arXiv:2508.19289v1
Originality Incremental advance
AI Analysis

This provides scalable, objective feedback on slide quality for educators and presenters, though it is incremental as it builds on existing design metrics and multimodal embeddings.

The paper tackled the problem of assessing presentation slide quality without human designers by developing an unsupervised pipeline that combines expert-inspired visual-design metrics with CLIP-ViT embeddings, achieving Pearson correlations up to 0.83 with human ratings, which is 1.79x to 3.23x stronger than leading vision-language models.

We present an unsupervised slide-quality assessment pipeline that combines seven expert-inspired visual-design metrics (whitespace, colorfulness, edge density, brightness contrast, text density, color harmony, layout balance) with CLIP-ViT embeddings, using Isolation Forest-based anomaly scoring to evaluate presentation slides. Trained on 12k professional lecture slides and evaluated on six academic talks (115 slides), our method achieved Pearson correlations up to 0.83 with human visual-quality ratings-1.79x to 3.23x stronger than scores from leading vision-language models (ChatGPT o4-mini-high, ChatGPT o3, Claude Sonnet 4, Gemini 2.5 Pro). We demonstrate convergent validity with visual ratings, discriminant validity against speaker-delivery scores, and exploratory alignment with overall impressions. Our results show that augmenting low-level design cues with multimodal embeddings closely approximates audience perceptions of slide quality, enabling scalable, objective feedback in real time.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes