CVAIHCFeb 27, 2025

LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces

arXiv:2503.01894v213 citationsh-index: 7ICML
Originality Synthesis-oriented
AI Analysis

This addresses the challenge of incorporating pluralistic, stakeholder-driven values into AI models for spatial design, though it is incremental in applying existing alignment methods to a new domain-specific dataset.

The researchers tackled the problem of aligning text-to-image models with diverse community preferences for inclusive urban planning by creating the LIVS dataset through participatory methods, and found that fine-tuning with DPO improved alignment under certain conditions while revealing significant variations in preferences across different identity groups.

We introduce the Local Intersectional Visual Spaces (LIVS) dataset, a benchmark for multi-criteria alignment, developed through a two-year participatory process with 30 community organizations to support the pluralistic alignment of text-to-image (T2I) models in inclusive urban planning. The dataset encodes 37,710 pairwise comparisons across 13,462 images, structured along six criteria - Accessibility, Safety, Comfort, Invitingness, Inclusivity, and Diversity - derived from 634 community-defined concepts. Using Direct Preference Optimization (DPO), we fine-tune Stable Diffusion XL to reflect multi-criteria spatial preferences and evaluate the LIVS dataset and the fine-tuned model through four case studies: (1) DPO increases alignment with annotated preferences, particularly when annotation volume is high; (2) preference patterns vary across participant identities, underscoring the need for intersectional data; (3) human-authored prompts generate more distinctive visual outputs than LLM-generated ones, influencing annotation decisiveness; and (4) intersectional groups assign systematically different ratings across criteria, revealing the limitations of single-objective alignment. While DPO improves alignment under specific conditions, the prevalence of neutral ratings indicates that community values are heterogeneous and often ambiguous. LIVS provides a benchmark for developing T2I models that incorporate local, stakeholder-driven preferences, offering a foundation for context-aware alignment in spatial design.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes