MLCYLGMay 5

Heterogeneous Ordinal Structure Learning with Bayesian Nonparametric Complexity Discovery

arXiv:2605.0419124.1h-index: 4
Predicted impact top 64% in ML · last 90 daysOriginality Incremental advance
AI Analysis

For researchers analyzing ordinal survey data with heterogeneous dependency structures, this work provides a principled method to discover and estimate cluster-specific DAGs, improving predictive accuracy and interpretability.

The paper introduces a heterogeneous ordinal structure-learning framework that combines Bayesian nonparametric complexity discovery with confirmatory cluster-specific DAG estimation. On the 2024 Pew American Trends Panel survey, the method reduces holdout MSE by 25.8% over a single-graph baseline and by 4.6% over mixture-only clustering.

Public attitudes toward artificial intelligence are heterogeneous, ordinally measured, and poorly captured by any single dependency graph. Existing ordinal structure learners assume a shared directed acyclic graph (DAG) across all respondents; recent heterogeneous ordinal graphical-model approaches focus on subgroup discovery rather than confirmatory cluster-specific DAG estimation; and latent profile analyses discard dependency structure entirely. We introduce a heterogeneous ordinal structure-learning framework combining monotone Gaussian score embedding, Bayesian nonparametric (BNP) complexity discovery via a truncated stick-breaking prior, and confirmatory fixed-K estimation with cluster-specific sparse DAG learning. The key methodological insight is a discovery-to-confirmation workflow: the nonparametric stage calibrates plausible archetype complexity, while inner-validated confirmatory refitting yields stable, interpretable structural estimates. On the 2024 Pew American Trends Panel AI attitudes survey, Wave 152 (W152) survey, (N = 4,788, 8 ordinal items), the confirmatory K*=5 model reduces holdout transformed-score mean squared error (MSE) by 25.8% over a single-graph baseline and by 4.6% over mixture-only clustering. A controlled tiered semi-synthetic benchmark calibrated to W152 structure validates recovery across difficulty regimes and transparently reveals failure modes under stress conditions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes