AIMar 28, 2025

Agent-Centric Personalized Multiple Clustering with Multi-Modal LLMs

arXiv:2503.22241v3h-index: 9
Originality Incremental advance
AI Analysis

It addresses the need for more accurate and user-aligned clustering in applications like recommendation systems, though it is incremental by building on existing personalized clustering methods.

The paper tackles the problem of generating diverse dataset partitions based on user-specific aspects in personalized multiple clustering, achieving NMI scores of 0.9667 and 0.9481 on benchmarks, which improves the state-of-the-art by over 140%.

Personalized multiple clustering aims to generate diverse partitions of a dataset based on different user-specific aspects, rather than a single clustering. It has recently drawn research interest for accommodating varying user preferences. Recent approaches primarily use CLIP embeddings with proxy learning to extract representations biased toward user clustering preferences. However, CLIP primarily focuses on coarse image-text alignment, lacking a deep contextual understanding of user interests. To overcome these limitations, we propose an agent-centric personalized clustering framework that leverages multi-modal large language models (MLLMs) as agents to comprehensively traverse a relational graph to search for clusters based on user interests. Due to the advanced reasoning mechanism of MLLMs, the obtained clusters align more closely with user-defined criteria than those obtained from CLIP-based representations. To reduce computational overhead, we shorten the agents' traversal path by constructing a relational graph using user-interest-biased embeddings extracted by MLLMs. A large number of weakly connected edges can be filtered out based on embedding similarity, facilitating an efficient traversal search for agents. Experimental results show that the proposed method achieves NMI scores of 0.9667 and 0.9481 on the Card Order and Card Suits benchmarks, respectively, largely improving the SOTA model by over 140%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes