IR CVMay 11

Loom: Hybrid Retrieval-Scoring Outfit Recommendation with Semantic Material Compatibility and Occasion-Aware Embedding Priors

arXiv:2605.098304.1

Predicted impact top 99% in IR · last 90 daysOriginality Incremental advance

AI Analysis

For fashion e-commerce, Loom provides a practical hybrid system that outperforms purely learned or rule-based approaches, though the improvement is demonstrated on a small catalog (620 items) and may not generalize to larger, more diverse datasets.

Loom combines neural embedding retrieval with structured domain scoring to generate coherent outfits from fashion catalogs, achieving a 3.3x improvement in mean outfit score (0.179 vs 0.054) and a 42% reduction in hard violations (9.3% vs 16.0%) over a category-constrained random baseline.

We present Loom, an outfit recommendation system that combines neural embedding retrieval with structured domain scoring to generate complete, coherent outfits from fashion catalogs. Given an anchor clothing item, Loom retrieves complementary pieces via slot-constrained approximate nearest neighbor search over FashionCLIP embeddings, then scores candidate outfits using a multi-objective function that integrates six signals: embedding similarity, color harmony, formality consistency, occasion coherence, style direction, and within-outfit diversity. We introduce two techniques that address limitations of purely learned or purely rule-based approaches: (1) semantic material weight, which uses CLIP embedding geometry to infer garment heaviness for layer compatibility without hand-coded material taxonomies; and (2) vibe/anti-vibe occasion priors, which embed prose descriptions of occasion contexts as anchor vectors in CLIP space and score items by differential affinity. Ablation experiments on a catalog of 620 items show that each component contributes measurably to outfit quality: the full system achieves a mean outfit score of 0.179 with a 9.3% hard violation rate, compared to 0.054 score and 16.0% violations for a category-constrained random baseline, a 3.3x improvement in score and 42% reduction in violations. Direction reranking is the single indispensable component: removing it drops score to 0.052, essentially equal to random. The system generates three stylistically distinct outfits in under 5 seconds on commodity hardware.

View on arXiv PDF

Similar