LGDec 25, 2017

Strongly Hierarchical Factorization Machines and ANOVA Kernel Regression

arXiv:1712.09133v21 citations
Originality Incremental advance
AI Analysis

This work solves problems in data mining tasks like user response and stock prediction by improving model efficiency and structure, though it appears incremental as it builds on existing factorization and regularization techniques.

The paper tackles the issues of expensive computation, difficult parameter estimation, and lack of structure in high-order parametric models for sparse data by proposing Strongly Hierarchical Factorization Machines and ANOVA kernel regression, which address all three problems without complicating optimization. Experimental results show significant outperformance over state-of-the-art methods in cold-start user response time prediction and stock volatility prediction.

High-order parametric models that include terms for feature interactions are applied to various data mining tasks, where ground truth depends on interactions of features. However, with sparse data, the high- dimensional parameters for feature interactions often face three issues: expensive computation, difficulty in parameter estimation and lack of structure. Previous work has proposed approaches which can partially re- solve the three issues. In particular, models with factorized parameters (e.g. Factorization Machines) and sparse learning algorithms (e.g. FTRL-Proximal) can tackle the first two issues but fail to address the third. Regarding to unstructured parameters, constraints or complicated regularization terms are applied such that hierarchical structures can be imposed. However, these methods make the optimization problem more challenging. In this work, we propose Strongly Hierarchical Factorization Machines and ANOVA kernel regression where all the three issues can be addressed without making the optimization problem more difficult. Experimental results show the proposed models significantly outperform the state-of-the-art in two data mining tasks: cold-start user response time prediction and stock volatility prediction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes