Strongly hierarchical factorization machines and ANOVA kernel regression

Ruocheng Guo; Hamidreza Alvari; Paulo Shakarian

doi:10.1137/1.9781611975321.82

Strongly hierarchical factorization machines and ANOVA kernel regression

Ruocheng Guo, Hamidreza Alvari, Paulo Shakarian

Research output: Contribution to conference › Paper › peer-review

2 Scopus citations

Abstract

High-order parametric models that include terms for feature interactions are applied to various data mining tasks, where ground truth depends on interactions of features. However, with sparse data, the high-dimensional parameters for feature interactions often face three issues: expensive computation, difficulty in parameter estimation and lack of structure. Previous work has proposed approaches which can partially resolve the three issues. In particular, models with fac-torized parameters (e.g. Factorization Machines) and sparse learning algorithms (e.g. FTRL-Proximal) can tackle the first two issues but fail to address the third. Regarding to unstructured parameters, constraints or complicated regularization terms are applied such that hierarchical structures can be imposed. However, these methods make the optimization problem more challenging. In this work, we propose Strongly Hierarchical Factorization Machines and ANOVA kernel regression where all the three issues can be addressed without making the optimization problem more difficult. Experimental results show the proposed models significantly outperform the state-of-the-art in two data mining tasks: cold-start user response time prediction and stock volatility prediction.

Original language	English (US)
Pages	729-737
Number of pages	9
DOIs	https://doi.org/10.1137/1.9781611975321.82
State	Published - 2018
Event	2018 SIAM International Conference on Data Mining, SDM 2018 - San Diego, United States Duration: May 3 2018 → May 5 2018

Other

Other	2018 SIAM International Conference on Data Mining, SDM 2018
Country/Territory	United States
City	San Diego
Period	5/3/18 → 5/5/18

ASJC Scopus subject areas

Computer Science Applications
Software

Access to Document

10.1137/1.9781611975321.82

Cite this

@conference{e07bb8ff22fc402f8c3e64e4cfc01e04,

title = "Strongly hierarchical factorization machines and ANOVA kernel regression",

abstract = "High-order parametric models that include terms for feature interactions are applied to various data mining tasks, where ground truth depends on interactions of features. However, with sparse data, the high-dimensional parameters for feature interactions often face three issues: expensive computation, difficulty in parameter estimation and lack of structure. Previous work has proposed approaches which can partially resolve the three issues. In particular, models with fac-torized parameters (e.g. Factorization Machines) and sparse learning algorithms (e.g. FTRL-Proximal) can tackle the first two issues but fail to address the third. Regarding to unstructured parameters, constraints or complicated regularization terms are applied such that hierarchical structures can be imposed. However, these methods make the optimization problem more challenging. In this work, we propose Strongly Hierarchical Factorization Machines and ANOVA kernel regression where all the three issues can be addressed without making the optimization problem more difficult. Experimental results show the proposed models significantly outperform the state-of-the-art in two data mining tasks: cold-start user response time prediction and stock volatility prediction.",

author = "Ruocheng Guo and Hamidreza Alvari and Paulo Shakarian",

note = "Publisher Copyright: {\textcopyright} 2018 by SIAM.; 2018 SIAM International Conference on Data Mining, SDM 2018 ; Conference date: 03-05-2018 Through 05-05-2018",

year = "2018",

doi = "10.1137/1.9781611975321.82",

language = "English (US)",

pages = "729--737",

}

TY - CONF

T1 - Strongly hierarchical factorization machines and ANOVA kernel regression

AU - Guo, Ruocheng

AU - Alvari, Hamidreza

AU - Shakarian, Paulo

PY - 2018

Y1 - 2018

N2 - High-order parametric models that include terms for feature interactions are applied to various data mining tasks, where ground truth depends on interactions of features. However, with sparse data, the high-dimensional parameters for feature interactions often face three issues: expensive computation, difficulty in parameter estimation and lack of structure. Previous work has proposed approaches which can partially resolve the three issues. In particular, models with fac-torized parameters (e.g. Factorization Machines) and sparse learning algorithms (e.g. FTRL-Proximal) can tackle the first two issues but fail to address the third. Regarding to unstructured parameters, constraints or complicated regularization terms are applied such that hierarchical structures can be imposed. However, these methods make the optimization problem more challenging. In this work, we propose Strongly Hierarchical Factorization Machines and ANOVA kernel regression where all the three issues can be addressed without making the optimization problem more difficult. Experimental results show the proposed models significantly outperform the state-of-the-art in two data mining tasks: cold-start user response time prediction and stock volatility prediction.

AB - High-order parametric models that include terms for feature interactions are applied to various data mining tasks, where ground truth depends on interactions of features. However, with sparse data, the high-dimensional parameters for feature interactions often face three issues: expensive computation, difficulty in parameter estimation and lack of structure. Previous work has proposed approaches which can partially resolve the three issues. In particular, models with fac-torized parameters (e.g. Factorization Machines) and sparse learning algorithms (e.g. FTRL-Proximal) can tackle the first two issues but fail to address the third. Regarding to unstructured parameters, constraints or complicated regularization terms are applied such that hierarchical structures can be imposed. However, these methods make the optimization problem more challenging. In this work, we propose Strongly Hierarchical Factorization Machines and ANOVA kernel regression where all the three issues can be addressed without making the optimization problem more difficult. Experimental results show the proposed models significantly outperform the state-of-the-art in two data mining tasks: cold-start user response time prediction and stock volatility prediction.

UR - http://www.scopus.com/inward/record.url?scp=85048339148&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85048339148&partnerID=8YFLogxK

U2 - 10.1137/1.9781611975321.82

DO - 10.1137/1.9781611975321.82

M3 - Paper

AN - SCOPUS:85048339148

SP - 729

EP - 737

T2 - 2018 SIAM International Conference on Data Mining, SDM 2018

Y2 - 3 May 2018 through 5 May 2018

ER -

Strongly hierarchical factorization machines and ANOVA kernel regression

Abstract

Other

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this