nTD: Noise-profile adaptive tensor decomposition

Xinsheng Li; Kasim Candan; Maria Luisa Sapino

doi:10.1145/3038912.3052641

nTD: Noise-profile adaptive tensor decomposition

Xinsheng Li, Kasim Candan, Maria Luisa Sapino

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Scopus citations

Abstract

Tensor decomposition is used for many web and user data analysis operations from clustering, trend detection, anomaly detection, to correlation analysis. However, many of the tensor decomposition schemes are sensitive to noisy data, an inevitable problem in the real world that can lead to false conclusions. The problem is compounded by overfitting when the user data is sparse. Recent research has shown that it is possible to avoid over-fitting by relying on probabilistic techniques. However, these have two major deficiencies: (a) firstly, they assume that all the data and intermediary results can fit in the main memory, and (b) they treat the entire tensor uniformly, ignoring potential non-uniformities in the noise distribution. In this paper, we propose a Noise-Profile Adaptive Tensor Decomposition (nTD) method, which aims to tackle both of these challenges. In particular, nTD leverages a grid-based two-phase decomposition strategy for two complementary purposes: firstly, the grid partitioning helps ensure that the memory footprint of the decomposition is kept low; secondly (and perhaps more importantly) any a priori knowledge about the noise profiles of the grid partitions enable us to develop a sample assignment strategy (or s-strategy) that best suits the noise distribution of the given tensor. Experiments show that nTD’s performance is significantly better than conventional CP decomposition techniques on noisy user data tensors.

Original language	English (US)
Title of host publication	26th International World Wide Web Conference, WWW 2017
Publisher	International World Wide Web Conferences Steering Committee
Pages	243-252
Number of pages	10
ISBN (Print)	9781450349130
DOIs	https://doi.org/10.1145/3038912.3052641
State	Published - 2017
Event	26th International World Wide Web Conference, WWW 2017 - Perth, Australia Duration: Apr 3 2017 → Apr 7 2017

Publication series

Name	26th International World Wide Web Conference, WWW 2017

Other

Other	26th International World Wide Web Conference, WWW 2017
Country/Territory	Australia
City	Perth
Period	4/3/17 → 4/7/17

ASJC Scopus subject areas

Software
Computer Networks and Communications

Access to Document

10.1145/3038912.3052641

Cite this

nTD: Noise-profile adaptive tensor decomposition. / Li, Xinsheng; Candan, Kasim; Sapino, Maria Luisa.
26th International World Wide Web Conference, WWW 2017. International World Wide Web Conferences Steering Committee, 2017. p. 243-252 3052641 (26th International World Wide Web Conference, WWW 2017).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Li, X, Candan, K & Sapino, ML 2017, nTD: Noise-profile adaptive tensor decomposition. in 26th International World Wide Web Conference, WWW 2017., 3052641, 26th International World Wide Web Conference, WWW 2017, International World Wide Web Conferences Steering Committee, pp. 243-252, 26th International World Wide Web Conference, WWW 2017, Perth, Australia, 4/3/17. https://doi.org/10.1145/3038912.3052641

@inproceedings{6dceef28b8fc40cba499094eee148068,

title = "nTD: Noise-profile adaptive tensor decomposition",

abstract = "Tensor decomposition is used for many web and user data analysis operations from clustering, trend detection, anomaly detection, to correlation analysis. However, many of the tensor decomposition schemes are sensitive to noisy data, an inevitable problem in the real world that can lead to false conclusions. The problem is compounded by overfitting when the user data is sparse. Recent research has shown that it is possible to avoid over-fitting by relying on probabilistic techniques. However, these have two major deficiencies: (a) firstly, they assume that all the data and intermediary results can fit in the main memory, and (b) they treat the entire tensor uniformly, ignoring potential non-uniformities in the noise distribution. In this paper, we propose a Noise-Profile Adaptive Tensor Decomposition (nTD) method, which aims to tackle both of these challenges. In particular, nTD leverages a grid-based two-phase decomposition strategy for two complementary purposes: firstly, the grid partitioning helps ensure that the memory footprint of the decomposition is kept low; secondly (and perhaps more importantly) any a priori knowledge about the noise profiles of the grid partitions enable us to develop a sample assignment strategy (or s-strategy) that best suits the noise distribution of the given tensor. Experiments show that nTD{\textquoteright}s performance is significantly better than conventional CP decomposition techniques on noisy user data tensors.",

author = "Xinsheng Li and Kasim Candan and Sapino, {Maria Luisa}",

note = "Publisher Copyright: {\textcopyright} 2017 International World Wide Web Conference Committee (IW3C2).; 26th International World Wide Web Conference, WWW 2017 ; Conference date: 03-04-2017 Through 07-04-2017",

year = "2017",

doi = "10.1145/3038912.3052641",

language = "English (US)",

isbn = "9781450349130",

series = "26th International World Wide Web Conference, WWW 2017",

publisher = "International World Wide Web Conferences Steering Committee",

pages = "243--252",

booktitle = "26th International World Wide Web Conference, WWW 2017",

}

TY - GEN

T1 - nTD

T2 - 26th International World Wide Web Conference, WWW 2017

AU - Li, Xinsheng

AU - Candan, Kasim

AU - Sapino, Maria Luisa

PY - 2017

Y1 - 2017

N2 - Tensor decomposition is used for many web and user data analysis operations from clustering, trend detection, anomaly detection, to correlation analysis. However, many of the tensor decomposition schemes are sensitive to noisy data, an inevitable problem in the real world that can lead to false conclusions. The problem is compounded by overfitting when the user data is sparse. Recent research has shown that it is possible to avoid over-fitting by relying on probabilistic techniques. However, these have two major deficiencies: (a) firstly, they assume that all the data and intermediary results can fit in the main memory, and (b) they treat the entire tensor uniformly, ignoring potential non-uniformities in the noise distribution. In this paper, we propose a Noise-Profile Adaptive Tensor Decomposition (nTD) method, which aims to tackle both of these challenges. In particular, nTD leverages a grid-based two-phase decomposition strategy for two complementary purposes: firstly, the grid partitioning helps ensure that the memory footprint of the decomposition is kept low; secondly (and perhaps more importantly) any a priori knowledge about the noise profiles of the grid partitions enable us to develop a sample assignment strategy (or s-strategy) that best suits the noise distribution of the given tensor. Experiments show that nTD’s performance is significantly better than conventional CP decomposition techniques on noisy user data tensors.

AB - Tensor decomposition is used for many web and user data analysis operations from clustering, trend detection, anomaly detection, to correlation analysis. However, many of the tensor decomposition schemes are sensitive to noisy data, an inevitable problem in the real world that can lead to false conclusions. The problem is compounded by overfitting when the user data is sparse. Recent research has shown that it is possible to avoid over-fitting by relying on probabilistic techniques. However, these have two major deficiencies: (a) firstly, they assume that all the data and intermediary results can fit in the main memory, and (b) they treat the entire tensor uniformly, ignoring potential non-uniformities in the noise distribution. In this paper, we propose a Noise-Profile Adaptive Tensor Decomposition (nTD) method, which aims to tackle both of these challenges. In particular, nTD leverages a grid-based two-phase decomposition strategy for two complementary purposes: firstly, the grid partitioning helps ensure that the memory footprint of the decomposition is kept low; secondly (and perhaps more importantly) any a priori knowledge about the noise profiles of the grid partitions enable us to develop a sample assignment strategy (or s-strategy) that best suits the noise distribution of the given tensor. Experiments show that nTD’s performance is significantly better than conventional CP decomposition techniques on noisy user data tensors.

UR - http://www.scopus.com/inward/record.url?scp=85051512640&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85051512640&partnerID=8YFLogxK

U2 - 10.1145/3038912.3052641

DO - 10.1145/3038912.3052641

M3 - Conference contribution

AN - SCOPUS:85051512640

SN - 9781450349130

T3 - 26th International World Wide Web Conference, WWW 2017

SP - 243

EP - 252

BT - 26th International World Wide Web Conference, WWW 2017

PB - International World Wide Web Conferences Steering Committee

Y2 - 3 April 2017 through 7 April 2017

ER -

nTD: Noise-profile adaptive tensor decomposition

Abstract

Publication series

Other

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this