Matrix Factorization with Interval-Valued Data

Mao Lin Li; Francesco Di Mauro; K. Selcuk Candan; Maria Luisa Sapino

doi:10.1109/TKDE.2019.2942310

Matrix Factorization with Interval-Valued Data

Mao Lin Li, Francesco Di Mauro, K. Selcuk Candan, Maria Luisa Sapino

Engineering, Ira A. Fulton Schools of (IAFSE)

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

With many applications relying on multi-dimensional datasets for decision making, matrix factorization (or decomposition) is becoming the basis for many knowledge discoveries and machine learning tasks, from clustering, trend detection, anomaly detection, to correlation analysis. Unfortunately, a major shortcoming of matrix analysis operations is that, despite their effectiveness when the data is scalar, these operations become difficult to apply in the presence of non-scalar data, as they are not designed for data that include non-scalar observations, such as intervals. Yet, in many applications, the available data are inherently non-scalar for various reasons, including imprecision in data collection, conflicts in aggregated data, data summarization, or privacy issues, where one is provided with a reduced, clustered, or intentionally noisy and obfuscated version of the data to hide information. In this paper, we propose matrix decomposition techniques that consider the existence of interval-valued data. We show that naive ways to deal with such imperfect data may introduce errors in analysis and present factorization techniques that are especially effective when the amount of imprecise information is large.

Original language	English (US)
Article number	8844796
Pages (from-to)	1644-1658
Number of pages	15
Journal	IEEE Transactions on Knowledge and Data Engineering
Volume	33
Issue number	4
DOIs	https://doi.org/10.1109/TKDE.2019.2942310
State	Published - Apr 1 2021

Keywords

Matrix factorization
interval valued data

ASJC Scopus subject areas

Information Systems
Computer Science Applications
Computational Theory and Mathematics

Access to Document

10.1109/TKDE.2019.2942310

Cite this

@article{480e17208ac04450902c07bcf1c15802,

title = "Matrix Factorization with Interval-Valued Data",

abstract = "With many applications relying on multi-dimensional datasets for decision making, matrix factorization (or decomposition) is becoming the basis for many knowledge discoveries and machine learning tasks, from clustering, trend detection, anomaly detection, to correlation analysis. Unfortunately, a major shortcoming of matrix analysis operations is that, despite their effectiveness when the data is scalar, these operations become difficult to apply in the presence of non-scalar data, as they are not designed for data that include non-scalar observations, such as intervals. Yet, in many applications, the available data are inherently non-scalar for various reasons, including imprecision in data collection, conflicts in aggregated data, data summarization, or privacy issues, where one is provided with a reduced, clustered, or intentionally noisy and obfuscated version of the data to hide information. In this paper, we propose matrix decomposition techniques that consider the existence of interval-valued data. We show that naive ways to deal with such imperfect data may introduce errors in analysis and present factorization techniques that are especially effective when the amount of imprecise information is large.",

keywords = "Matrix factorization, interval valued data",

author = "Li, {Mao Lin} and Mauro, {Francesco Di} and Candan, {K. Selcuk} and Sapino, {Maria Luisa}",

note = "Publisher Copyright: {\textcopyright} 1989-2012 IEEE.",

year = "2021",

month = apr,

day = "1",

doi = "10.1109/TKDE.2019.2942310",

language = "English (US)",

volume = "33",

pages = "1644--1658",

journal = "IEEE Transactions on Knowledge and Data Engineering",

issn = "1041-4347",

publisher = "IEEE Computer Society",

number = "4",

}

TY - JOUR

T1 - Matrix Factorization with Interval-Valued Data

AU - Li, Mao Lin

AU - Mauro, Francesco Di

AU - Candan, K. Selcuk

AU - Sapino, Maria Luisa

PY - 2021/4/1

Y1 - 2021/4/1

N2 - With many applications relying on multi-dimensional datasets for decision making, matrix factorization (or decomposition) is becoming the basis for many knowledge discoveries and machine learning tasks, from clustering, trend detection, anomaly detection, to correlation analysis. Unfortunately, a major shortcoming of matrix analysis operations is that, despite their effectiveness when the data is scalar, these operations become difficult to apply in the presence of non-scalar data, as they are not designed for data that include non-scalar observations, such as intervals. Yet, in many applications, the available data are inherently non-scalar for various reasons, including imprecision in data collection, conflicts in aggregated data, data summarization, or privacy issues, where one is provided with a reduced, clustered, or intentionally noisy and obfuscated version of the data to hide information. In this paper, we propose matrix decomposition techniques that consider the existence of interval-valued data. We show that naive ways to deal with such imperfect data may introduce errors in analysis and present factorization techniques that are especially effective when the amount of imprecise information is large.

AB - With many applications relying on multi-dimensional datasets for decision making, matrix factorization (or decomposition) is becoming the basis for many knowledge discoveries and machine learning tasks, from clustering, trend detection, anomaly detection, to correlation analysis. Unfortunately, a major shortcoming of matrix analysis operations is that, despite their effectiveness when the data is scalar, these operations become difficult to apply in the presence of non-scalar data, as they are not designed for data that include non-scalar observations, such as intervals. Yet, in many applications, the available data are inherently non-scalar for various reasons, including imprecision in data collection, conflicts in aggregated data, data summarization, or privacy issues, where one is provided with a reduced, clustered, or intentionally noisy and obfuscated version of the data to hide information. In this paper, we propose matrix decomposition techniques that consider the existence of interval-valued data. We show that naive ways to deal with such imperfect data may introduce errors in analysis and present factorization techniques that are especially effective when the amount of imprecise information is large.

KW - Matrix factorization

KW - interval valued data

UR - http://www.scopus.com/inward/record.url?scp=85102249006&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85102249006&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2019.2942310

DO - 10.1109/TKDE.2019.2942310

M3 - Article

AN - SCOPUS:85102249006

SN - 1041-4347

VL - 33

SP - 1644

EP - 1658

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

IS - 4

M1 - 8844796

ER -

Matrix Factorization with Interval-Valued Data

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this