Matrix Factorization with Interval-Valued Data

Mao Lin Li, Francesco Di Mauro, K. Selcuk Candan, Maria Luisa Sapino

Research output: Contribution to journalArticlepeer-review

Abstract

With many applications relying on multi-dimensional datasets for decision making, matrix factorization (or decomposition) is becoming the basis for many knowledge discoveries and machine learning tasks, from clustering, trend detection, anomaly detection, to correlation analysis. Unfortunately, a major shortcoming of matrix analysis operations is that, despite their effectiveness when the data is scalar, these operations become difficult to apply in the presence of non-scalar data, as they are not designed for data that include non-scalar observations, such as intervals. Yet, in many applications, the available data are inherently non-scalar for various reasons, including imprecision in data collection, conflicts in aggregated data, data summarization, or privacy issues, where one is provided with a reduced, clustered, or intentionally noisy and obfuscated version of the data to hide information. In this paper, we propose matrix decomposition techniques that consider the existence of interval-valued data. We show that naive ways to deal with such imperfect data may introduce errors in analysis and present factorization techniques that are especially effective when the amount of imprecise information is large.

Original languageEnglish (US)
Article number8844796
Pages (from-to)1644-1658
Number of pages15
JournalIEEE Transactions on Knowledge and Data Engineering
Volume33
Issue number4
DOIs
StatePublished - Apr 1 2021

Keywords

  • interval valued data
  • Matrix factorization

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

Fingerprint Dive into the research topics of 'Matrix Factorization with Interval-Valued Data'. Together they form a unique fingerprint.

Cite this