Dimensionality reduction using magnitude and shape approximations

Umit Ogras, Hakan Ferhatosmanoglu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Scopus citations

Abstract

High dimensional data sets are encountered in many modern database applications. The usual approach is to construct a summary of the data set through a lossy compression technique, and use this lower dimensional synopsis to provide fast, approximate answers to the queries. In this paper, we develop a novel dimensionality reduction technique based on partitioning the high dimensional vector space into orthogonal subspaces. First, we find a relation between the Euclidian distance of two n-dimensional vectors and the Euclidian distances of their projections on the orthogonal subspaces. Then, based on this relation we develop a method to approximate the Euclidian distance using novel inner product approximation. This process allows us to incorporate the shape information of the vectors to this approximation. While the inner product approximation is symmetric, i.e., captures only the magnitude information of the data, the proposed method takes both the magnitude and shape information of the original vectors into account through partitioning. In the experiments, we demonstrate the effectiveness of our technique by comparing it with commonly used methods.

Original languageEnglish (US)
Title of host publicationInternational Conference on Information and Knowledge Management, Proceedings
EditorsO. Frieder, J. Hammer, S. Qureshi, L. Seligman
Pages99-107
Number of pages9
StatePublished - 2003
Externally publishedYes
EventCIKM 2003: Proceedings of the Twelfth ACM International Conference on Information and Knowledge Management - New Orleans, LA, United States
Duration: Nov 3 2003Nov 8 2003

Other

OtherCIKM 2003: Proceedings of the Twelfth ACM International Conference on Information and Knowledge Management
Country/TerritoryUnited States
CityNew Orleans, LA
Period11/3/0311/8/03

Keywords

  • High Dimensional Data
  • Shape Approximation
  • Similarity Search

ASJC Scopus subject areas

  • General Business, Management and Accounting

Fingerprint

Dive into the research topics of 'Dimensionality reduction using magnitude and shape approximations'. Together they form a unique fingerprint.

Cite this