SimDB: A similarity-aware database system

Yasin N. Silva, Ahmed M. Aly, Walid G. Aref, Per Ake Larson

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

27 Scopus citations


The identification and processing of similarities in the data play a key role in multiple application scenarios. Several types of similarity-aware operations have been studied in the literature. However, in most of the previous work, similarity-aware operations are studied in isolation from other regular or similarity-aware operations. Furthermore, most of the previous research in the area considers a standalone implementation, i.e., without any integration with a database system. In this demonstration, we present SimDB, a similarity-aware database management system. SimDB supports multiple similarity-aware operations as first-class database operators. We describe the architectural changes to implement the similarity-aware operators. In particular, we present the way the implementation machinery of conventional operators is extended to support similarity-aware operators. We also show how these operators interact with other similarity-aware and regular operators. Specifically, we show the effectiveness of multiple equivalence rules that can be used to extend cost-based query optimization to the case of similarity-aware operations.
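To make the two operation types named in the keywords concrete, here is a minimal Python sketch of what a similarity join and a similarity group-by might compute over one-dimensional numeric values. It is an illustration under stated assumptions, not SimDB's implementation: the function names `eps_join` and `similarity_group_by`, the absolute-difference distance, and the gap-based grouping rule are all hypothetical choices made for this example.

```python
from typing import List, Tuple


def eps_join(left: List[float], right: List[float], eps: float) -> List[Tuple[float, float]]:
    """Naive nested-loop similarity join (assumed semantics):
    return every pair (l, r) whose absolute difference is at most eps."""
    return [(l, r) for l in left for r in right if abs(l - r) <= eps]


def similarity_group_by(values: List[float], max_gap: float) -> List[List[float]]:
    """Gap-based similarity grouping (assumed semantics):
    scan the values in sorted order and start a new group whenever
    the distance to the previous value exceeds max_gap."""
    groups: List[List[float]] = []
    for v in sorted(values):
        if groups and v - groups[-1][-1] <= max_gap:
            groups[-1].append(v)   # close enough to the previous value
        else:
            groups.append([v])     # gap too large: open a new group
    return groups
```

In a system that treats such operations as first-class operators, an optimizer could reorder them with regular operators. For instance, a selection on a left-side attribute yields the same result whether it is applied before or after `eps_join`, which is the kind of equivalence a cost-based optimizer can exploit.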

Original language: English (US)
Title of host publication: Proceedings of the 2010 International Conference on Management of Data, SIGMOD '10
Number of pages: 4
State: Published - Jul 23 2010
Externally published: Yes
Event: 2010 International Conference on Management of Data, SIGMOD '10 - Indianapolis, IN, United States
Duration: Jun 6 2010 to Jun 11 2010

Publication series

Name: Proceedings of the ACM SIGMOD International Conference on Management of Data
ISSN (Print): 0730-8078


Conference: 2010 International Conference on Management of Data, SIGMOD '10
Country/Territory: United States
City: Indianapolis, IN


  • similarity group-by
  • similarity join
  • similarity-aware query processing and optimization

ASJC Scopus subject areas

  • Software
  • Information Systems

