Similarity queries

Their conceptual evaluation, transformations, and processing

Yasin Silva, Walid G. Aref, Per Ake Larson, Spencer S. Pearson, Mohamed H. Ali

Research output: Contribution to journal › Article

27 Citations (Scopus)

Abstract

Many application scenarios can significantly benefit from the identification and processing of similarities in the data. Even though some work has been done to extend the semantics of some operators, for example join and selection, to be aware of data similarities, there has not been much study on the role and implementation of similarity-aware operations as first-class database operators. Furthermore, very little work has addressed the problem of evaluating and optimizing queries that combine several similarity operations. The focus of this paper is the study of similarity queries that contain one or multiple first-class similarity database operators such as Similarity Selection, Similarity Join, and Similarity Group-by. Particularly, we analyze the implementation techniques of several similarity operators, introduce a consistent and comprehensive conceptual evaluation model for similarity queries, and present a rich set of transformation rules to extend cost-based query optimization to the case of similarity queries.

Original language: English (US)
Pages (from-to): 395-420
Number of pages: 26
Journal: VLDB Journal
Volume: 22
Issue number: 3
DOI: 10.1007/s00778-012-0296-4
State: Published - Jun 2013

Fingerprint

Processing
Semantics
Costs

Keywords

  • Conceptual evaluation
  • Query processing
  • Query transformations
  • Similarity queries

ASJC Scopus subject areas

  • Hardware and Architecture
  • Information Systems

Cite this

Similarity queries: Their conceptual evaluation, transformations, and processing. / Silva, Yasin; Aref, Walid G.; Larson, Per Ake; Pearson, Spencer S.; Ali, Mohamed H.

In: VLDB Journal, Vol. 22, No. 3, 06.2013, p. 395-420.

Silva, Yasin; Aref, Walid G.; Larson, Per Ake; Pearson, Spencer S.; Ali, Mohamed H. / Similarity queries: Their conceptual evaluation, transformations, and processing. In: VLDB Journal. 2013; Vol. 22, No. 3. pp. 395-420.

@article{e41113caf5434f408b74eca7c0d50432,
title = "Similarity queries: Their conceptual evaluation, transformations, and processing",
keywords = "Conceptual evaluation, Query processing, Query transformations, Similarity queries",
author = "Silva, Yasin and Aref, {Walid G.} and Larson, {Per Ake} and Pearson, {Spencer S.} and Ali, {Mohamed H.}",
year = "2013",
month = "6",
doi = "10.1007/s00778-012-0296-4",
language = "English (US)",
volume = "22",
pages = "395--420",
journal = "VLDB Journal",
issn = "1066-8888",
publisher = "Springer New York",
number = "3",

}

TY - JOUR

T1 - Similarity queries

T2 - Their conceptual evaluation, transformations, and processing

AU - Silva, Yasin

AU - Aref, Walid G.

AU - Larson, Per Ake

AU - Pearson, Spencer S.

AU - Ali, Mohamed H.

PY - 2013/6

Y1 - 2013/6

KW - Conceptual evaluation

KW - Query processing

KW - Query transformations

KW - Similarity queries

UR - http://www.scopus.com/inward/record.url?scp=84877876392&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84877876392&partnerID=8YFLogxK

U2 - 10.1007/s00778-012-0296-4

DO - 10.1007/s00778-012-0296-4

M3 - Article

VL - 22

SP - 395

EP - 420

JO - VLDB Journal

JF - VLDB Journal

SN - 1066-8888

IS - 3

ER -