QPIAD: Query processing over incomplete autonomous databases

Hemal Khatri, Jianchun Fan, Yi Chen, Subbarao Kambhampati

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

Incompleteness due to missing attribute values (aka "null values") is very common in autonomous web databases, on which user accesses are usually supported through mediators. Traditional query processing techniques that focus on the strict soundness of answer tuples often ignore tuples with critical missing attributes, even if they wind up being relevant to a user query. Ideally we would like the mediator to retrieve such relevant uncertain answers and gauge their relevance by accessing their likelihood of being relevant answers to the query. However, the autonomous nature of the databases poses several challenges, such as the restricted access privileges, limited query patterns, and sensitivity of database and network resource consumption in the web environment. We introduce a novel query rewriting and optimization framework QPIAD that tackles these challenges to retrieve relevant uncertain answers. Our technique involves reformulating the user query based on approximate functional dependencies (AFDs) among the database attributes and ranking these queries using value distributions learned from Naïve Bayes Classifiers. Empirical studies demonstrate the effectiveness of our approach in retrieving relevant uncertain answers with high precision, high recall and manageable cost.

Original languageEnglish (US)
Title of host publicationProceedings - International Conference on Data Engineering
Pages1430-1432
Number of pages3
DOIs
StatePublished - 2007
Event23rd International Conference on Data Engineering, ICDE 2007 - Istanbul, Turkey
Duration: Apr 15 2007Apr 20 2007

Other

Other23rd International Conference on Data Engineering, ICDE 2007
CountryTurkey
CityIstanbul
Period4/15/074/20/07

Fingerprint

Query processing
Gages
Classifiers
Costs

Keywords

  • Autonomous databases
  • Incomplete databases
  • Query rewriting
  • Querying hidden web

ASJC Scopus subject areas

  • Software
  • Engineering(all)
  • Engineering (miscellaneous)

Cite this

Khatri, H., Fan, J., Chen, Y., & Kambhampati, S. (2007). QPIAD: Query processing over incomplete autonomous databases. In Proceedings - International Conference on Data Engineering (pp. 1430-1432). [4221818] https://doi.org/10.1109/ICDE.2007.369028

QPIAD : Query processing over incomplete autonomous databases. / Khatri, Hemal; Fan, Jianchun; Chen, Yi; Kambhampati, Subbarao.

Proceedings - International Conference on Data Engineering. 2007. p. 1430-1432 4221818.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Khatri, H, Fan, J, Chen, Y & Kambhampati, S 2007, QPIAD: Query processing over incomplete autonomous databases. in Proceedings - International Conference on Data Engineering., 4221818, pp. 1430-1432, 23rd International Conference on Data Engineering, ICDE 2007, Istanbul, Turkey, 4/15/07. https://doi.org/10.1109/ICDE.2007.369028
Khatri H, Fan J, Chen Y, Kambhampati S. QPIAD: Query processing over incomplete autonomous databases. In Proceedings - International Conference on Data Engineering. 2007. p. 1430-1432. 4221818 https://doi.org/10.1109/ICDE.2007.369028
Khatri, Hemal ; Fan, Jianchun ; Chen, Yi ; Kambhampati, Subbarao. / QPIAD : Query processing over incomplete autonomous databases. Proceedings - International Conference on Data Engineering. 2007. pp. 1430-1432
@inproceedings{ac7eccfdfbc44e7b87d88e844fdfa384,
title = "QPIAD: Query processing over incomplete autonomous databases",
abstract = "Incompleteness due to missing attribute values (aka {"}null values{"}) is very common in autonomous web databases, on which user accesses are usually supported through mediators. Traditional query processing techniques that focus on the strict soundness of answer tuples often ignore tuples with critical missing attributes, even if they wind up being relevant to a user query. Ideally we would like the mediator to retrieve such relevant uncertain answers and gauge their relevance by accessing their likelihood of being relevant answers to the query. However, the autonomous nature of the databases poses several challenges, such as the restricted access privileges, limited query patterns, and sensitivity of database and network resource consumption in the web environment. We introduce a novel query rewriting and optimization framework QPIAD that tackles these challenges to retrieve relevant uncertain answers. Our technique involves reformulating the user query based on approximate functional dependencies (AFDs) among the database attributes and ranking these queries using value distributions learned from Na{\"i}ve Bayes Classifiers. Empirical studies demonstrate the effectiveness of our approach in retrieving relevant uncertain answers with high precision, high recall and manageable cost.",
keywords = "Autonomous databases, Incomplete databases, Query rewriting, Querying hidden web",
author = "Hemal Khatri and Jianchun Fan and Yi Chen and Subbarao Kambhampati",
year = "2007",
doi = "10.1109/ICDE.2007.369028",
language = "English (US)",
isbn = "1424408032",
pages = "1430--1432",
booktitle = "Proceedings - International Conference on Data Engineering",

}

TY - GEN

T1 - QPIAD

T2 - Query processing over incomplete autonomous databases

AU - Khatri, Hemal

AU - Fan, Jianchun

AU - Chen, Yi

AU - Kambhampati, Subbarao

PY - 2007

Y1 - 2007

N2 - Incompleteness due to missing attribute values (aka "null values") is very common in autonomous web databases, on which user accesses are usually supported through mediators. Traditional query processing techniques that focus on the strict soundness of answer tuples often ignore tuples with critical missing attributes, even if they wind up being relevant to a user query. Ideally we would like the mediator to retrieve such relevant uncertain answers and gauge their relevance by accessing their likelihood of being relevant answers to the query. However, the autonomous nature of the databases poses several challenges, such as the restricted access privileges, limited query patterns, and sensitivity of database and network resource consumption in the web environment. We introduce a novel query rewriting and optimization framework QPIAD that tackles these challenges to retrieve relevant uncertain answers. Our technique involves reformulating the user query based on approximate functional dependencies (AFDs) among the database attributes and ranking these queries using value distributions learned from Naïve Bayes Classifiers. Empirical studies demonstrate the effectiveness of our approach in retrieving relevant uncertain answers with high precision, high recall and manageable cost.

AB - Incompleteness due to missing attribute values (aka "null values") is very common in autonomous web databases, on which user accesses are usually supported through mediators. Traditional query processing techniques that focus on the strict soundness of answer tuples often ignore tuples with critical missing attributes, even if they wind up being relevant to a user query. Ideally we would like the mediator to retrieve such relevant uncertain answers and gauge their relevance by accessing their likelihood of being relevant answers to the query. However, the autonomous nature of the databases poses several challenges, such as the restricted access privileges, limited query patterns, and sensitivity of database and network resource consumption in the web environment. We introduce a novel query rewriting and optimization framework QPIAD that tackles these challenges to retrieve relevant uncertain answers. Our technique involves reformulating the user query based on approximate functional dependencies (AFDs) among the database attributes and ranking these queries using value distributions learned from Naïve Bayes Classifiers. Empirical studies demonstrate the effectiveness of our approach in retrieving relevant uncertain answers with high precision, high recall and manageable cost.

KW - Autonomous databases

KW - Incomplete databases

KW - Query rewriting

KW - Querying hidden web

UR - http://www.scopus.com/inward/record.url?scp=34548755508&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34548755508&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2007.369028

DO - 10.1109/ICDE.2007.369028

M3 - Conference contribution

AN - SCOPUS:34548755508

SN - 1424408032

SN - 9781424408030

SP - 1430

EP - 1432

BT - Proceedings - International Conference on Data Engineering

ER -