Scalable data exchange with functional dependencies

Bruno Marnette, Giansalvatore Mecca, Paolo Papotti

Research output: Contribution to journal › Article

30 Citations (Scopus)

Abstract

The recent literature has provided a solid theoretical foundation for the use of schema mappings in data-exchange applications. Following this formalization, new algorithms have been developed to generate optimal solutions for mapping scenarios in a highly scalable way, by relying on SQL. However, these algorithms suffer from a serious drawback: they are not able to handle key constraints and functional dependencies on the target, i.e., equality generating dependencies (egds). While egds play a crucial role in the generation of optimal solutions, handling them with first-order languages is a difficult problem. In fact, we start from a negative result: it is not always possible to compute solutions for scenarios with egds using an SQL script. Then, we identify many practical cases in which this is possible, and develop a best-effort algorithm to do this. Experimental results show that our algorithm produces solutions of better quality with respect to those produced by previous algorithms, and scales nicely to large databases.
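As a rough illustration of what an egd demands of a target instance, the sketch below enforces a single key constraint in memory: rows that agree on the key must agree on every other attribute, so labeled nulls are replaced by the values they are forced to equal. This is the textbook chase step, not the SQL-based best-effort algorithm the abstract describes. The names `Null`, `find_violation`, and `enforce_key`, and the sample rows, are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Null:
    """A labeled null: a placeholder for an unknown target value (hypothetical helper)."""
    label: str


def find_violation(rows, key_len):
    """Return (old, new) for the first pair of values the key egd forces to be equal,
    or None if every pair of rows sharing a key already agrees."""
    seen = {}
    for row in rows:
        other = seen.setdefault(row[:key_len], row)
        for a, b in zip(other[key_len:], row[key_len:]):
            if a == b:
                continue
            if isinstance(a, Null):
                return a, b
            if isinstance(b, Null):
                return b, a
            # Two distinct constants cannot be equated: no solution exists.
            raise ValueError(f"key violation: {a!r} != {b!r}")
    return None


def enforce_key(rows, key_len):
    """Repeatedly equate values until the key on the first key_len attributes holds."""
    rows = [tuple(r) for r in rows]
    while (violation := find_violation(rows, key_len)) is not None:
        old, new = violation
        # Substitute the null everywhere it occurs, then drop duplicate rows.
        rows = [tuple(new if v == old else v for v in r) for r in rows]
        rows = list(dict.fromkeys(rows))
    return rows


# Assumed sample data: two partial facts about "alice" and one about "bob".
sample = [
    ("alice", Null("N1"), "Rome"),
    ("alice", "IBM", Null("N2")),
    ("bob", Null("N3"), Null("N4")),
]
merged = enforce_key(sample, key_len=1)
```

Here the two partial rows for "alice" collapse into one complete row, while the row for "bob" keeps its nulls. The loop runs until no violation remains, and that kind of iteration to a fixpoint gives some intuition for why a fixed first-order SQL script cannot always compute such solutions, as the negative result in the abstract states.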

Original language: English (US)
Pages (from-to): 105-116
Number of pages: 12
Journal: Unknown Journal
Volume: 3
Issue number: 1
State: Published - Sep 2010
Externally published: Yes

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science (all)

Cite this

Marnette, B., Mecca, G., & Papotti, P. (2010). Scalable data exchange with functional dependencies. Unknown Journal, 3(1), 105-116.

@article{8be09e2f863548d7a000a158e4c0d40a,
title = "Scalable data exchange with functional dependencies",
author = "Bruno Marnette and Giansalvatore Mecca and Paolo Papotti",
year = "2010",
month = "9",
language = "English (US)",
volume = "3",
pages = "105--116",
journal = "Scanning Electron Microscopy",
issn = "0586-5581",
publisher = "Scanning Microscopy International",
number = "1",

}

TY - JOUR

T1 - Scalable data exchange with functional dependencies

AU - Marnette, Bruno

AU - Mecca, Giansalvatore

AU - Papotti, Paolo

PY - 2010/9

Y1 - 2010/9

UR - http://www.scopus.com/inward/record.url?scp=79952774500&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79952774500&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:79952774500

VL - 3

SP - 105

EP - 116

JO - Scanning Electron Microscopy

JF - Scanning Electron Microscopy

SN - 0586-5581

IS - 1

ER -