The LLUNATIC data-cleaning framework

Floris Geerts, Giansalvatore Mecca, Paolo Papotti, Donatello Santoro

Research output: Contribution to journalArticle

96 Scopus citations

Abstract

Data-cleaning (or data-repairing) is considered a crucial problem in many database-related tasks. It consists in making a database consistent with respect to a set of given constraints. In recent years, repairing methods have been proposed for several classes of constraints. However, these methods rely on ad hoc decisions and tend to hard-code the strategy to repair conflicting values. As a consequence, there is currently no general algorithm to solve database repairing problems that involve different kinds of constraints and different strategies to select preferred values. In this paper we develop a uniform framework to solve this problem. We propose a new semantics for repairs, and a chase-based algorithm to compute minimal solutions. We implemented the framework in a DBMSbased prototype, and we report experimental results that confirm its good scalability and superior quality in computing repairs.

Original languageEnglish (US)
Pages (from-to)625-636
Number of pages12
JournalProceedings of the VLDB Endowment
Volume6
Issue number9
DOIs
StatePublished - 2013

    Fingerprint

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science(all)

Cite this

Geerts, F., Mecca, G., Papotti, P., & Santoro, D. (2013). The LLUNATIC data-cleaning framework. Proceedings of the VLDB Endowment, 6(9), 625-636. https://doi.org/10.14778/2536360.2536363