Interactive data repairing: The FALCON dive

Enzo Veltri, Donatello Santoro, Giansalvatore Mecca, Paolo Papotti, Jian He, Gouliang Li, Nan Tang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper we discuss Falcon, an interactive, deterministic, and declarative data cleaning system. Unlike traditional rule-based system, Falcon does not rely on the existence of a set of pre-defined data quality rules, but it encourages users to explore the data, identify possible problems, and make updates to fix them. The main technical challenge consists in finding a set of rules, expressed as sql update queries, that are semantically correct and that fixes the largest number of errors in the data. Falcon navigates the lattice by interacting with users to gradually checking the correctness of a set of rules. We have conducted extensive experiments using both real-world and synthetic datasets to show that Falcon can effectively communicate with users in data repairing.

Original languageEnglish (US)
Title of host publication25th Italian Symposium on Advanced Database Systems, SEBD 2017
PublisherSistemi Evoluti per Basi di Dati (SEBD)
Pages267-274
Number of pages8
StatePublished - Jan 1 2017
Event25th Italian Symposium on Advanced Database Systems, SEBD 2017 - Squillace Lido, Catanzaro, Italy
Duration: Jun 25 2017Jun 29 2017

Other

Other25th Italian Symposium on Advanced Database Systems, SEBD 2017
Country/TerritoryItaly
CitySquillace Lido, Catanzaro
Period6/25/176/29/17

ASJC Scopus subject areas

  • Software
  • Information Systems

Fingerprint

Dive into the research topics of 'Interactive data repairing: The FALCON dive'. Together they form a unique fingerprint.

Cite this