TY - JOUR
T1 - Interactive Data Repairing
T2 - 25th Italian Symposium on Advanced Database Systems, SEBD 2017
AU - Veltri, Enzo
AU - Santoro, Donatello
AU - Mecca, Giansalvatore
AU - Papotti, Paolo
AU - He, Jian
AU - Li, Guoliang
AU - Tang, Nan
PY - 2017
Y1 - 2017
N2 - In this paper we discuss Falcon, an interactive, deterministic, and declarative data cleaning system. Unlike traditional rule-based systems, Falcon does not rely on the existence of a set of pre-defined data quality rules; instead, it encourages users to explore the data, identify possible problems, and make updates to fix them. The main technical challenge consists in finding a set of rules, expressed as SQL update queries, that are semantically correct and that fix the largest number of errors in the data. Falcon navigates the lattice of candidate rules by interacting with users to gradually check the correctness of a set of rules. We have conducted extensive experiments using both real-world and synthetic datasets to show that Falcon can effectively communicate with users in data repairing.
AB - In this paper we discuss Falcon, an interactive, deterministic, and declarative data cleaning system. Unlike traditional rule-based systems, Falcon does not rely on the existence of a set of pre-defined data quality rules; instead, it encourages users to explore the data, identify possible problems, and make updates to fix them. The main technical challenge consists in finding a set of rules, expressed as SQL update queries, that are semantically correct and that fix the largest number of errors in the data. Falcon navigates the lattice of candidate rules by interacting with users to gradually check the correctness of a set of rules. We have conducted extensive experiments using both real-world and synthetic datasets to show that Falcon can effectively communicate with users in data repairing.
UR - http://www.scopus.com/inward/record.url?scp=85041447513&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85041447513&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85041447513
SN - 1613-0073
VL - 2037
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
Y2 - 25 June 2017 through 29 June 2017
ER -