AlphaSum: Size-constrained table summarization using value lattices

Kasim Candan, Huiping Cao, Yan Qi, Maria Luisa Sapino

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

8 Scopus citations

Abstract

Consider a scientist who wants to explore multiple data sets to select the relevant ones for further analysis. Since the visualization real estate may put a stringent constraint on how much detail can be presented to this user in a single page, effective table summarization techniques are needed to create summaries that are both sufficiently small and effective in communicating the available content. In this paper, we first argue that table summarization can benefit from knowledge about acceptable alternatives for clustering the values in the database. We formulate the problem of table summarization with the help of value lattices. We then provide a framework to express alternative clustering strategies and to account for various utility measures (such as information loss) in assessing different summarization alternatives. Based on this interpretation, we introduce three preference criteria, max-min-util (cautious), max-sum-util (cumulative), and pareto-util, for the problem of table summarization. To tackle the inherent complexity, we rely on the properties of the fuzzy interpretation to further develop a novel ranked set cover based evaluation mechanism (RSC). These are brought together in AlphaSum, a table summarization system. Experimental evaluations showed that RSC improves both execution times and summary quality in AlphaSum by pruning the search space more effectively than the existing solutions.

Original language: English (US)
Title of host publication: Proceedings of the 12th International Conference on Extending Database Technology
Subtitle of host publication: Advances in Database Technology, EDBT'09
Pages: 96-107
Number of pages: 12
DOIs: https://doi.org/10.1145/1516360.1516373
State: Published - Sep 21 2009
Event: 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09 - Saint Petersburg, Russian Federation
Duration: Mar 24 2009 → Mar 26 2009

Publication series

Name: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09

Other

Other: 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09
Country: Russian Federation
City: Saint Petersburg
Period: 3/24/09 → 3/26/09

ASJC Scopus subject areas

  • Computer Science Applications
  • Software

Cite this

Candan, K., Cao, H., Qi, Y., & Sapino, M. L. (2009). AlphaSum: Size-constrained table summarization using value lattices. In Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09 (pp. 96-107). (Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09). https://doi.org/10.1145/1516360.1516373