Supporting annotations on relations

Mohamed Y. Eltabakh, Walid G. Aref, Ahmed K. Elmagarmid, Mourad Ouzzani, Yasin N. Silva

Research output: Chapter in Book/Report/Conference proceedingConference contribution

26 Scopus citations

Abstract

Annotations play a key role in understanding and curating databases. Annotations may represent comments, descriptions, lineage information, among several others. Annotation management is a vital mechanism for sharing knowledge and building an interactive and collaborative environment among database users and scientists. What makes it challenging is that annotations can be attached to database entities at various granularities, e.g., at the table, tuple, column, cell levels, or more generally, to any subset of cells that results from a select statement. Therefore, simple comment fields in tuples would not work because of the combinatorial nature of the annotations. In this paper, we present extensions to current database management systems to support annotations. We propose storage schemes to efficiently store annotations at multiple granularities, i.e., at the table, tuple, column, and cell levels. Compared to storing the annotations with the individual cells, the proposed schemes achieve more than an order-of-magnitude reduction in storage and up to 70% saving in the query execution time. We define types of annotations that inherit different behaviors. Through these types, users can specify, for example, whether or not an annotation is continuously applied over newly inserted data and whether or not an annotation is archived when the base data is modified. These annotation types raise several storage and processing challenges that are addressed in the paper. We propose declarative ways to add, archive, query, and propagate annotations. The proposed mechanisms are realized through extensions to the standard SQL. We implemented the proposed functionalities inside PostgreSQL with an easy to use Excel-based front-end graphical interface.

Original languageEnglish (US)
Title of host publicationProceedings of the 12th International Conference on Extending Database Technology
Subtitle of host publicationAdvances in Database Technology, EDBT'09
Pages379-390
Number of pages12
DOIs
StatePublished - 2009
Externally publishedYes
Event12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09 - Saint Petersburg, Russian Federation
Duration: Mar 24 2009Mar 26 2009

Publication series

NameProceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09

Other

Other12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09
Country/TerritoryRussian Federation
CitySaint Petersburg
Period3/24/093/26/09

ASJC Scopus subject areas

  • Computer Science Applications
  • Software

Cite this