Supporting annotations on relations

Mohamed Y. Eltabakh; Walid G. Aref; Ahmed K. Elmagarmid; Mourad Ouzzani; Yasin N. Silva

doi:10.1145/1516360.1516405

Supporting annotations on relations

Mohamed Y. Eltabakh, Walid G. Aref, Ahmed K. Elmagarmid, Mourad Ouzzani, Yasin N. Silva

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

26 Scopus citations

Abstract

Annotations play a key role in understanding and curating databases. Annotations may represent comments, descriptions, lineage information, among several others. Annotation management is a vital mechanism for sharing knowledge and building an interactive and collaborative environment among database users and scientists. What makes it challenging is that annotations can be attached to database entities at various granularities, e.g., at the table, tuple, column, cell levels, or more generally, to any subset of cells that results from a select statement. Therefore, simple comment fields in tuples would not work because of the combinatorial nature of the annotations. In this paper, we present extensions to current database management systems to support annotations. We propose storage schemes to efficiently store annotations at multiple granularities, i.e., at the table, tuple, column, and cell levels. Compared to storing the annotations with the individual cells, the proposed schemes achieve more than an order-of-magnitude reduction in storage and up to 70% saving in the query execution time. We define types of annotations that inherit different behaviors. Through these types, users can specify, for example, whether or not an annotation is continuously applied over newly inserted data and whether or not an annotation is archived when the base data is modified. These annotation types raise several storage and processing challenges that are addressed in the paper. We propose declarative ways to add, archive, query, and propagate annotations. The proposed mechanisms are realized through extensions to the standard SQL. We implemented the proposed functionalities inside PostgreSQL with an easy to use Excel-based front-end graphical interface.

Original language	English (US)
Title of host publication	Proceedings of the 12th International Conference on Extending Database Technology
Subtitle of host publication	Advances in Database Technology, EDBT'09
Pages	379-390
Number of pages	12
DOIs	https://doi.org/10.1145/1516360.1516405
State	Published - 2009
Externally published	Yes
Event	12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09 - Saint Petersburg, Russian Federation Duration: Mar 24 2009 → Mar 26 2009

Publication series

Name	Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09

Other

Other	12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09
Country/Territory	Russian Federation
City	Saint Petersburg
Period	3/24/09 → 3/26/09

ASJC Scopus subject areas

Computer Science Applications
Software

Access to Document

10.1145/1516360.1516405

Cite this

Eltabakh, M. Y., Aref, W. G., Elmagarmid, A. K., Ouzzani, M., & Silva, Y. N. (2009). Supporting annotations on relations. In Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09 (pp. 379-390). (Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09). https://doi.org/10.1145/1516360.1516405

Supporting annotations on relations. / Eltabakh, Mohamed Y.; Aref, Walid G.; Elmagarmid, Ahmed K. et al.
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09. 2009. p. 379-390 (Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Eltabakh, MY, Aref, WG, Elmagarmid, AK, Ouzzani, M & Silva, YN 2009, Supporting annotations on relations. in Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09. Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09, pp. 379-390, 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09, Saint Petersburg, Russian Federation, 3/24/09. https://doi.org/10.1145/1516360.1516405

Eltabakh MY, Aref WG, Elmagarmid AK, Ouzzani M, Silva YN. Supporting annotations on relations. In Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09. 2009. p. 379-390. (Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09). doi: 10.1145/1516360.1516405

Eltabakh, Mohamed Y. ; Aref, Walid G. ; Elmagarmid, Ahmed K. et al. / Supporting annotations on relations. Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09. 2009. pp. 379-390 (Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09).

@inproceedings{df3dee69e2df4a788dab079311fc12ee,

title = "Supporting annotations on relations",

abstract = "Annotations play a key role in understanding and curating databases. Annotations may represent comments, descriptions, lineage information, among several others. Annotation management is a vital mechanism for sharing knowledge and building an interactive and collaborative environment among database users and scientists. What makes it challenging is that annotations can be attached to database entities at various granularities, e.g., at the table, tuple, column, cell levels, or more generally, to any subset of cells that results from a select statement. Therefore, simple comment fields in tuples would not work because of the combinatorial nature of the annotations. In this paper, we present extensions to current database management systems to support annotations. We propose storage schemes to efficiently store annotations at multiple granularities, i.e., at the table, tuple, column, and cell levels. Compared to storing the annotations with the individual cells, the proposed schemes achieve more than an order-of-magnitude reduction in storage and up to 70% saving in the query execution time. We define types of annotations that inherit different behaviors. Through these types, users can specify, for example, whether or not an annotation is continuously applied over newly inserted data and whether or not an annotation is archived when the base data is modified. These annotation types raise several storage and processing challenges that are addressed in the paper. We propose declarative ways to add, archive, query, and propagate annotations. The proposed mechanisms are realized through extensions to the standard SQL. We implemented the proposed functionalities inside PostgreSQL with an easy to use Excel-based front-end graphical interface.",

author = "Eltabakh, {Mohamed Y.} and Aref, {Walid G.} and Elmagarmid, {Ahmed K.} and Mourad Ouzzani and Silva, {Yasin N.}",

year = "2009",

doi = "10.1145/1516360.1516405",

language = "English (US)",

isbn = "9781605584225",

series = "Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09",

pages = "379--390",

booktitle = "Proceedings of the 12th International Conference on Extending Database Technology",

note = "12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09 ; Conference date: 24-03-2009 Through 26-03-2009",

}

TY - GEN

T1 - Supporting annotations on relations

AU - Eltabakh, Mohamed Y.

AU - Aref, Walid G.

AU - Elmagarmid, Ahmed K.

AU - Ouzzani, Mourad

AU - Silva, Yasin N.

PY - 2009

Y1 - 2009

N2 - Annotations play a key role in understanding and curating databases. Annotations may represent comments, descriptions, lineage information, among several others. Annotation management is a vital mechanism for sharing knowledge and building an interactive and collaborative environment among database users and scientists. What makes it challenging is that annotations can be attached to database entities at various granularities, e.g., at the table, tuple, column, cell levels, or more generally, to any subset of cells that results from a select statement. Therefore, simple comment fields in tuples would not work because of the combinatorial nature of the annotations. In this paper, we present extensions to current database management systems to support annotations. We propose storage schemes to efficiently store annotations at multiple granularities, i.e., at the table, tuple, column, and cell levels. Compared to storing the annotations with the individual cells, the proposed schemes achieve more than an order-of-magnitude reduction in storage and up to 70% saving in the query execution time. We define types of annotations that inherit different behaviors. Through these types, users can specify, for example, whether or not an annotation is continuously applied over newly inserted data and whether or not an annotation is archived when the base data is modified. These annotation types raise several storage and processing challenges that are addressed in the paper. We propose declarative ways to add, archive, query, and propagate annotations. The proposed mechanisms are realized through extensions to the standard SQL. We implemented the proposed functionalities inside PostgreSQL with an easy to use Excel-based front-end graphical interface.

AB - Annotations play a key role in understanding and curating databases. Annotations may represent comments, descriptions, lineage information, among several others. Annotation management is a vital mechanism for sharing knowledge and building an interactive and collaborative environment among database users and scientists. What makes it challenging is that annotations can be attached to database entities at various granularities, e.g., at the table, tuple, column, cell levels, or more generally, to any subset of cells that results from a select statement. Therefore, simple comment fields in tuples would not work because of the combinatorial nature of the annotations. In this paper, we present extensions to current database management systems to support annotations. We propose storage schemes to efficiently store annotations at multiple granularities, i.e., at the table, tuple, column, and cell levels. Compared to storing the annotations with the individual cells, the proposed schemes achieve more than an order-of-magnitude reduction in storage and up to 70% saving in the query execution time. We define types of annotations that inherit different behaviors. Through these types, users can specify, for example, whether or not an annotation is continuously applied over newly inserted data and whether or not an annotation is archived when the base data is modified. These annotation types raise several storage and processing challenges that are addressed in the paper. We propose declarative ways to add, archive, query, and propagate annotations. The proposed mechanisms are realized through extensions to the standard SQL. We implemented the proposed functionalities inside PostgreSQL with an easy to use Excel-based front-end graphical interface.

UR - http://www.scopus.com/inward/record.url?scp=70349101933&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70349101933&partnerID=8YFLogxK

U2 - 10.1145/1516360.1516405

DO - 10.1145/1516360.1516405

M3 - Conference contribution

AN - SCOPUS:70349101933

SN - 9781605584225

T3 - Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09

SP - 379

EP - 390

BT - Proceedings of the 12th International Conference on Extending Database Technology

T2 - 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09

Y2 - 24 March 2009 through 26 March 2009

ER -

Supporting annotations on relations

Abstract

Publication series

Other

ASJC Scopus subject areas

Access to Document

Other files and links

Cite this