Safety Guarantee of continuous join queries over punctuated data streams

Hua Gang Li; Songting Chen; Junichi Tatemura; Divyakant Agrawal; K. Selçuk Candan; Wang Pin Hsiung

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Continuous join queries (CJQs) are needed for correlating data from multiple streams. One fundamental problem in processing such queries is that, because the data streams are infinite, the join operator would have to store unbounded state and would eventually run out of space. Punctuation semantics has been proposed specifically to address this problem. In particular, punctuations explicitly mark the end of a subset of data and hence enable purging of stored data that will not contribute to any new query results. Given a set of available punctuation schemes, if one can identify that a CJQ still requires unbounded storage, then this query can be flagged as unsafe and prevented from running. Unfortunately, while punctuation semantics is clearly useful, the mechanisms to identify if and how a particular CJQ could benefit from a given set of punctuation schemes are not yet known. In this paper, we provide necessary and sufficient conditions for checking whether a CJQ can be safely executed under a given set of punctuation schemes. In particular, we introduce a novel punctuation graph to aid the safety analysis of a given query. We show that the safety checking problem can be solved in polynomial time based on this punctuation graph construct. In addition, various issues and challenges related to the safety checking of CJQs are highlighted.
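
The purging idea described in the abstract can be illustrated with a small sketch. This is not the paper's algorithm and does not implement its punctuation graph; it is a minimal two-stream symmetric hash join, assuming a punctuation simply announces that one stream will send no further tuples for a given join key. The class name `PunctuatedJoin`, its methods, and the stream labels `"A"` and `"B"` are hypothetical names chosen for illustration.

```python
from collections import defaultdict


class PunctuatedJoin:
    """Illustrative symmetric hash join of two streams on a single key.

    Assumption: a punctuation (side, key) promises that `side` will
    never again send a tuple carrying `key`.
    """

    def __init__(self):
        # state[side][key] -> payloads stored for that side and key
        self.state = {"A": defaultdict(list), "B": defaultdict(list)}
        # closed[side] -> keys that side has promised never to send again
        self.closed = {"A": set(), "B": set()}

    @staticmethod
    def _other(side):
        return "B" if side == "A" else "A"

    def on_tuple(self, side, key, payload):
        """Process a data tuple and return the join results it produces."""
        other = self._other(side)
        matches = self.state[other].get(key, [])
        if side == "A":
            results = [(key, payload, m) for m in matches]
        else:
            results = [(key, m, payload) for m in matches]
        # Store the tuple only if the other side may still send this key.
        if key not in self.closed[other]:
            self.state[side][key].append(payload)
        return results

    def on_punctuation(self, side, key):
        """Record the promise and purge state that can no longer match."""
        self.closed[side].add(key)
        # Tuples the other side stored under `key` were waiting for future
        # tuples from `side`; none will arrive, so they can be dropped.
        self.state[self._other(side)].pop(key, None)

    def stored(self):
        """Number of payloads currently held in join state."""
        return sum(len(v) for s in self.state.values() for v in s.values())
```

In this sketch, a punctuation from one stream frees the state held for the other stream, so a key's state is emptied only once both streams have punctuated it. A query for which some stored tuples can never be purged under the available punctuations is the kind the abstract calls unsafe. The sketch also keeps the `closed` sets forever, which is itself unbounded state; that simplification is deliberate and is not addressed here.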

Original language	English (US)
Title of host publication	VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases
Publisher	Association for Computing Machinery
Pages	19-30
Number of pages	12
ISBN (Print)	1595933859, 9781595933850
State	Published - 2006
Externally published	Yes
Event	32nd International Conference on Very Large Data Bases, VLDB 2006 - Seoul, Korea, Republic of
Duration	Sep 12 2006 → Sep 15 2006

Publication series

Name	VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases

Other

Other	32nd International Conference on Very Large Data Bases, VLDB 2006
Country/Territory	Korea, Republic of
City	Seoul
Period	9/12/06 → 9/15/06

ASJC Scopus subject areas

Hardware and Architecture
Information Systems
Software
Information Systems and Management

Cite this

Li, H. G., Chen, S., Tatemura, J., Agrawal, D., Candan, K. S., & Hsiung, W. P. (2006). Safety Guarantee of continuous join queries over punctuated data streams. In VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases (pp. 19-30). (VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases). Association for Computing Machinery.

Safety Guarantee of continuous join queries over punctuated data streams. / Li, Hua Gang; Chen, Songting; Tatemura, Junichi et al.
VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases. Association for Computing Machinery, 2006. p. 19-30 (VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases).

Li, HG, Chen, S, Tatemura, J, Agrawal, D, Candan, KS & Hsiung, WP 2006, Safety Guarantee of continuous join queries over punctuated data streams. in VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases. VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases, Association for Computing Machinery, pp. 19-30, 32nd International Conference on Very Large Data Bases, VLDB 2006, Seoul, Korea, Republic of, 9/12/06.

@inproceedings{7cd7b01c13444985a671281cc85acc3f,

title = "Safety Guarantee of continuous join queries over punctuated data streams",

abstract = "Continuous join queries (CJQs) are needed for correlating data from multiple streams. One fundamental problem in processing such queries is that, because the data streams are infinite, the join operator would have to store unbounded state and would eventually run out of space. Punctuation semantics has been proposed specifically to address this problem. In particular, punctuations explicitly mark the end of a subset of data and hence enable purging of stored data that will not contribute to any new query results. Given a set of available punctuation schemes, if one can identify that a CJQ still requires unbounded storage, then this query can be flagged as unsafe and prevented from running. Unfortunately, while punctuation semantics is clearly useful, the mechanisms to identify if and how a particular CJQ could benefit from a given set of punctuation schemes are not yet known. In this paper, we provide necessary and sufficient conditions for checking whether a CJQ can be safely executed under a given set of punctuation schemes. In particular, we introduce a novel punctuation graph to aid the safety analysis of a given query. We show that the safety checking problem can be solved in polynomial time based on this punctuation graph construct. In addition, various issues and challenges related to the safety checking of CJQs are highlighted.",

author = "Li, {Hua Gang} and Songting Chen and Junichi Tatemura and Divyakant Agrawal and Candan, {K. Sel{\c c}uk} and Hsiung, {Wang Pin}",

year = "2006",

language = "English (US)",

isbn = "1595933859",

series = "VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases",

publisher = "Association for Computing Machinery",

pages = "19--30",

booktitle = "VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases",

note = "32nd International Conference on Very Large Data Bases, VLDB 2006 ; Conference date: 12-09-2006 Through 15-09-2006",

}

TY - GEN

T1 - Safety Guarantee of continuous join queries over punctuated data streams

AU - Li, Hua Gang

AU - Chen, Songting

AU - Tatemura, Junichi

AU - Agrawal, Divyakant

AU - Candan, K. Selçuk

AU - Hsiung, Wang Pin

PY - 2006

Y1 - 2006

N2 - Continuous join queries (CJQs) are needed for correlating data from multiple streams. One fundamental problem in processing such queries is that, because the data streams are infinite, the join operator would have to store unbounded state and would eventually run out of space. Punctuation semantics has been proposed specifically to address this problem. In particular, punctuations explicitly mark the end of a subset of data and hence enable purging of stored data that will not contribute to any new query results. Given a set of available punctuation schemes, if one can identify that a CJQ still requires unbounded storage, then this query can be flagged as unsafe and prevented from running. Unfortunately, while punctuation semantics is clearly useful, the mechanisms to identify if and how a particular CJQ could benefit from a given set of punctuation schemes are not yet known. In this paper, we provide necessary and sufficient conditions for checking whether a CJQ can be safely executed under a given set of punctuation schemes. In particular, we introduce a novel punctuation graph to aid the safety analysis of a given query. We show that the safety checking problem can be solved in polynomial time based on this punctuation graph construct. In addition, various issues and challenges related to the safety checking of CJQs are highlighted.

AB - Continuous join queries (CJQs) are needed for correlating data from multiple streams. One fundamental problem in processing such queries is that, because the data streams are infinite, the join operator would have to store unbounded state and would eventually run out of space. Punctuation semantics has been proposed specifically to address this problem. In particular, punctuations explicitly mark the end of a subset of data and hence enable purging of stored data that will not contribute to any new query results. Given a set of available punctuation schemes, if one can identify that a CJQ still requires unbounded storage, then this query can be flagged as unsafe and prevented from running. Unfortunately, while punctuation semantics is clearly useful, the mechanisms to identify if and how a particular CJQ could benefit from a given set of punctuation schemes are not yet known. In this paper, we provide necessary and sufficient conditions for checking whether a CJQ can be safely executed under a given set of punctuation schemes. In particular, we introduce a novel punctuation graph to aid the safety analysis of a given query. We show that the safety checking problem can be solved in polynomial time based on this punctuation graph construct. In addition, various issues and challenges related to the safety checking of CJQs are highlighted.

UR - http://www.scopus.com/inward/record.url?scp=34547990984&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34547990984&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:34547990984

SN - 1595933859

SN - 9781595933850

T3 - VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases

SP - 19

EP - 30

BT - VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases

PB - Association for Computing Machinery

T2 - 32nd International Conference on Very Large Data Bases, VLDB 2006

Y2 - 12 September 2006 through 15 September 2006

ER -
