Optimal synchronization policies for data warehouses

Debabrata Dey, Zhongju Zhang, Prabuddha De

Research output: Contribution to journalArticle

14 Citations (Scopus)

Abstract

The notion of a data warehouse for integrating operational data into a single repository is rapidly becoming popular in modern organizations. An important issue in this context is how often one should synchronize the data warehouse to reflect the changes in the constituent operational data sources. If the synchronization is performed very frequently, the associated cost might be quite high, although the data warehouse would only have a small amount of stale data. On the other hand, if the data warehouse is synchronized infrequently, it might result in costly errors in business decisions arising from the stale data. This paper examines the trade-off between the synchronization and staleness costs and derives the optimal synchronization frequency.

Original languageEnglish (US)
Pages (from-to)229-242
Number of pages14
JournalINFORMS Journal on Computing
Volume18
Issue number2
DOIs
StatePublished - 2006
Externally publishedYes

Fingerprint

Data warehouses
Synchronization
Costs
Data warehouse
Industry

Keywords

  • Data warehouse
  • Stateness cost
  • Synchronization

ASJC Scopus subject areas

  • Computer Science Applications
  • Computational Theory and Mathematics
  • Management Science and Operations Research

Cite this

Optimal synchronization policies for data warehouses. / Dey, Debabrata; Zhang, Zhongju; De, Prabuddha.

In: INFORMS Journal on Computing, Vol. 18, No. 2, 2006, p. 229-242.

Research output: Contribution to journalArticle

Dey, Debabrata ; Zhang, Zhongju ; De, Prabuddha. / Optimal synchronization policies for data warehouses. In: INFORMS Journal on Computing. 2006 ; Vol. 18, No. 2. pp. 229-242.
@article{a7a065f4b43b429d951189882cea91c2,
title = "Optimal synchronization policies for data warehouses",
abstract = "The notion of a data warehouse for integrating operational data into a single repository is rapidly becoming popular in modern organizations. An important issue in this context is how often one should synchronize the data warehouse to reflect the changes in the constituent operational data sources. If the synchronization is performed very frequently, the associated cost might be quite high, although the data warehouse would only have a small amount of stale data. On the other hand, if the data warehouse is synchronized infrequently, it might result in costly errors in business decisions arising from the stale data. This paper examines the trade-off between the synchronization and staleness costs and derives the optimal synchronization frequency.",
keywords = "Data warehouse, Stateness cost, Synchronization",
author = "Debabrata Dey and Zhongju Zhang and Prabuddha De",
year = "2006",
doi = "10.1287/ijoc.1040.0094",
language = "English (US)",
volume = "18",
pages = "229--242",
journal = "INFORMS Journal on Computing",
issn = "1091-9856",
publisher = "INFORMS Inst.for Operations Res.and the Management Sciences",
number = "2",

}

TY - JOUR

T1 - Optimal synchronization policies for data warehouses

AU - Dey, Debabrata

AU - Zhang, Zhongju

AU - De, Prabuddha

PY - 2006

Y1 - 2006

N2 - The notion of a data warehouse for integrating operational data into a single repository is rapidly becoming popular in modern organizations. An important issue in this context is how often one should synchronize the data warehouse to reflect the changes in the constituent operational data sources. If the synchronization is performed very frequently, the associated cost might be quite high, although the data warehouse would only have a small amount of stale data. On the other hand, if the data warehouse is synchronized infrequently, it might result in costly errors in business decisions arising from the stale data. This paper examines the trade-off between the synchronization and staleness costs and derives the optimal synchronization frequency.

AB - The notion of a data warehouse for integrating operational data into a single repository is rapidly becoming popular in modern organizations. An important issue in this context is how often one should synchronize the data warehouse to reflect the changes in the constituent operational data sources. If the synchronization is performed very frequently, the associated cost might be quite high, although the data warehouse would only have a small amount of stale data. On the other hand, if the data warehouse is synchronized infrequently, it might result in costly errors in business decisions arising from the stale data. This paper examines the trade-off between the synchronization and staleness costs and derives the optimal synchronization frequency.

KW - Data warehouse

KW - Stateness cost

KW - Synchronization

UR - http://www.scopus.com/inward/record.url?scp=33745711703&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33745711703&partnerID=8YFLogxK

U2 - 10.1287/ijoc.1040.0094

DO - 10.1287/ijoc.1040.0094

M3 - Article

AN - SCOPUS:33745711703

VL - 18

SP - 229

EP - 242

JO - INFORMS Journal on Computing

JF - INFORMS Journal on Computing

SN - 1091-9856

IS - 2

ER -