Flexible-matching data-comparison tool with the cloud applications

Wei Tek Tsai, Xin Sun, Qihong Shao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Cloud computing provides an infrastructure that enables rapid delivery of computing resources in a scalable virtualized manner. Software residing on the clouds, often called Software-a-a-Service (SaaS), is exposed to end users as web applications in browsers or apps on thin clients such as mobile devices and netbooks. Data on clouds are often huge in size, and they are processed in the MapReduce manner in real time. A data differencer is needed to compare the data for regression testing and data filtering. However, traditional data comparison techniques for relational or semi-structured data such as XML files do not take unique issues brought by cloud computing into account. This paper presents a fine-grained flexible-matching data comparison algorithm that can compare encoded data according to users' specific interests and visualize differences in a user friendly manner. This paper uses a weather forecasting problem to illustrate the algorithm by running it on a cloud environment using Protocol Buffer data format. The experiment results show that the proposed algorithm is effective and efficient in identifying data differences in real time on the GAE.

Original languageEnglish (US)
Title of host publicationProceedings - 5th IEEE International Symposium on Service-Oriented System Engineering, SOSE 2010
Pages95-102
Number of pages8
DOIs
StatePublished - Nov 23 2010
Event5th IEEE International Symposium on Service-Oriented System Engineering, SOSE 2010 - Nanjing, China
Duration: Jun 4 2010Jun 5 2010

Publication series

NameProceedings - 5th IEEE International Symposium on Service-Oriented System Engineering, SOSE 2010

Other

Other5th IEEE International Symposium on Service-Oriented System Engineering, SOSE 2010
Country/TerritoryChina
CityNanjing
Period6/4/106/5/10

Keywords

  • Cloud
  • Cloud computing
  • Data comparison
  • Protocolbuffer
  • Real-time computing

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Networks and Communications
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Flexible-matching data-comparison tool with the cloud applications'. Together they form a unique fingerprint.

Cite this