Region based fault-tolerant distributed file storage system design under budget constraint

Anisha Mazumder, Arun Das, Chenyang Zhou, Arunabha Sen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Scopus citations

Abstract

Two independent lines of research, (i) erasure code based file storage system design, and (ii) fault-tolerant network design for spatially correlated (or region-based) failures, have received considerable attention in the networking research community in recent times. A recently proposed (N,K)-coding based distributed file storage scheme ensures complete reconstruction of a file after network fragmentation due to any single region-based fault. For every region of the network, it stores K distinct file segments in one of the largest connected component that results from the fragmentation of the network due to the failure of a region. This distribution scheme provides an all-region fault-tolerant storage system, in the sense that no matter which region of the network fails, a largest connected component of the fragmented network will still have enough distinct file segments with which to reconstruct the file. However, the storage requirement and the associated cost for such an all-region-fault-tolerant storage system may be quite high. As such, with a limited budget it may not be possible to realize such an all-region fault-tolerant storage system. We consider a budget constrained distributed file system design problem and provide solutions that maximizes the number of regions that can be made fault-tolerant, within the specified budget. We show that the problem is NP-complete, and provide an approximation algorithm for the problem. The performance of the approximation algorithm is evaluated through simulation on two real networks. The simulation results demonstrate that the worst case experimental performance is significantly better than the worst case theoretical bound. Moreover, the approximation algorithm almost always produce near optimal solution in a fraction of time needed to find the optimal solution.

Original languageEnglish (US)
Title of host publicationProceedings of 2014 6th International Workshop on Reliable Networks Design and Modeling, RNDM 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages61-68
Number of pages8
ISBN (Print)9781479970407
DOIs
StatePublished - Jan 19 2014
Event6th International Workshop on Reliable Networks Design and Modeling, RNDM 2014 - Barcelon, Spain
Duration: Nov 17 2014Nov 19 2014

Other

Other6th International Workshop on Reliable Networks Design and Modeling, RNDM 2014
Country/TerritorySpain
CityBarcelon
Period11/17/1411/19/14

Keywords

  • (N,K) coding
  • approximation algorithm
  • budget
  • distributed data storage
  • region-based faults

ASJC Scopus subject areas

  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Region based fault-tolerant distributed file storage system design under budget constraint'. Together they form a unique fingerprint.

Cite this