CacheDedup: In-line deduplication for flash caching

Wenji Li, Gregory Jean-Baptise, Juan Riveros, Giri Narasimhan, Tong Zhang, Ming Zhao

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

30 Scopus citations

Abstract

Flash caching has emerged as a promising solution to the scalability problems of storage systems by using fast flash memory devices as the cache for slower primary storage. But its adoption faces serious obstacles due to the limited capacity and endurance of flash devices. This paper presents CacheDedup, a solution that addresses these limitations using in-line deduplication. First, it proposes a novel architecture that integrates the caching of data and deduplication metadata (source addresses and fingerprints of the data) and efficiently manages these two components. Second, it proposes duplication-aware cache replacement algorithms (D-LRU, DARC) to optimize both cache performance and endurance. The paper presents a rigorous analysis of the algorithms to prove that they do not waste valuable cache space and that they are efficient in time and space usage. The paper also includes an experimental evaluation using real-world traces, which confirms that CacheDedup substantially improves I/O performance (up to 20% reduction in miss ratio and 51% in latency) and flash endurance (up to 89% reduction in writes sent to the cache device) compared to traditional cache management. It also shows that the proposed architecture and algorithms can be extended to support the combination of compression and deduplication for flash caching and improve its performance and endurance.
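
The integration of a metadata cache (source address to fingerprint) with a data cache (fingerprint to block) can be illustrated with a minimal sketch. It is not the paper's D-LRU or DARC algorithm: the class name `DedupCache`, the use of SHA-1 as the fingerprint, the plain LRU policy on both maps, and the `flash_writes` counter are all assumptions made for illustration. The sketch also does not reclaim data blocks that no cached address refers to any longer, which is the kind of wasted space the paper's algorithms are stated to avoid.

```python
import hashlib
from collections import OrderedDict


class DedupCache:
    """Toy deduplicating cache built from two LRU maps (hypothetical, not D-LRU/DARC)."""

    def __init__(self, metadata_capacity, data_capacity):
        self.metadata_capacity = metadata_capacity
        self.data_capacity = data_capacity
        self.metadata = OrderedDict()  # source address -> fingerprint
        self.data = OrderedDict()      # fingerprint -> block contents
        self.flash_writes = 0          # blocks actually written to the data cache

    @staticmethod
    def fingerprint(block):
        # Assumption: SHA-1 of the block contents serves as the fingerprint.
        return hashlib.sha1(block).hexdigest()

    def write(self, address, block):
        fp = self.fingerprint(block)
        # Record (or refresh) the address-to-fingerprint mapping.
        self.metadata[address] = fp
        self.metadata.move_to_end(address)
        if len(self.metadata) > self.metadata_capacity:
            self.metadata.popitem(last=False)  # evict least recently used address
        if fp in self.data:
            # Duplicate content: refresh recency, no new write to the cache device.
            self.data.move_to_end(fp)
        else:
            self.data[fp] = block
            self.flash_writes += 1
            if len(self.data) > self.data_capacity:
                self.data.popitem(last=False)  # evict least recently used block

    def read(self, address):
        fp = self.metadata.get(address)
        if fp is None or fp not in self.data:
            return None  # cache miss
        self.metadata.move_to_end(address)
        self.data.move_to_end(fp)
        return self.data[fp]
```

In this sketch, writing the same 4 KB block to two different addresses increments `flash_writes` only once, which is the effect that reduces writes sent to the cache device.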

Original language: English (US)
Title of host publication: Proceedings of the 14th USENIX Conference on File and Storage Technologies, FAST 2016
Publisher: USENIX Association
Pages: 301-314
Number of pages: 14
ISBN (Electronic): 9781931971287
State: Published - Jan 1 2019
Event: 14th USENIX Conference on File and Storage Technologies, FAST 2016 - Santa Clara, United States
Duration: Feb 22 2016 → Feb 25 2016

Publication series

Name: Proceedings of the 14th USENIX Conference on File and Storage Technologies, FAST 2016

Conference

Conference: 14th USENIX Conference on File and Storage Technologies, FAST 2016
Country: United States
City: Santa Clara
Period: 2/22/16 → 2/25/16

ASJC Scopus subject areas

  • Hardware and Architecture
  • Software
  • Computer Networks and Communications

Cite this

Li, W., Jean-Baptise, G., Riveros, J., Narasimhan, G., Zhang, T., & Zhao, M. (2019). CacheDedup: In-line deduplication for flash caching. In Proceedings of the 14th USENIX Conference on File and Storage Technologies, FAST 2016 (pp. 301-314). USENIX Association.