Client-side flash caching for cloud systems

Dulcardo Arteaga, Ming Zhao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

23 Citations (Scopus)

Abstract

As the size of cloud systems and the number of hosted VMs rapidly grow, the scalability of shared VM storage systems becomes a serious issue. Client-side flash-based caching has the potential to improve the performance of cloud VM storage by employing flash storage available on the client-side of the storage system to exploit the locality inherent in VM IOs. However, because of the limited capacity and durability of flash storage, it is important to determine the proper size and configuration of the flash caches used in cloud systems. This paper provides answers to the key design questions of cloud flash caching based on dm-cache, a block-level caching solution customized for cloud environments, and a large amount of long-term traces collected from real-world public and private clouds. The study first validates that cloud workloads have good cacheability and dm-cache-based flash caching incurs low overhead with respect to commodity flash devices. It further reveals that write-back caching substantially outperforms write-through caching in typical cloud environments due to the reduction of server IO load. It also shows that there is a tradeoff on making a flash cache persistent across client restarts which saves hours of cache warm-up time but incurs considerable overhead from committing every metadata update persistently. Finally, to reduce the data loss risk from using write-back caching, the paper proposes a new cache-optimized RAID technique, which minimizes the RAID overhead by introducing redundancy of cache dirty data only, and shows to be significantly faster than traditional RAID and write-through caching.

Original languageEnglish (US)
Title of host publicationProceedings of the 7th ACM International Systems and Storage Conference, SYSTOR 2014
PublisherAssociation for Computing Machinery, Inc
Pages4
Number of pages1
ISBN (Print)9781450329200
StatePublished - Jun 30 2014
Externally publishedYes
Event7th ACM International Systems and Storage Conference, SYSTOR 2014 - Haifa, Israel
Duration: Jun 10 2014Jun 12 2014

Other

Other7th ACM International Systems and Storage Conference, SYSTOR 2014
CountryIsrael
CityHaifa
Period6/10/146/12/14

Fingerprint

Metadata
Redundancy
Scalability
Durability
Servers

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Hardware and Architecture
  • Software
  • Computer Science Applications

Cite this

Arteaga, D., & Zhao, M. (2014). Client-side flash caching for cloud systems. In Proceedings of the 7th ACM International Systems and Storage Conference, SYSTOR 2014 (pp. 4). Association for Computing Machinery, Inc.

Client-side flash caching for cloud systems. / Arteaga, Dulcardo; Zhao, Ming.

Proceedings of the 7th ACM International Systems and Storage Conference, SYSTOR 2014. Association for Computing Machinery, Inc, 2014. p. 4.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Arteaga, D & Zhao, M 2014, Client-side flash caching for cloud systems. in Proceedings of the 7th ACM International Systems and Storage Conference, SYSTOR 2014. Association for Computing Machinery, Inc, pp. 4, 7th ACM International Systems and Storage Conference, SYSTOR 2014, Haifa, Israel, 6/10/14.
Arteaga D, Zhao M. Client-side flash caching for cloud systems. In Proceedings of the 7th ACM International Systems and Storage Conference, SYSTOR 2014. Association for Computing Machinery, Inc. 2014. p. 4
Arteaga, Dulcardo ; Zhao, Ming. / Client-side flash caching for cloud systems. Proceedings of the 7th ACM International Systems and Storage Conference, SYSTOR 2014. Association for Computing Machinery, Inc, 2014. pp. 4
@inproceedings{98a42a9e6e7a4ef9be12d21ff0daae41,
title = "Client-side flash caching for cloud systems",
abstract = "As the size of cloud systems and the number of hosted VMs rapidly grow, the scalability of shared VM storage systems becomes a serious issue. Client-side flash-based caching has the potential to improve the performance of cloud VM storage by employing flash storage available on the client-side of the storage system to exploit the locality inherent in VM IOs. However, because of the limited capacity and durability of flash storage, it is important to determine the proper size and configuration of the flash caches used in cloud systems. This paper provides answers to the key design questions of cloud flash caching based on dm-cache, a block-level caching solution customized for cloud environments, and a large amount of long-term traces collected from real-world public and private clouds. The study first validates that cloud workloads have good cacheability and dm-cache-based flash caching incurs low overhead with respect to commodity flash devices. It further reveals that write-back caching substantially outperforms write-through caching in typical cloud environments due to the reduction of server IO load. It also shows that there is a tradeoff on making a flash cache persistent across client restarts which saves hours of cache warm-up time but incurs considerable overhead from committing every metadata update persistently. Finally, to reduce the data loss risk from using write-back caching, the paper proposes a new cache-optimized RAID technique, which minimizes the RAID overhead by introducing redundancy of cache dirty data only, and shows to be significantly faster than traditional RAID and write-through caching.",
author = "Dulcardo Arteaga and Ming Zhao",
year = "2014",
month = "6",
day = "30",
language = "English (US)",
isbn = "9781450329200",
pages = "4",
booktitle = "Proceedings of the 7th ACM International Systems and Storage Conference, SYSTOR 2014",
publisher = "Association for Computing Machinery, Inc",

}

TY - GEN

T1 - Client-side flash caching for cloud systems

AU - Arteaga, Dulcardo

AU - Zhao, Ming

PY - 2014/6/30

Y1 - 2014/6/30

N2 - As the size of cloud systems and the number of hosted VMs rapidly grow, the scalability of shared VM storage systems becomes a serious issue. Client-side flash-based caching has the potential to improve the performance of cloud VM storage by employing flash storage available on the client-side of the storage system to exploit the locality inherent in VM IOs. However, because of the limited capacity and durability of flash storage, it is important to determine the proper size and configuration of the flash caches used in cloud systems. This paper provides answers to the key design questions of cloud flash caching based on dm-cache, a block-level caching solution customized for cloud environments, and a large amount of long-term traces collected from real-world public and private clouds. The study first validates that cloud workloads have good cacheability and dm-cache-based flash caching incurs low overhead with respect to commodity flash devices. It further reveals that write-back caching substantially outperforms write-through caching in typical cloud environments due to the reduction of server IO load. It also shows that there is a tradeoff on making a flash cache persistent across client restarts which saves hours of cache warm-up time but incurs considerable overhead from committing every metadata update persistently. Finally, to reduce the data loss risk from using write-back caching, the paper proposes a new cache-optimized RAID technique, which minimizes the RAID overhead by introducing redundancy of cache dirty data only, and shows to be significantly faster than traditional RAID and write-through caching.

AB - As the size of cloud systems and the number of hosted VMs rapidly grow, the scalability of shared VM storage systems becomes a serious issue. Client-side flash-based caching has the potential to improve the performance of cloud VM storage by employing flash storage available on the client-side of the storage system to exploit the locality inherent in VM IOs. However, because of the limited capacity and durability of flash storage, it is important to determine the proper size and configuration of the flash caches used in cloud systems. This paper provides answers to the key design questions of cloud flash caching based on dm-cache, a block-level caching solution customized for cloud environments, and a large amount of long-term traces collected from real-world public and private clouds. The study first validates that cloud workloads have good cacheability and dm-cache-based flash caching incurs low overhead with respect to commodity flash devices. It further reveals that write-back caching substantially outperforms write-through caching in typical cloud environments due to the reduction of server IO load. It also shows that there is a tradeoff on making a flash cache persistent across client restarts which saves hours of cache warm-up time but incurs considerable overhead from committing every metadata update persistently. Finally, to reduce the data loss risk from using write-back caching, the paper proposes a new cache-optimized RAID technique, which minimizes the RAID overhead by introducing redundancy of cache dirty data only, and shows to be significantly faster than traditional RAID and write-through caching.

UR - http://www.scopus.com/inward/record.url?scp=84945521760&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84945521760&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9781450329200

SP - 4

BT - Proceedings of the 7th ACM International Systems and Storage Conference, SYSTOR 2014

PB - Association for Computing Machinery, Inc

ER -