TY - GEN
T1 - Virtualization-based bandwidth management for parallel storage systems
AU - Xu, Yiqi
AU - Wang, Lixi
AU - Arteaga, Dulcardo
AU - Zhao, Ming
AU - Liu, Yonggang
AU - Figueiredo, Renato
PY - 2010
Y1 - 2010
N2 - This paper presents a new parallel storage management approach which supports the allocation of shared storage bandwidth on a per-application basis. Existing parallel storage systems are unable to differentiate I/Os from different applications and meet per-application bandwidth requirement. This limitation presents a hurdle for applications to achieve their desired performance, which will become even more challenging as high-performance computing (HPC) systems continue to scale up with respect to both the amount of available resources and the number of concurrent applications. This paper proposes a novel solution to address this challenge through the virtualization of parallel file systems (PFSes). Such PFS virtualization is achieved with user-level PFS proxies, which interpose between native PFS clients and servers and schedule the I/Os from different applications according to the resource sharing algorithm (e.g., SFQ(D)). In this way, virtual PFSes can be created on a perapplication basis, each with a specific bandwidth share allocated according to its I/O requirement. This approach is applicable to different PFS-based parallel storage systems and can be transparently integrated with existing as well as future HPC systems. A prototype of this approach is implemented upon PVFS2, a widely used PFS, and evaluated with experiments using a typical parallel I/O benchmark (IOR). Results show that this approach's overhead is very small and it achieves effective proportional sharing under different usage scenarios.
AB - This paper presents a new parallel storage management approach which supports the allocation of shared storage bandwidth on a per-application basis. Existing parallel storage systems are unable to differentiate I/Os from different applications and meet per-application bandwidth requirement. This limitation presents a hurdle for applications to achieve their desired performance, which will become even more challenging as high-performance computing (HPC) systems continue to scale up with respect to both the amount of available resources and the number of concurrent applications. This paper proposes a novel solution to address this challenge through the virtualization of parallel file systems (PFSes). Such PFS virtualization is achieved with user-level PFS proxies, which interpose between native PFS clients and servers and schedule the I/Os from different applications according to the resource sharing algorithm (e.g., SFQ(D)). In this way, virtual PFSes can be created on a perapplication basis, each with a specific bandwidth share allocated according to its I/O requirement. This approach is applicable to different PFS-based parallel storage systems and can be transparently integrated with existing as well as future HPC systems. A prototype of this approach is implemented upon PVFS2, a widely used PFS, and evaluated with experiments using a typical parallel I/O benchmark (IOR). Results show that this approach's overhead is very small and it achieves effective proportional sharing under different usage scenarios.
UR - http://www.scopus.com/inward/record.url?scp=78651381106&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78651381106&partnerID=8YFLogxK
U2 - 10.1109/PDSW.2010.5668094
DO - 10.1109/PDSW.2010.5668094
M3 - Conference contribution
AN - SCOPUS:78651381106
SN - 9781424489121
T3 - Proceedings of the 2010 5th Petascale Data Storage Workshop, PDSW'10, Held in Conjunction with SC'10 - The International Conference for High Performance Computing, Networking, Storage and Analysis
BT - Proceedings of the 2010 5th Petascale Data Storage Workshop, PDSW'10, Held in Conjunction with SC'10 - The International Conference for High Performance Computing, Networking, Storage and Analysis
T2 - 2010 5th Petascale Data Storage Workshop, PDSW'10, Held in Conjunction with SC'10 - The International Conference for High Performance Computing, Networking, Storage and Analysis
Y2 - 15 November 2010 through 15 November 2010
ER -