TY - GEN
T1 - On the design and implementation of a simulator for parallel file system research
AU - Liu, Yonggang
AU - Figueiredo, Renato
AU - Xu, Yiqi
AU - Zhao, Ming
PY - 2013/9/2
Y1 - 2013/9/2
N2 - Due to the popularity and importance of Parallel File Systems (PFSs) in modern High Performance Computing (HPC) centers, PFS designs and I/O optimizations are active research topics. However, the research process is often time-consuming and faces cost and complexity challenges in deploying experiments in real HPC systems. This paper describes PFSsim, a trace-driven simulator of distributed storage systems that allows the evaluation of PFS designs, I/O schedulers, network structures, and workloads. PFSsim differentiates itself from related work in that it provides a powerful platform featuring a modular design with high flexibility in the modeling of subsystems including the network, clients, data servers and I/O schedulers. It does so by designing the simulator to capture abstractions found in common PFSs. PFSsim also exposes script-based interfaces for detailed configurations. Experiments and validation against real systems considering sub-modules and the entire simulator show that PFSsim is capable of simulating a representative PFS (PVFS2) and of modeling different I/O scheduler algorithms with good fidelity. In addition, the simulation speed is also shown to be acceptable.
AB - Due to the popularity and importance of Parallel File Systems (PFSs) in modern High Performance Computing (HPC) centers, PFS designs and I/O optimizations are active research topics. However, the research process is often time-consuming and faces cost and complexity challenges in deploying experiments in real HPC systems. This paper describes PFSsim, a trace-driven simulator of distributed storage systems that allows the evaluation of PFS designs, I/O schedulers, network structures, and workloads. PFSsim differentiates itself from related work in that it provides a powerful platform featuring a modular design with high flexibility in the modeling of subsystems including the network, clients, data servers and I/O schedulers. It does so by designing the simulator to capture abstractions found in common PFSs. PFSsim also exposes script-based interfaces for detailed configurations. Experiments and validation against real systems considering sub-modules and the entire simulator show that PFSsim is capable of simulating a representative PFS (PVFS2) and of modeling different I/O scheduler algorithms with good fidelity. In addition, the simulation speed is also shown to be acceptable.
KW - I/O scheduling
KW - parallel file system
KW - simulation
UR - http://www.scopus.com/inward/record.url?scp=84883072746&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84883072746&partnerID=8YFLogxK
U2 - 10.1109/MSST.2013.6558438
DO - 10.1109/MSST.2013.6558438
M3 - Conference contribution
AN - SCOPUS:84883072746
SN - 9781479902170
T3 - IEEE Symposium on Mass Storage Systems and Technologies
BT - 2013 IEEE 29th Symposium on Mass Storage Systems and Technologies, MSST 2013
T2 - 2013 IEEE 29th Symposium on Mass Storage Systems and Technologies, MSST 2013
Y2 - 6 May 2013 through 10 May 2013
ER -