Fermilab Computing Division

CS Document 4455-v5

Investigation of Storage Systems for use in Grid Applications

Document #:
Document type:
Submitted by:
Gabriele Garzoglio
Updated by:
Gabriele Garzoglio
Document Created:
07 Oct 2011, 10:02
Contents Revised:
19 Jul 2012, 12:09
Metadata Revised:
19 Jul 2012, 12:09
Viewable by:
  • Public document
Modifiable by:

Quick Links:
Latest Version

Other Versions:
01 Jun 2012, 14:20
14 May 2012, 13:18
11 May 2012, 10:37
11 May 2012, 10:12
07 Oct 2011, 10:02
In recent years, several new storage technologies, such as Lustre, Hadoop, OrangeFS, and BlueArc, have emerged. While several groups have run benchmarks to characterize them under a variety of configurations, more work is needed to evaluate these technologies for the use cases of scientific computing on Grid clusters and Cloud facilities. This paper discusses our evaluation of the technologies as deployed on a test bed at FermiCloud, one of the Fermilab infrastructure-as-a-service Cloud facilities. The test bed consists of 4 server-class nodes with 40 TB of disk space and up to 50 virtual machine clients, some running on the storage server nodes themselves. With this configuration, the evaluation compares the performance of some of these technologies when deployed on virtual machines and on "bare metal" nodes. In addition to running standard benchmarks such as IOZone to check the sanity of our installation, we have run I/O intensive tests using physics-analysis applications. This paper presents how the storage solutions perform in a variety of realistic use cases of scientific computing. One interesting difference among the storage systems tested is found in a decrease in total read throughput with increasing number of client processes, which occurs in some implementations but not others.
Associated with Events:
CHEP 2012 held on 21 May 2012 in New York, New York
DocDB Home ]  [ Search ] [ Authors ] [ Events ] [ Topics ]

DocDB Version 8.8.9, contact Document Database Administrators