Fermilab Computing Division

CS Document 4275-v3

FermiGrid Scalability and Reliability Improvements

Document #:
CS-doc-4275-v3
Document type:
Presentation
Submitted by:
Keith Chadwick
Updated by:
Keith Chadwick
Document Created:
19 Mar 2011, 02:42
Contents Revised:
22 Mar 2011, 01:22
Metadata Revised:
22 Mar 2011, 01:22
Viewable by:
  • Public document
Modifiable by:

Quick Links:
Latest Version

Other Versions:
CS-doc-4275-v2
19 Mar 2011, 02:50
CS-doc-4275-v1
19 Mar 2011, 02:42
Abstract:
The Fermilab Campus Grid (FermiGrid)is a meta-facility that provides grid infrastructure for scientific computing at Fermilab. It provides highly available centralized authorization and authentication services, a site portal for Globus job submission, coordination for interoperability among the various stakeholders, and grid-enabled mass storage interfaces. We currently support approximately 25000 batch processing slots. This paper will describe the current structure of FermiGrid and recent improvements in scalability and reliability of our authorization and authentication services. These improvements include orders of magnitude improvement in our web services based Site AuthoriZation service (SAZ). We will also describe recent enhancements to the information system and matchmaking algorithm of our site job gateway. Finally we will describe the FermiGrid HA-2 project currently under way which distributes our services across two buildings, making us resilient in the case of major building outages.
Primary authors : Dr. CHADWICK, Keith (Fermi National Accelerator Laboratory)
Co-authors : Dr. TIMM, Steven (Fermi National Accelerator Laboratory) ; Mr. YOCUM, Dan (Fermi National Accelerator Laboratory) ; Mrs. SHARMA, Neha (Fermi National Accelerator Laboratory) ; Mr. LOWE, Faarooq (Fermi National Accelerator Laboratory)
Files in Document:
DocDB Home ]  [ Search ] [ Authors ] [ Events ] [ Topics ]

DocDB Version 8.8.9, contact Document Database Administrators