Fermilab Computing Division

CS Document 4323-v1

FermiGrid Scalability and Reliability Improvements

Document #:
CS-doc-4323-v1
Document type:
Presentation
Submitted by:
Keith Chadwick
Updated by:
Keith Chadwick
Document Created:
02 May 2011, 07:03
Contents Revised:
02 May 2011, 07:03
Metadata Revised:
02 May 2011, 07:03
Viewable by:
  • Public document
Modifiable by:

Quick Links:
Latest Version

Abstract:
The Fermilab Campus Grid (FermiGrid) is a meta-facility that provides grid infrastructure for scientific computing at Fermilab. It provides highly available centralized authorization and authentication services, a site portal for Globus job submission, coordination for interoperability among the various stakeholders, and grid-enabled mass storage interfaces. We currently support approximately 25000 batch processing slots. This presentation will describe the current structure of FermiGrid and recent improvements in scalability and reliability of our authorization and authentication services. These improvements include orders of magnitude improvement in our web services based Site AuthoriZation service (SAZ). We will also describe recent enhancements to the information system and matchmaking algorithm of our site job gateway. Finally we will describe the FermiGrid HA2 project currently under way which distributes our services across two buildings, making us resilient in the case of major building outages.
Files in Document:
DocDB Home ]  [ Search ] [ Authors ] [ Events ] [ Topics ]

DocDB Version 8.8.9, contact Document Database Administrators