Fermilab Computing Division

CS Document 4045-v1

Root Cause Analysis for FermiGrid Cluster disruption - PBI0166

Document #:
CS-doc-4045-v1
Document type:
Documentation
Submitted by:
Tim Doody
Updated by:
Tim Doody
Document Created:
13 Aug 2010, 08:33
Contents Revised:
13 Aug 2010, 14:24
Metadata Revised:
13 Aug 2010, 14:24
Viewable by:
  • Public document
Modifiable by:

Quick Links:
Latest Version

Other Versions:
CS-doc-4045-v0
13 Aug 2010, 13:21
Abstract:
After a data migration on our BlueArc network storage servers, several of our FermiGrid clusters experienced problems with jobs accessing a designated data area. Current jobs / process ended up with NFS stale file handles.
Files in Document:
Authors:
DocDB Home ]  [ Search ] [ Authors ] [ Events ] [ Topics ]

DocDB Version 8.8.9, contact Document Database Administrators