Fermilab Computing Division

CS Document 2058-v1

The CMS Dataset Bookkeeping Service

Document #:
Document type:
Submitted by:
Lee Lueking
Updated by:
Lee Lueking
Document Created:
27 Mar 2007, 13:54
Contents Revised:
27 Mar 2007, 13:54
Metadata Revised:
27 Mar 2007, 13:58
Viewable by:
  • Public document
Modifiable by:

Quick Links:
Latest Version

The CMS Dataset Bookkeeping Service (DBS) has been developed to catalog all CMS
event data from Monte Carlo and Detector sources. It includes the ability to identify MC
or trigger source, track data provenance, construct datasets for analysis, and discover
interesting data. CMS requires processing and analysis activities at various service levels
and the system provides support for localized processing or private analysis, as well as
global access for CMS users at large. Catalog entries can be moved among the various
service levels with a simple set of migration tools, thus forming a loose federation of
databases. DBS is available to CMS users via a Python API, Command Line, and a
Discovery web page interfaces. The system is built as a multi-tier web application with
Java servlets running under Tomcat, with connecting via JDBC to Oracle or MySQL
database backends. Clients connect to the service through HTTP or HTTPs with
authentication provided by GRID certificates and authorization through VOMS. DBS is
an integral part of the overall CMS Data Management and Workflow Management
systems. The system has been in operation since March 2007, an overview of the schema,
functionality, deployment details, operational statistics and experience will be presented.
Files in Document:
CMS Data Bookkeeping
DocDB Home ]  [ Search ] [ Authors ] [ Events ] [ Topics ]

DocDB Version 8.8.9, contact Document Database Administrators