JIM Job Submission For Monte Carlo Requests

Contents

Introduction

This note describes JIM job submission for D0 Monte Carlo requests to Samgrid enabled execution sites using the runjob job manager. Other sites using the mcfarm job manager have a similar procedure, but in general requires prestaging of run files which involves coordination with the site managers. A prime resource regarding Samgrid can be found in the
Samgrid Manual.

A job submitted via JIM takes the path schematically illustrated below.

Jim_client site ==> Scheduler site ==> Execution site
The user submits the job from the client site to the scheduler site where the job is prepared. The scheduler sends the job to the execution site where it is run. A scheduler site, also called a submission site, maintains the queue of grid jobs, mediates the interactions of the user with the execution site and acts on behalf of the user for job management.

Prerequisites

Monte Carlo Job Submission

The submitter of a Monte Carlo request to a Samgrid/runjob site will:
  1. Reserve a request via the Queue.py procedure
  2. Define a jobfile dataset in SAM consisting of:
    • D0RunII release tarball
    • Magnetic field tarball
    • Mc_runjob tarball
    • Cardfile tarball (if generator phase required)
  3. Prepare a montecarlo Job Description File (JDF)
  4. Submit the montecarlo JDF to SamGrid
  5. Define a sam dataset definition that identifies the files to be merged. (Optional, not needed if first merge of request)
  6. Prepare a merge JDF
  7. Submit the merge JDF to SamGrid upon completion of the montecarlo job
  8. Mark the request as finished upon completion of the merge job

Job Description File

The JDF consists of attribute = value pairs. The complete list of attributes is available in the
Samgrid Manual. Critical attributes are those the user would be interested in modifying while boilerplate attributes are usually standard and not changed. The JDF critical attributes are:

MC Request sg/runjob montecarlo type JDF Example

executable = /bin/ls
sam_experiment = d0
sam_universe = prd
group = mcc99
instances = 1
check_consistency = true
job_type = dzero_monte_carlo
station_name = manhep
runjob_requestid = 13669
runjob_numevts = 20000
d0_release_version= p14.07.00
jobfiles_dataset = jms_set14
minbias_dataset = overlapset_mcp14_cteq5l-tuneA_simulated
+notify_user = you_the_user@fnal.gov
The attributes in red are critical or should be checked by the user.

Needed JIM Submission Info

JIM Merge Jobs

Monte Carlo production jobs store thumbnails in SAM. The Merge jobtype merges the thumbnails belonging to the request and deletes the individual subjob thumbnails. The merge job may execute on any site, not just the one the files were produced on.

Merge JDF differences

Upon completion of the merge job the request is marked finished with a command like:
$ sam modify request --request-id=13669 --status=finished

JIM Structured Jobs

A structured job can be used to combine dzero_monte_carlo and dzero_merge job types into one job. JDF attributes are like those for the dzero_monte_carlo job with the differences noted below.

Structured JDF attribute differences

JIM Monitoring JIM McFarm Submission The submitter of a Monte Carlo request to a Samgrid/mcfarm site will:
  1. Reserve a request via the Queue.py procedure
  2. Prepare a montecarlo Job Description File
    • job_type = mcfarm
    • No minbias_dataset, jobfiles_dataset, phase_dataset attributes
    • These are handled by McFarm, but must be prestaged by farmer
  3. Submit the JDF to SamGrid
  4. Merging is done by McFarm
  5. Mark the request as finished upon completion of the job


Version 00.00.04
Joel Snow
Created June 21, 2004
Revised September 19, 2004